All posts

What GDPR Means for Non-Human Identities

How can you prove that a service account complied with GDPR when it never logs its actions? Why GDPR cares about non‑human identities GDPR treats any processing of personal data as a responsibility that must be demonstrable. The regulation does not draw a line between a human operator and a piece of software that accesses a database. When an automated job, an AI‑driven assistant, or a CI/CD pipeline reads or modifies records that contain personal information, the controller must be able to an

Free White Paper

Non-Human Identity Management + Managed Identities: The Complete Guide

Architecture patterns, implementation strategies, and security best practices. Delivered to your inbox.

Free. No spam. Unsubscribe anytime.

How can you prove that a service account complied with GDPR when it never logs its actions?

Why GDPR cares about non‑human identities

GDPR treats any processing of personal data as a responsibility that must be demonstrable. The regulation does not draw a line between a human operator and a piece of software that accesses a database. When an automated job, an AI‑driven assistant, or a CI/CD pipeline reads or modifies records that contain personal information, the controller must be able to answer three questions:

  • Who performed the operation?
  • What data was accessed or changed?
  • Was the access justified under the lawful basis for processing?

Regulators expect a continuous audit trail, evidence that data minimisation was enforced, and proof that any risky operation received appropriate oversight. In practice, many organisations rely on static service accounts that have wide‑ranging permissions and that never produce per‑request logs. That approach leaves a blind spot in the GDPR evidence chain.

The reality on today’s pipelines

Most teams provision a long‑lived credential for a service account, store the secret in a vault or embed it in a CI configuration, and grant the account broad read/write rights on a database. The job runs, the database returns rows, and the pipeline proceeds without any intermediate checkpoint. The result is a system that works, but it provides no granular record of which identity accessed which row, no way to mask personally identifiable fields before they leave the database, and no approval step for operations that could affect data subjects.

This model satisfies the functional need to move data, yet it provides no granular record of which identity accessed which row, no way to mask personally identifiable fields before they leave the database, and no approval step for operations that could affect data subjects.

Adding non‑human identities is not enough

Introducing a dedicated service account or an AI‑agent identity improves credential hygiene, but the request still travels directly to the target system. The gateway that sits between the identity and the resource is missing, so the following gaps remain:

  • No unified audit that records every command with the identity that issued it.
  • No inline data masking that strips or redacts personal fields before they are sent downstream.
  • No just‑in‑time approval workflow for high‑risk queries.
  • No replay capability for forensic investigations.

In other words, the core GDPR requirements, accountability, data minimisation, and oversight, are still unaddressed. The missing piece is a layer that can observe, control, and record traffic at the protocol level.

Continue reading? Get the full guide.

Non-Human Identity Management + Managed Identities: Architecture Patterns & Best Practices

Free. No spam. Unsubscribe anytime.

hoop.dev as the GDPR‑ready data path

hoop.dev provides exactly that layer. It sits on Layer 7 between the identity (human or non‑human) and the infrastructure target. When a service account authenticates via OIDC, hoop.dev validates the token, extracts group membership, and then proxies the connection to the database, Kubernetes cluster, SSH host, or HTTP service. Because the gateway is the only place the traffic passes, it can enforce the following GDPR‑centric controls:

  • Per‑request audit logs. hoop.dev records each query or command together with the identity that issued it, the timestamp, and the outcome. The logs are stored separately from the target system, providing a reliable audit trail for auditors.
  • Inline data masking. Before a response leaves the target, hoop.dev can redact columns that contain personal data, ensuring that downstream services never see raw identifiers unless explicitly allowed.
  • Just‑in‑time approval. For operations flagged as high‑risk, such as bulk deletes or exports of personal data, hoop.dev routes the request to a human approver. The request proceeds only after approval, satisfying the GDPR principle of purpose limitation.
  • Session recording and replay. Every interaction is captured as a replayable session. Investigators can replay a sequence of commands to understand exactly how data was handled.

Because hoop.dev is the gateway, these outcomes exist only because the gateway is in the data path. Removing hoop.dev would return the system to the unrecorded, unmasked state described earlier.

How the architecture aligns with GDPR evidence requirements

GDPR audits look for three evidence categories: access logs, data‑handling policies, and oversight records. hoop.dev produces each category continuously:

  • Access logs – every session is logged with the exact non‑human identity, satisfying the “who accessed what” demand.
  • Data‑handling policies – inline masking enforces data minimisation at the point of egress, providing proof that only necessary fields were disclosed.
  • Oversight records – approval workflows and session replays create a documented chain of responsibility for high‑impact actions.

The evidence is generated in real time, not as a monthly report, which aligns with GDPR’s requirement for “prompt” availability of records when requested by a supervisory authority.

Getting started with hoop.dev

Deploying hoop.dev is straightforward. The quick‑start guide walks you through a Docker Compose deployment that includes OIDC authentication, masking rules, and guardrails out of the box. Once the gateway is running, you register each target, database, Kubernetes cluster, SSH host, and define the policies that match your GDPR obligations. The open‑source nature of hoop.dev means you can host the gateway inside your own network, keeping all audit data under your control.

For detailed steps, see the getting‑started documentation and the broader learn portal where you can explore masking configuration, approval workflows, and session replay features.

FAQ

Do I need to modify my existing applications?
No. hoop.dev works at the protocol level, so existing clients (psql, kubectl, ssh, etc.) continue to operate unchanged. The only addition is that they connect through the hoop.dev gateway.

Can hoop.dev mask GDPR‑sensitive fields without changing the database schema?
Yes. The gateway inspects the response stream and can redact or replace configured columns before they leave the target, leaving the underlying schema untouched.

How does hoop.dev handle AI‑driven agents that act on personal data?
AI agents authenticate via OIDC just like any other service account. hoop.dev then applies the same masking, logging, and approval policies, ensuring that automated processing is accounted for under GDPR.

Explore the source code and contribute on GitHub.

Open source

Save the open-source gateway for agent data access

Hoop is MIT-licensed infrastructure for controlling how AI agents reach production data. Star hoophq/hoop so you can inspect it, deploy it, or share it when your team starts governing agent access.

Star and save the repo →More posts