All posts

GDPR for non-human identities: governing machine access end to end (on on-prem)

Many assume GDPR only governs the personal data of natural people, not the credentials that machines use to talk to each other. The regulation actually covers any personal data that a non‑human identity processes, which means service accounts, bots and automated scripts are also subject to the same accountability, purpose‑limitation and audit obligations. GDPR defines a data controller’s duty to demonstrate that personal data is accessed only for a lawful purpose, that access is proportionate,

Free White Paper

End-to-End Encryption + Non-Human Identity Management: The Complete Guide

Architecture patterns, implementation strategies, and security best practices. Delivered to your inbox.

Free. No spam. Unsubscribe anytime.

Many assume GDPR only governs the personal data of natural people, not the credentials that machines use to talk to each other. The regulation actually covers any personal data that a non‑human identity processes, which means service accounts, bots and automated scripts are also subject to the same accountability, purpose‑limitation and audit obligations.

GDPR defines a data controller’s duty to demonstrate that personal data is accessed only for a lawful purpose, that access is proportionate, and that a complete audit trail is retained. Auditors will ask for concrete artifacts: who (or what) accessed the data, when the access occurred, which fields were read or modified, and whether any data‑subject rights (such as erasure or rectification) were respected. When machines act on behalf of users, those artifacts must still be available, otherwise the controller cannot prove compliance.

Why traditional machine‑to‑machine access falls short of gdpr expectations

In many on‑prem environments, engineers provision a long‑lived service account, embed its secret in configuration files, and grant it broad rights to databases, Kubernetes clusters or internal APIs. The account is then used by CI pipelines, monitoring agents and automated remediation tools. This pattern creates three compliance gaps:

  • Missing attribution. The service account is a single identity; auditors cannot tell which pipeline or which developer triggered a specific query.
  • Unlimited standing privilege. The credential never expires and can be reused across unrelated workloads, violating the principle of data minimisation.
  • No immutable evidence. Logs are often stored on the same host that runs the workload, making them vulnerable to tampering and providing no guarantee that every command was captured.

Because the enforcement point is usually the target system itself, the controller cannot insert additional guardrails without modifying each downstream service. The result is a fragile audit surface that does not satisfy GDPR’s “accountability” clause.

How an identity‑aware gateway satisfies gdpr evidence requirements

Placing hoop.dev, a Layer 7 gateway, in the data path creates a single, immutable enforcement boundary for every machine‑initiated request. The gateway authenticates the caller via OIDC or SAML, then proxies the connection to the target resource. Because all traffic passes through hoop.dev, it can record, mask and approve actions before they reach the backend.

When non‑human identities are routed through hoop.dev, the following gdpr artifacts become automatically available:

  • Full session logs. hoop.dev records each request and response with timestamps, the originating service account, and the user‑level identity that the account represents. The logs are stored separate from the target host, providing a reliable audit trail.
  • Just‑in‑time (JIT) access. hoop.dev issues short‑lived tokens after an approval workflow. Auditors can see the exact approval chain that granted the privilege.
  • Inline data masking. hoop.dev applies real‑time redaction to sensitive personal fields such as email or SSN. The masking policy is recorded, proving that the controller applied data‑minimisation before the data left the system.
  • Command‑level approval. hoop.dev routes risky operations such as bulk deletes or schema changes to a human reviewer. The approval record ties the action to a specific business justification, satisfying purpose‑limitation requirements.

Because hoop.dev never exposes the underlying credential to the caller, the “agent never sees the secret” principle eliminates credential leakage and supports the GDPR requirement to protect personal data at rest and in transit.

Continue reading? Get the full guide.

End-to-End Encryption + Non-Human Identity Management: Architecture Patterns & Best Practices

Free. No spam. Unsubscribe anytime.

Architectural breakdown: setup, data path, and enforcement outcomes

Setup. Identity providers (Okta, Azure AD, Google Workspace) issue OIDC tokens for each service account. The tokens carry group membership that maps to business roles. This step decides who the request is and whether it may start, but on its own does not enforce any data‑level policy.

The data path. hoop.dev sits between the service account and the target resource, be it a PostgreSQL database, a Kubernetes API server or an SSH host. All packets flow through hoop.dev, making it the only place where enforcement can be applied.

Enforcement outcomes. Because hoop.dev controls the data path, it records every session, masks personal fields, requires JIT approval for privileged commands, and blocks disallowed statements. Those outcomes exist only because hoop.dev is present in the path.

Implementing gdpr‑ready machine access with hoop.dev

hoop.dev embodies the identity‑aware gateway described above. It proxies connections to databases, Kubernetes clusters, SSH hosts and internal HTTP services, while applying the enforcement outcomes needed for gdpr compliance. The product is open source, MIT licensed, and can be self‑hosted on‑premise.

To get started, deploy the gateway using the official Docker Compose quick‑start, configure OIDC authentication, and register each target resource. Detailed guidance is available in the getting‑started documentation and the broader learn portal. Once deployed, define masking rules for personal columns, enable just‑in‑time token issuance, and configure approval workflows for high‑risk commands. From that point forward, every machine‑initiated request is logged, masked where required, and approved according to policy, providing the concrete evidence auditors demand.

FAQ

Do I need to replace existing service accounts?

No. hoop.dev can wrap existing credentials. The gateway holds the secret and presents short‑lived tokens to callers, so the original account remains unchanged while gaining gdpr‑compliant controls.

How long are the audit logs retained?

Retention is a policy decision made by the controller. hoop.dev stores logs in a configurable backend, allowing you to keep evidence for the period required by gdpr (typically six years).

Can hoop.dev mask data for databases I don’t control?

Yes. Because masking happens at the protocol layer inside hoop.dev, the underlying database does not need to support column‑level redaction. The gateway rewrites response payloads before they reach the client.

By moving authentication, authorization and audit to a single, immutable gateway, organizations can turn opaque machine identities into transparent, accountable actors that satisfy gdpr’s evidence requirements.

Explore the open‑source code on GitHub to see how the gateway is built and to contribute your own compliance extensions.

Open source

Save the open-source gateway for agent data access

Hoop is MIT-licensed infrastructure for controlling how AI agents reach production data. Star hoophq/hoop so you can inspect it, deploy it, or share it when your team starts governing agent access.

Star and save the repo →More posts