All posts

GDPR for non-human identities: governing machine access end to end (on AWS)

When a data‑processing pipeline leaks personal data because a service account accessed a database without oversight, the organization faces fines that dwarf the original development effort. GDPR makes it clear that every controller must be able to demonstrate who accessed personal data, why, and what was returned. Non‑human identities, such as service accounts, CI/CD bots, and other machine credentials, are the workhorses of modern cloud environments. IAM grants them broad permissions to keep p

Free White Paper

End-to-End Encryption + Non-Human Identity Management: The Complete Guide

Architecture patterns, implementation strategies, and security best practices. Delivered to your inbox.

Free. No spam. Unsubscribe anytime.

When a data‑processing pipeline leaks personal data because a service account accessed a database without oversight, the organization faces fines that dwarf the original development effort. GDPR makes it clear that every controller must be able to demonstrate who accessed personal data, why, and what was returned.

Non‑human identities, such as service accounts, CI/CD bots, and other machine credentials, are the workhorses of modern cloud environments. IAM grants them broad permissions to keep pipelines fast, but they rarely rotate and their activity logs become too noisy. The result is an audit trail that cannot answer the GDPR questions of accountability, data minimisation, and lawful processing.

The typical AWS setup provisions an IAM role or OIDC‑derived token, assigns it to a service account, and lets the machine connect directly to the target resource – a database, an SSH host, or an internal HTTP API. This setup decides who may start a session, but it provides no runtime enforcement. No request gets inspected for PII, no command stops a dangerous operation, and no human can approve a risky action at the moment it occurs.

Because the connection bypasses any central gateway, every request travels straight to the backend. The backend sees the raw credentials, the request payload, and the response data. There is no place to apply inline masking of personal identifiers, no just‑in‑time approval workflow, and no built‑in session recording. Consequently, GDPR evidence scatters across CloudTrail, database logs, and application traces, making it hard for auditors to prove compliance.

How gdpr evidence is built with a gateway

hoop.dev provides a Layer 7, identity‑aware proxy that sits between non‑human identities and the AWS resources they need. By positioning itself in the data path, hoop.dev becomes the only point where traffic can be inspected, altered, and recorded before it reaches the target.

When a machine presents an OIDC or SAML token, hoop.dev validates the token, extracts group membership, and then forwards the request using credentials that only the gateway knows. The original service account never sees the backend credential, and the backend never sees the original token. This separation lets hoop.dev enforce policies that IAM alone cannot achieve.

Continue reading? Get the full guide.

End-to-End Encryption + Non-Human Identity Management: Architecture Patterns & Best Practices

Free. No spam. Unsubscribe anytime.

Enforcement outcomes that matter for gdpr

  • Session recording. hoop.dev records every command and response in its audit log. Auditors can replay a session to see exactly which rows were read and what data was returned.
  • Inline data masking. hoop.dev redacts sensitive fields such as email addresses, social‑security numbers, or credit‑card numbers in real time before they leave the gateway. The logs retain the masked version, satisfying the data‑minimisation principle.
  • Just‑in‑time approval. hoop.dev routes high‑risk operations – for example, a bulk export of a user table – to a human reviewer who must approve the action before it executes.
  • Command blocking. hoop.dev rejects dangerous commands (e.g., DROP DATABASE), preventing accidental data loss that could trigger GDPR penalties.

Because hoop.dev is the sole enforcement point, each of these outcomes exists only because the gateway sits in the data path. The IAM policies that granted the service account access remain unchanged; they merely decide whether a session can start.

Continuous evidence for auditors

GDPR requires organisations to maintain records of processing activities (Article 30) and to be able to demonstrate accountability (Article 5). hoop.dev continuously accrues evidence that satisfies both requirements:

  1. hoop.dev timestamps every session and links it to the originating identity.
  2. Masked response payloads prove that personal data was handled according to the data‑minimisation rule.
  3. Approval events create a clear audit trail of who authorised a risky operation.
  4. Replayable logs allow auditors to verify that no unauthorised data was extracted.

Teams can export these logs from the gateway, index them, and query them with standard tools, making the evidence ready for a GDPR audit without stitching together CloudTrail, database logs, and application‑specific traces.

Operational benefits beyond compliance

Implementing hoop.dev reduces the blast radius of a compromised service account. If a credential leaks, the attacker still must pass through the gateway, where policies can block suspicious commands and mask any PII that would otherwise be exposed.

The continuous audit trail also speeds up responses to data‑subject requests. Because the gateway knows exactly which sessions accessed a particular identifier, teams locate the relevant records quickly and demonstrate the lawful basis for processing.

Finally, the approach aligns with the principle of “privacy by design”. By enforcing masking and approval at the network edge, organisations embed GDPR safeguards directly into the runtime environment rather than relying on after‑the‑fact checks.

Getting started

To adopt this model, provision the hoop.dev gateway in your AWS environment, configure OIDC authentication, and register the resources you need to protect – databases, SSH hosts, or internal HTTP APIs. Detailed instructions are available in the getting‑started guide. The open‑source repository on GitHub provides the Docker Compose quick‑start and Helm charts for production deployments: github.com/hoophq/hoop. For deeper insight into masking, approvals, and session replay, explore the learn section of the documentation.

FAQ

How does hoop.dev help with GDPR’s “right to be forgotten”?hoop.dev masks personal data in responses, ensuring that only the minimal necessary information ever stores in audit logs. While the gateway does not delete the original data in downstream systems, it provides a clear, auditable record of what was accessed and can prove that the retained logs contain no raw PII.Does hoop.dev replace existing IAM policies?No. IAM decides which non‑human identity may start a session. hoop.dev adds runtime controls – masking, approval, recording – that IAM cannot enforce on its own.Can hoop.dev protect any AWS service?hoop.dev proxies connections to databases, SSH hosts, RDP, and internal HTTP services that run on AWS. It does not act as an AWS control‑plane service, but it can protect any workload reachable through the supported protocols.

Open source

Save the open-source gateway for agent data access

Hoop is MIT-licensed infrastructure for controlling how AI agents reach production data. Star hoophq/hoop so you can inspect it, deploy it, or share it when your team starts governing agent access.

Star and save the repo →More posts