June 18, 20264 min read

HIPAA for non-human identities: governing machine access end to end (on Snowflake)

Machine identities that can read or write PHI without real-time oversight expose a health organization to massive HIPAA violations. HIPAA requires covered entities to protect the confidentiality, integrity, and availability of protected health information (PHI). The regulation explicitly calls for audit controls that record who accessed PHI, when, and what was viewed or modified. It also demands that access be limited to the minimum necessary, that data be masked when displayed to unauthorized

Free White Paper

End-to-End Encryption + Non-Human Identity Management: The Complete Guide

Architecture patterns, implementation strategies, and security best practices. Delivered to your inbox.

Coleman Nye

Machine identities that can read or write PHI without real-time oversight expose a health organization to massive HIPAA violations.

HIPAA requires covered entities to protect the confidentiality, integrity, and availability of protected health information (PHI). The regulation explicitly calls for audit controls that record who accessed PHI, when, and what was viewed or modified. It also demands that access be limited to the minimum necessary, that data be masked when displayed to unauthorized parties, and that any high‑risk operation be reviewed before execution. Those safeguards apply not only to human users but also to automated services, data pipelines, and AI agents that connect to data warehouses such as Snowflake.

Why existing machine‑identity setups fall short

In many organizations, non‑human identities are provisioned as long‑lived service accounts. The account is granted a broad Snowflake role, the credential is stored in a secret manager, and any application that needs data simply reads the secret and connects directly to Snowflake. This pattern satisfies the “who can start” question – the identity is known to the IdP – but it provides no enforcement on the data path. The connection bypasses any gateway that could inspect queries, enforce masking, or require an approval step. Consequently, every SELECT, INSERT, or UPDATE runs unchecked, and the audit logs produced by Snowflake alone often lack the granularity required by HIPAA because they cannot tie a query to a specific business justification or capture a human approval record.

The risk is amplified when the same credential is reused across multiple pipelines, test environments, and ad‑hoc scripts. A compromised secret instantly grants an attacker unrestricted read/write access to PHI, and the breach can continue unnoticed because there is no session recording or real‑time alerting. Even without a breach, the organization cannot demonstrate that it limited each machine’s access to the minimum necessary, nor can it prove that sensitive fields were masked before they left the warehouse.

The precondition for HIPAA‑aligned machine access

To meet HIPAA, a health‑care operation must first ensure that every request for Snowflake data passes through a control point that can enforce policy. The precondition is therefore a “gateway‑in‑the‑middle” architecture: an identity‑aware proxy that sits between the IdP‑issued token and the Snowflake endpoint. This proxy must be able to read the token, map the identity to a least‑privilege Snowflake role, and then apply runtime safeguards. Without that proxy, the request still reaches Snowflake directly, leaving the audit trail incomplete, the data unmasked, and the approval workflow absent.

In other words, the setup of OIDC authentication and minimal Snowflake roles is necessary but never sufficient. The enforcement point must be on the data path, not on the client or on the identity provider.

hoop.dev as the data‑path enforcement point

hoop.dev fulfills exactly that requirement. It is a Layer 7 gateway that intercepts every Snowflake connection after the user or service has been authenticated by an OIDC or SAML provider. The gateway validates the token, extracts group membership, and then maps the machine identity to a Snowflake role that grants only the permissions needed for the specific workload. Because hoop.dev sits in the data path, it can inspect each SQL statement before it reaches Snowflake.

Continue reading? Get the full guide.

End-to-End Encryption + Non-Human Identity Management: Architecture Patterns & Best Practices

Free. No spam. Unsubscribe anytime.

When a query matches a policy that marks the operation as high‑risk – for example, a SELECT that includes a PHI column or an INSERT that writes to a protected table – hoop.dev can pause the request and trigger a just‑in‑time approval workflow. An authorized reviewer receives the query context, approves or denies it, and the gateway forwards the request only on approval. This satisfies HIPAA’s “review and approval of access” requirement for automated processes.

What enforcement outcomes hoop.dev provides for HIPAA

Session recording: hoop.dev records every Snowflake session, capturing the full query text, timestamps, and the identity that issued it. The recordings can be accessed for audit purposes, providing a clear view of who accessed PHI.
Inline data masking: For queries that return PHI to a downstream service that should not see the raw values, hoop.dev can mask or redact the sensitive fields in real time, ensuring that only the minimum necessary data leaves the warehouse.
Just‑in‑time approval: High‑impact queries are automatically routed to a human approver. The approval decision is logged alongside the session, providing the evidence of “review” that HIPAA mandates.
Command blocking: Policies can outright reject statements that attempt to drop tables, alter schemas, or export large volumes of PHI, preventing accidental or malicious data loss.
Audit trail generation: hoop.dev emits a detailed audit log that includes identity, request time, approved actions, and masking outcomes. These logs can be forwarded to a SIEM or compliance platform, forming the core of the evidence set needed for HIPAA audits.

Each of these outcomes exists only because hoop.dev occupies the data path. If the gateway were removed, the Snowflake connection would revert to the insecure pattern described earlier, and none of the above safeguards would be present.

How the solution is assembled

The first step is to define a non‑human identity in your IdP – for example, a service account named etl‑pipeline that authenticates via OIDC. Assign that identity to a minimal Snowflake role that only permits SELECT on the tables required for the pipeline. Next, deploy the hoop.dev gateway using the Docker Compose quick‑start or a Kubernetes manifest. Register the Snowflake connection in hoop.dev, providing the Snowflake endpoint and the service account’s credential (which hoop.dev stores securely). Finally, author a policy that marks any query touching PHI columns as “requires approval” and enables inline masking for those columns.

All of the above configuration is documented in the getting‑started guide and the feature reference. The key architectural point is that the identity and role assignment happen in the setup phase, but the enforcement – audit, masking, approval, and blocking – only occurs when hoop.dev sits in the data path.

Getting started

To try this pattern in your environment, follow the step‑by‑step walkthrough in the getting‑started documentation. The guide shows how to create an OIDC client, register a Snowflake target, and write a simple policy that masks SSN fields. For deeper insight into each capability, explore the learn section of the site.

When you are ready to review the source code, contribute, or fork the project, visit the repository on GitHub: View the source on GitHub.

FAQ

Does hoop.dev make my Snowflake instance HIPAA‑certified?

No. hoop.dev does not claim certification. It generates the audit evidence and enforcement controls that help you satisfy HIPAA’s technical safeguards.

Can I use hoop.dev with other data warehouses?

Yes. hoop.dev supports multiple database connectors, including PostgreSQL, MySQL, and others. The same architectural pattern applies: place hoop.dev between the identity provider and the target, then define policies for masking and approval.

What happens if a machine identity is compromised?

Because every session is recorded and every high‑risk query requires approval, an attacker would be forced to trigger an approval workflow for any privileged operation. The recorded session also provides forensic evidence for post‑incident analysis.

Open source

Save the open-source gateway for agent data access

Hoop is MIT-licensed infrastructure for controlling how AI agents reach production data. Star hoophq/hoop so you can inspect it, deploy it, or share it when your team starts governing agent access.

Star and save the repo →More posts