Least-privilege access for AI agents on Kubernetes

Least-privilege access for AI agents on Kubernetes usually fails quietly, and the same way every time: someone grants a wildcard to ship a feature, the agent works, and the over-grant never gets walked back. By the time anyone looks, an agent that was supposed to restart pods can read every secret in the cluster. Start with the mistakes, because the correct model is mostly what you get from refusing to make them.

The RBAC mistakes that bite first

Wildcards in the Role. A rule with verbs: ["*"] or resources: ["*"] is least-privilege in name only. It is the single most common over-grant, and it is invisible until an incident reads it back to you.
ClusterRoleBinding where a RoleBinding would do. Binding an agent at the cluster scope when it only needs one namespace hands it the whole cluster. The blast radius is now everything, not one workload.
Standing access with no expiry. RBAC grants do not time out. A scope the agent needed once on Tuesday is still live next quarter, sitting there as resting privilege the agent never gives back.
Reusing a human's permissions. Pointing the agent at an engineer's group bindings means the agent inherits everything that person can do, which is almost never the least the task needs.
No record of what was actually used. Without a session-level record of which verbs and resources the agent touched, you cannot tighten the grant later, so it only ever grows.

Every one of these has the same root. The privilege is defined once, statically, and then never reconciled against what the task actually requires this minute.

What least-privilege has to mean for an agent

Turn the mistakes around and the model appears. Least-privilege access for an agent means the grant carries the narrowest set of verbs and resources the task needs, scoped to a namespace rather than the cluster, ending on its own rather than persisting, tied to the agent's own identity rather than a human's, and recorded so it can be tightened. Static RBAC can express the first two. It struggles badly with the last three, because RBAC has no native notion of "for this task, for the next ten minutes."

The model that holds the line

The architectural requirement is that the privilege boundary sits where the agent cannot widen it. If the agent can edit its own RBAC, or mint its own access, least-privilege is a suggestion. The boundary has to be enforced by something outside the agent process. That is what hoop.dev is built to do. The agent reaches the Kubernetes API through hoop.dev, a Layer 7 access gateway and identity-aware proxy in front of the cluster's access path. Access is brokered per task against the agent's identity, the scope is enforced at the gateway where the agent cannot reconfigure it, and each session is recorded outside the agent. Least-privilege stops being a static RBAC object you hope nobody widened and becomes a property of the access path. The learn pages on per-task access explain the model, and the getting-started docs for fronting a Kubernetes cluster cover the connection.

Continue reading? Get the full guide.

Least Privilege Principle + Kubernetes API Server Access: Architecture Patterns & Best Practices

Free. No spam. Unsubscribe anytime.

This is the difference between defining least privilege and enforcing it. One sits in a YAML file anyone with edit rights can change. The other is checked on every command, at a boundary the agent does not control.

A before and after

Before: the agent holds a ClusterRole with get and list on secrets across all namespaces, because one task once needed to read a TLS secret. The grant is permanent and cluster-wide. After: the agent requests access for that task, receives a grant scoped to read one secret in one namespace for the length of the task, and the session records the read. An attempt to list secrets elsewhere is denied and recorded. The privilege existed only while the task ran, and only where the task pointed.

Pitfalls that survive even a good setup

Scope without expiry. A perfectly narrow grant that never ends is still standing access. Time is half of least privilege.
Approving every grant by reflex. If everything routes to an approver, they rubber-stamp. Reserve approval for the scopes that earn the friction and let low-risk reads pass.

FAQ

Doesn't Kubernetes RBAC already give me least-privilege access?

RBAC lets you express narrow verbs and resources, which is necessary. It does not give you per-task, expiring grants tied to an agent identity, and it can be widened by anyone with edit rights. Enforcing the boundary outside the agent closes that gap.

How narrow is narrow enough?

As narrow as the task, no wider. If a task needs to read one config map, the grant covers that config map, not the namespace, and it ends when the task does.

hoop.dev is open source. To enforce least-privilege access in front of your Kubernetes cluster, start with the repository on GitHub.