An offboarded contractor left a CI pipeline that spins up a headless Chrome instance to scrape user profiles for a marketing dashboard. The pipeline still runs nightly, pulling names, emails, and phone numbers while the team lacks visibility into who triggered the job or what data was extracted. When a data‑subject request arrives, the team cannot prove whether anyone accessed the information, let alone delete it from logs.
Headless browsers are powerful automation tools, but they behave like any other client that can read and write personal data. Under Brazil's General Data Protection Law (lgpd), every processing activity must be documented, purpose‑limited, and protected against unauthorized exposure. The law obliges organizations to maintain detailed audit trails, apply technical safeguards such as masking, and enforce strict access controls that grant permission only for a defined purpose and time window.
In a typical setup, a developer writes a script that launches Chrome in headless mode, authenticates to a web application, and extracts fields that contain personal identifiers. The developer runs the script with a static service‑account token that has broad read permissions. Because the script contacts the target site directly, no gate intercepts the traffic. The team does not record the execution, does not mask the data, and any accidental over‑collection goes unnoticed until an audit or regulator raises a question.
What lgpd expects from automated data collection
lgpd defines several technical and organizational measures that organizations must apply when they process personal data:
- Accountability: organizations must demonstrate how data is accessed, by whom, and for what purpose.
- Purpose limitation and data minimization: organizations may collect only the data necessary for the declared purpose.
- Security of processing: organizations must protect personal data with encryption, masking, or other techniques that prevent unauthorized disclosure.
- Auditability: organizations must capture who performed each operation, the exact request parameters, and the response payloads that contain personal information.
- Right to erasure and rectification: when a data‑subject request arrives, organizations must locate and delete or correct the relevant records.
Meeting these requirements with a headless browser alone is impossible because the browser itself does not provide built‑in governance. Teams need a control plane that sits between the automation script and the target service, enforcing policies at the protocol level.
How hoop.dev helps meet lgpd requirements
hoop.dev acts as a layer‑7 gateway that intercepts every request from a headless browser before it reaches the target web application. The gateway becomes the only place where enforcement can happen, turning the data path into a policy enforcement point.
Setup determines who may request access. Identity providers such as Okta or Azure AD issue OIDC tokens that identify the automation service account. The token tells hoop.dev which principal is requesting a session, but the token itself does not grant any permissions.
The data path is the gateway itself. Teams route all HTTP traffic from the headless browser through hoop.dev. Because the gateway parses each request and response, it can apply lgpd‑specific controls in real time.
