Databricks handles massive datasets at scale. Without precise rules for PII detection, confidential data can leak into analytics outputs, machine learning models, or shared reports. Built-in capabilities, plus custom logic, allow scanning for sensitive fields like Social Security numbers, phone numbers, and financial data. Regex-based detection and pattern matching help