The DLP API uses a few different classification methods to identify sensitive information. These include:
- Contextual Analysis: The presence of relevant strings in proximity to a pattern, a checksum matching string, or both.
- Pattern Matching: A specific alphanumeric pattern (not just string length), including delimiters, valid position, and valid range checks.
- Checksum: A checksum computation and verification with check digit.
- Word and phrase list: A full or partial match to an entry found in a dictionary of words and phrases.
Predefined detectors are not a 100% accurate detection method. For example, they can’t guarantee compliance with regulatory requirements. You must decide what data is sensitive and how to best protect it. Google recommends that you test your settings to make sure your configuration meets your requirements.