InfoType detector reference

The DLP API team releases new infoType detectors and groups periodically. To get the latest list of built-in infoTypes, call the infoTypes.list method of the DLP API.

Global

InfoType Description
AGE

An age measured in months or years.

Detection method: Pattern match

CREDIT_CARD_NUMBER

A credit card number is 12 to 19 digits long. They are used for payment transactions globally.

Detection method: Pattern match and checksum

DOMAIN_NAME

A domain name as defined by the DNS standard.

Detection method: Pattern match and top level domain validation

EMAIL_ADDRESS

An email address indicates the mailbox that emails are sent to or from. The maximum length of the domain name is 255 characters, and the maximum length of the local-part is 64 characters.

Detection method: Pattern and top level domain validation

ETHNIC_GROUP

A person’s ethnic group.

Detection method: Word and phrase list

FIRST_NAME

A first name is defined as the first part of a PERSON_NAME.

Detection method: Custom logic

IBAN_CODE

An International Bank Account Number (IBAN) is defined as an internationally agreed-upon method for identifying bank accounts. It's defined by the International Standard of Organization (ISO) 13616:2007 standard. ISO 13616:2007 was created by the European Committee for Banking Standards (ECBS). An IBAN consists of up to 34 alphanumeric characters including elements such as a country code or account number.

Detection method: Pattern match and checksum

ICD9_CODE

The International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM) lexicon is used to assign diagnostic and procedure codes associated with inpatient, outpatient, and physician office use in the United States. It was created by the US National Center for Health Statistics (NCHS). The ICD-9-CM is based on the ICD-9 but provides for additional morbidity detail. It's updated annually on October 1.

Detection method: Word and phrase list

ICD10_CODE

Like ICD-9-CM codes, the International Classification of Diseases, Tenth Revision, Clinical Modification (ICD-10-CM) lexicon is a series of diagnostic codes published by the World Health Organization (WHO) to describe causes of morbidity and mortality.

Detection method: Word and phrase list

IMEI_HARDWARE_ID

An International Mobile Equipment Identity (IMEI) hardware identifier, used to identify mobile phones.

Detection method: Custom Logic, pattern match and context.

IP_ADDRESS

An Internet Protocol (IP) address (either IPv4 or IPv6).

Detection method: Custom Logic, pattern match and context.

LAST_NAME

A last name is defined as the last part of a PERSON_NAME.

Detection method: Custom logic

LOCATION

A physical address or location—for example, "1600 Amphitheatre Parkway, Mountain View, CA 94043, United States" or "Space Needle."

Detection method: Custom logic

MAC_ADDRESS, MAC_ADDRESS_LOCAL A media access control address (MAC address), which is an identifier for a network adapter.

Detection method: Custom logic, pattern match and context

Context:

  • mac address
  • hardware address
  • physical address
  • hwaddr
  • ether
  • ethernet
  • BSSID
PERSON_NAME

A full person name, which can include first names, middle names or initials, and last names.

Detection method: Custom logic

PHONE_NUMBER, US_TOLLFREE_PHONE_NUMBER

A telephone number or US toll-free telephone number.

Detection method: Custom logic, pattern match and context

SWIFT_CODE

A SWIFT code is the same as a Bank Identifier Code (BIC). It's a unique identification code for a particular bank. These codes are used when transferring money between banks, particularly for international wire transfers. Banks also use the codes for exchanging other messages.

Detection method: Pattern match and context

Context:

  • SWIFT
  • ISO 9362
  • Business Identifier Code
  • BIC
  • Business Entity Identifier
  • BEI
  • bank
  • interbank

Australia

InfoType Description
AUSTRALIA_MEDICARE_NUMBER

A 9-digit Medicare account number is issued to permanent residents of Australia (except for Norfolk island). The primary purpose of this number is to prove Medicare eligibility to receive subsidized care in Australia.

Detection method: Checksum and (pattern match or context)

Context:

  • Medicare
  • Australia
  • IRN
AUSTRALIA_TAX_FILE_NUMBER

An Australian tax file number (TFN) is a number issued by the Australian Tax Office for taxpayer identification. Every taxpaying entity, such as an individual or an organization, is assigned a unique number.

Detection method: Checksum and (pattern match or context)

Context:

  • Tax File Number
  • TFN
  • Australian Tax Office

Brazil

InfoType Description
BRAZIL_CPF_NUMBER

The Cadastro de Pessoas Físicas (CPF) number, or Natural Persons Register number, is an 11-digit number used in Brazil for taxpayer identification.

Detection method: Checksum and (pattern match or context)

Context:

  • CPF
  • Cadastro de Pessoas Físicas
  • Pessoas Físicas
  • Tax Number
  • Taxpayer

Canada

InfoType Description
CANADA_BC_PHN

The British Columbia Personal Health Number (PHN) is issued to citizens, permanent residents, temporary workers, students, and other individuals who are entitled to health care coverage in the Province of British Columbia.

Detection method: Pattern match or 10 digits with context

Context:

  • BC ID
  • PHN
  • British Columbia
  • Personal Health Number
  • Services Card
  • Canadian health insurance number
  • Canadian health ID
CANADA_OHIP

The Ontario Health Insurance Plan (OHIP) number is issued to citizens, permanent residents, temporary workers, students, and other individuals who are entitled to health care coverage in the Province of Ontario.

Detection method: Pattern match and checksum

CANADA_PASSPORT

A Canadian passport number.

Detection method: Pattern match and context

Context:

  • Canada
  • Canadian
  • Numéro de passeport
  • Passport
  • Travel Document
  • document number
CANADA_QUEBEC_HIN

The Quebec Health Insurance Number (HIN) is issued to citizens, permanent residents, temporary workers, students and other individuals who are entitled to health care coverage in the Province of Quebec.

Detection method: Pattern match

CANADA_SOCIAL_INSURANCE_NUMBER

The Canadian Social Insurance Number (SIN) is the main identifier used in Canada for citizens, permanent residents, and those on work or study visas. With a Canadian SIN and mailing address, one can apply for health care coverage, driver's licenses, and other important services.

Detection method: Checksum and (pattern match or context)

China

InfoType Description
CHINA_RESIDENT_ID_NUMBER

A Chinese resident identification number.

Detection method: Pattern match and context

Context:

  • China
  • Chinese
  • Identity Number
  • Resident Number
  • Resident ID
  • ID number
  • 居民身份证
  • 居民身份證
CHINA_PASSPORT

A Chinese passport number.

Detection method: Pattern match and context

Context:

  • China
  • Passport
  • 中华人民共和国护照
  • 护照号
  • Hùzhào hào
  • 护照

France

InfoType Description
FRANCE_CNI

The Carte Nationale d'Identité Sécurisée (CNI or CNIS) is the French national identity card. It's an official identity document consisting of a 12-digit identification number. This number is commonly used when opening bank accounts and when paying by check. It can sometimes be used instead of a passport or visa within the European Union (EU) and in some other countries.

Detection method: Pattern match and context

Context:

  • CNI
  • CNIS (carte nationale d'identité securisée)
  • identité
  • identite
FRANCE_NIR

The Numéro d'Inscription au Répertoire (NIR) is a permanent personal identification number that's also known as the French social security number for services including healthcare as well as pensions.

Detection method: Pattern match and checksum

FRANCE_PASSPORT

A French passport number.

Detection method: Pattern match and context

Context:

  • France
  • Passport
  • Passeport
  • REPUBLIC FRANCAIS
  • Numéro de passeport

Germany

InfoType Description
GERMANY_PASSPORT

A German passport number. The format of a German passport number is 10 alphanumeric characters, chosen from numerals 0-9 and letters C, F, G, H, J, K, L, M, N, P, R, T, V, W, X, Y, Z.

Detection method: Pattern match and context

Context:

  • GERMANY
  • REISEPASS
  • PASSPORT
  • Europäische Union
  • Bundesrepublik
  • Deutschland
  • reisepassnummer

India

InfoType Description
INDIA_PAN_INDIVIDUAL

The Personal Permanent Account Number (PAN) is a unique 10-digit alphanumeric identifier used for identification of individuals, particularly those who pay income tax. It's issued by the Indian Income Tax Department. The PAN is valid for the lifetime of the holder.

Detection method: Pattern match and context

Context:

  • India
  • Account Number
  • PAN
  • Taxpayer ID

Japan

InfoType Description
JAPAN_INDIVIDUAL_NUMBER

Sometimes referred to as "My Number," the Japanese national identification number is a new national ID number as of January 2016.

Context:

  • 個人番号
  • マイナンバー
  • 身分証明書
  • Individual Number
  • My Number
  • Identity Card
JAPAN_PASSPORT

A Japanese passport number. The passport number consists of two alphabetic characters followed by seven digits.

Detection method: Pattern match and context

Context:

  • パスポート
  • パスポート番号
  • Japan
  • Passport

Korea

InfoType Description
KOREA_PASSPORT

A Korean passport number. There are two different formats:

  • Pre-2008 passport numbers consist of 9 characters. The first two characters are the issued local code, corresponding to the holder's gu, or district. The remaining seven digits are the serial number.
  • Post-2008 passport numbers consist of 9 characters. The first character is either a single letter M, denoting PM passports, or the letter S for PS passports. The remaining 8 digits are the serial number.

Detection method: Pattern match and context

Context:

  • 여권
  • 대한민국
  • Passport
  • Korea
KOREA_RRN

A South Korean Social Security Number.

Detection method: Pattern match, checksum and context

Context:

  • 주민등록번호
  • 住民登錄番號
  • korean
  • korea
  • KSSN
  • RRN
  • resident registration
  • registration number
  • social security

Mexico

InfoType Description
MEXICO_CURP_NUMBER

The Mexico Clave Única de Registro de Población (CURP) number, or Unique Population Registry Code or Personal Identification Code number. This is an 18-character state-issued identification number assigned by the Mexican government to citizens or residents of Mexico and used for taxpayer identification.

Detection method: Pattern match and context

Context:

  • CURP
  • Clave Única
  • Población
  • Registro
  • UPRC
  • Personal ID
  • Registry Code
MEXICO_PASSPORT

A Mexican passport number.

Detection method: Pattern match and context

Context:

  • Mexico
  • Passport
  • Pasaporte
  • México
  • Mexican

Netherlands

InfoType Description
NETHERLANDS_BSN_NUMBER

A Netherlands Burgerservicenummer (BSN), or Citizen's Service Number, is a state-issued identification number that's on driver's licenses, passports, and international ID cards.

Detection method: Checksum and (pattern match or context)

Context:

  • BSN
  • Personal Number
  • Burgerservicenummer
  • Netherlands
  • Identification Number
  • Service Number
  • sofinummer
  • sofi
  • personalnummer

Poland

InfoType Description
POLAND_PESEL_NUMBER

The PESEL number is the national identification number used in Poland. It is mandatory for all permanent residents of Poland, as well as for temporary residents staying there longer than 2 months. It is assigned to just one person and cannot be changed.

Detection method: Checksum and (pattern match or context)

Context:

  • PESEL
  • Personal number
  • Personal ID number
POLAND_NATIONAL_ID_NUMBER

The Polish identity card number. is a government identification number for Polish citizens. Every citizen older than 18 years must have an identity card. The card is issued by the local Office of Civic Affairs. Every identity card has its own unique number.

Detection method: Checksum and (pattern match or context)

Context:

  • Identity card
  • Dowód osobisty
  • Numer dowodu
POLAND_PASSPORT

A Polish passport number. Polish passport is an international travel document for Polish citizens. It can also be used as a proof of Polish citizenship.

Detection method: Checksum and (pattern match or context)

Context:

  • Passport
  • Passeport
  • Pass
  • Paszport
  • Poland
  • Polska

Spain

InfoType Description
SPAIN_NIE_NUMBER

The Número de Identificación de Extranjeros (NIE) is an identification number for foreigners living or doing business in Spain. An NIE number is needed for key transactions such as opening a bank account, buying a car, or setting up a mobile phone contract.

Detection method: Checksum and (pattern match or context)

Context:

  • Número de Identificación de Extranjeros
  • NIE
SPAIN_NIF_NUMBER

The Número de Identificación Fiscal (NIF) is a government identification number for Spanish citizens. An NIF number is needed for key transactions such as opening a bank account, buying a car, or setting up a mobile phone contract.

Detection method: Checksum and (pattern match or context)

Context:

  • Número de Identificación Fiscal
  • NIF
SPAIN_PASSPORT

A Spanish Ordinary Passport (Pasaporte Ordinario) number. There are 4 different types of passports in Spain. This detector is for the Ordinary Passport (Pasaporte Ordinario) type, which is issued for ordinary travel, such as vacations and business trips.

Detection method: Pattern match and context

Context:

  • Passport
  • Pasaporte
  • Espana
  • España
  • Spain

United Kingdom

InfoType Description
UK_DRIVERS_LICENSE_NUMBER

A driver's license number for the United Kingdom of Great Britain and Northern Ireland (UK).

Detection method: Pattern match

UK_NATIONAL_HEALTH_SERVICE_NUMBER

A National Health Service (NHS) number is the unique number allocated to a registered user of the three public health services in England, Wales, and the Isle of Man.

Detection method: Pattern match and checksum

UK_NATIONAL_INSURANCE_NUMBER

The National Insurance number (NINO) is a number used in the United Kingdom (UK) in the administration of the National Insurance or social security system. It identifies people, and is also used for some purposes in the UK tax system. The number is sometimes referred to as NI No or NINO.

Detection method: Pattern match (with delimiters) or pattern match and context words

UK_PASSPORT

A United Kingdom (UK) passport number.

Detection method: Pattern match and context

Context:

  • United Kingdom
  • Passport
  • Travel Document
UK_TAXPAYER_REFERENCE

A United Kingdom (UK) Unique Taxpayer Reference (UTR) number. This number, comprised of a string of 10 decimal digits, is an identifier used by the UK government to manage the taxation system. Unlike other identifiers, such as the passport number or social insurance number, the UTR is not listed on official identity cards.

Detection method: Pattern match and context

Context:

  • United Kingdom
  • Taxpayer
  • UTR

United States

InfoType Description
AMERICAN_BANKERS_CUSIP_ID

A Committee on Uniform Security Identification Procedures (CUSIP) number is a 9-character alphanumeric code that identifies a North American financial security.

Detection method: Checksum or context (when check digit not present)

Context: CUSIP

FDA_CODE

The National Drug Code (NDC) is a unique identifier for drug products, mandated in the United States by the Food and Drug Administration (FDA).

Detection method: Word and phrase list

US_ADOPTION_TAXPAYER_IDENTIFICATION_NUMBER

An Adoption Taxpayer Identification Number (ATIN) is a type of Tax Identification Number (TIN), issued by the Internal Revenue Service (IRS) to individuals who are in the process of legally adopting a US citizen or resident child.

Detection method: Pattern match or 9 digits with context

Context:

  • SSN
  • Social
  • Social Security
  • Taxpayer
  • Taxpayer ID
  • Taxpayer identification
  • Tax ID
  • Tax identification
  • ATIN
  • TIN
  • Pending TIN
  • Adoption
  • Adoptions
  • Pending US adoption
  • Pending US adoptions
US_BANK_ROUTING_MICR

The American Bankers Association (ABA) Routing Number (also called the transit number) is a nine-digit code. It's used to identify the financial institution that's responsible to credit or entitled to receive credit for a check or electronic transaction.

Detection method: Checksum on 9 digits

Context: The following hotwords:

  • ABA
  • routing
  • transit
  • bank
  • banking
US_DEA_NUMBER

A Drug Enforcement Administration (DEA) number is assigned to a health care provider by the US DEA. It allows the health care provider to write prescriptions for controlled substances. The DEA number is often used as a general "prescriber number" that is a unique identifier for anyone who can prescribe medication.

Detection method: Pattern match and checksum

US_DRIVERS_LICENSE_NUMBER

A driver's license number for the United States. Format can vary depending on the issuing state.

Detection method: Pattern match and context

Context:

  • Drive
  • Driving
  • Learn
  • Lic
  • License
  • Licence
  • Permit
  • DL

Match Quality: Driver's licenses are not well defined and may generate noise results unless there is clear context.

US_EMPLOYER_IDENTIFICATION_NUMBER

An Employer Identification Number (EIN) is also known as a Federal Tax Identification Number, and is used to identify a business entity.

Detection method: Pattern match or 9 digits with context

Context:

  • employer
  • patronal
  • ein
US_HEALTHCARE_NPI

The National Provider Identifier (NPI) is a unique 10-digit identification number issued to health care providers in the United States by the Centers for Medicare and Medicaid Services (CMS). The NPI has replaced the unique provider identification number (UPIN) as the required identifier for Medicare services. It's also used by other payers, including commercial healthcare insurers.

Detection method: Checksum on 10 digits

US_INDIVIDUAL_TAXPAYER_IDENTIFICATION_NUMBER

An Individual Taxpayer Identification Number (ITIN) is a type of Tax Identification Number (TIN), issued by the Internal Revenue Service (IRS). An ITIN is a tax processing number only available for certain nonresident and resident aliens, their spouses, and dependents who cannot get a Social Security Number (SSN).

Detection method: Pattern match or 9 digits with context

Context:

  • SSN
  • Social
  • Social Security
  • Taxpayer
  • Taxpayer ID
  • Tax ID
  • Tax identification
  • ITIN
  • TIN
  • Individual Tax Identification Number
US_PASSPORT

A United States passport number.

Detection method: Pattern match and context

Context:

  • United States
  • USA
  • Passport
  • Travel
  • Document
US_PREPARER_TAXPAYER_IDENTIFICATION_NUMBER

A Preparer Taxpayer Identification Number (PTIN) is an identification number that all paid tax return preparers must use on US federal tax returns or claims for refund submitted to the Internal Revenue Service (IRS).

Detection method: Pattern match and context

Context:

  • SSN
  • Social
  • Social Security
  • Taxpayer
  • Taxpayer ID
  • Taxpayer identification
  • Tax ID
  • Tax identification
  • PTIN
  • TIN
  • Preparer Taxpayer Identification Number
US_SOCIAL_SECURITY_NUMBER

A United States Social Security number (SSN) is a 9-digit number issued to US citizens, permanent residents, and temporary residents. The Social Security number has effectively become the United States national identification number.

Detection method: Pattern match or 9 digits with context

Context:

  • SSN
  • Social
  • Social Security
  • Taxpayer
  • Taxpayer ID
  • Taxpayer identification
US_VEHICLE_IDENTIFICATION_NUMBER

A vehicle identification number (VIN) is a unique 17-digit code assigned to every on-road motor vehicle.

Detection method: Checksum and pattern match

Context:

  • VIN
  • Vehicle Identification Number
Was this page helpful? Let us know how we did: