What the sensitive information types in Exchange 2013 look for
Applies to: Exchange Server 2013
Data loss prevention (DLP) includes sensitive information types that are ready for you to use in your DLP policies. This topic lists the sensitive information types in Exchange Server 2013, and shows what a DLP policy looks for when it detects each type.
A sensitive information type is defined by a pattern that can be identified by a regular expression or a function. Additionally, corroborative evidence such as keywords and checksums can be used to identify a sensitive information type. Confidence level and proximity are also used in the evaluation process.
ABA Routing Number
Format
Nine digits, which may be in a formatted or unformatted pattern.
Pattern
Formatted:
- Four digits beginning with 0, 1, 2, 3, 6, 7, or 8
- A hyphen
- Four digits
- A hyphen
- A digit
Unformatted:
- Nine consecutive digits beginning with 0, 1, 2, 3, 6, 7, or 8
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_aba_routing
finds content that matches the pattern. - A keyword from Keyword_ABA_Routing is found.
<!-- ABA Routing Number -->
<Entity id="cb353f78-2b72-4c3c-8827-92ebe4f69fdf" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_aba_routing" />
<Match idRef="Keyword_ABA_Routing" />
</Pattern>
</Entity>
Keywords
Keyword_ABA_Routing
- aba
- aba #
- aba routing #
- aba routing number
- aba#
- abarouting#
- aba number
- abaroutingnumber
- american bank association routing #
- american bank association routing number
- americanbankassociationrouting#
- americanbankassociationroutingnumber
- bank routing number
- bankrouting#
- bankroutingnumber
- routing transit number
- RTN
Australia bank account number
Format
6-10 digits with or without a bank state branch number
Pattern
Account number is 6-10 digits.
Australia bank state branch number:
- Three digits
- A hyphen
- Three digits
Checksum
No
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_australia_bank_account_number
finds content that matches the pattern.. - A keyword from Keyword_australia_bank_account_number is found.
- The regular expression
Regex_australia_bank_account_number_bsb
finds content that matches the pattern.
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_australia_bank_account_number
finds content that matches the pattern.. - A keyword from Keyword_australia_bank_account_number is found.
<!-- Australia Bank Account Number -->
<Entity id="74a54de9-2a30-4aa0-a8aa-3d9327fc07c7" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="85">
<IdMatch idRef="Regex_australia_bank_account_number" />
<Match idRef="Keyword_australia_bank_account_number" />
<Match idRef="Regex_australia_bank_account_number_bsb" />
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_australia_bank_account_number" />
<Match idRef="Keyword_australia_bank_account_number" />
</Pattern>
</Entity>
Keywords
Keyword_australia_bank_account_number
- swift bank code
- correspondent bank
- base currency
- usa account
- holder address
- bank address
- information account
- fund transfers
- bank charges
- bank details
- banking information
- full names
- iaea
Australia driver's license number
Format
Nine letters and digits
Pattern
Nine letters and digits:
- Two digits or letters (not case sensitive)
- Two digits
- Five digits or letters (not case sensitive)
OR
- 1-2 optional letters (not case sensitive)
- 4-9 digits
OR
- Nine digits or letters (not case sensitive)
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_australia_drivers_license_number
finds content that matches the pattern. - A keyword from Keyword_australia_drivers_license_number is found.
- No keyword from Keyword_australia_drivers_license_number_exclusions is found.
<!-- Australia Drivers License Number -->
<Entity id="1cbbc8f5-9216-4392-9eb5-5ac2298d1356" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_australia_drivers_license_number" />
<Match idRef="Keyword_australia_drivers_license_number" />
<Any minMatches="0" maxMatches="0">
<Match idRef="Keyword_australia_drivers_license_number_exclusions" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_australia_drivers_license_number
- international driving permits
- australian automobile association
- international driving permit
- DriverLicence
- DriverLicences
- Driver Lic
- Driver Licence
- Driver Licences
- DriversLic
- DriversLicence
- DriversLicences
- Drivers Lic
- Drivers Lics
- Drivers Licence
- Drivers Licences
- Driver'Lic
- Driver'Lics
- Driver'Licence
- Driver'Licences
- Driver' Lic
- Driver' Lics
- Driver' Licence
- Driver' Licences
- Driver'sLic
- Driver'sLics
- Driver'sLicence
- Driver'sLicences
- Driver's Lic
- Driver's Lics
- Driver's Licence
- Driver's Licences
- DriverLic#
- DriverLics#
- DriverLicence#
- DriverLicences#
- Driver Lic#
- Driver Lics#
- Driver Licence#
- Driver Licences#
- DriversLic#
- DriversLics#
- DriversLicence#
- DriversLicences#
- Drivers Lic#
- Drivers Lics#
- Drivers Licence#
- Drivers Licences#
- Driver'Lic#
- Driver'Lics#
- Driver'Licence#
- Driver'Licences#
- Driver' Lic#
- Driver' Lics#
- Driver' Licence#
- Driver' Licences#
- Driver'sLic#
- Driver'sLics#
- Driver'sLicence#
- Driver'sLicences#
- Driver's Lic#
- Driver's Lics#
- Driver's Licence#
- Driver's Licences#
Keyword_australia_drivers_license_number_exclusions
- aaa
- DriverLicense
- DriverLicenses
- Driver License
- Driver Licenses
- DriversLicense
- DriversLicenses
- Drivers License
- Drivers Licenses
- Driver'License
- Driver'Licenses
- Driver' License
- Driver' Licenses
- Driver'sLicense
- Driver'sLicenses
- Driver's License
- Driver's Licenses
- DriverLicense#
- DriverLicenses#
- Driver License#
- Driver Licenses#
- DriversLicense#
- DriversLicenses#
- Drivers License#
- Drivers Licenses#
- Driver'License#
- Driver'Licenses#
- Driver' License#
- Driver' Licenses#
- Driver'sLicense#
- Driver'sLicenses#
- Driver's License#
- Driver's Licenses#
Australia Medical Account Number
Format
10-11 digits
Pattern
10-11 digits:
- First digit is in the range 2-6
- Ninth digit is a check digit
- 10th digit is the issue digit
- 11th digit (optional) is the individual number
Checksum
Yes
Definition
A DLP policy is 95% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_australian_medical_account_number
finds content that matches the pattern. - A keyword from Keyword_Australia_Medical_Account_Number is found.
- The checksum passes.
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_australian_medical_account_number
finds content that matches the pattern. - The checksum passes.
<!-- Australia Medical Account Number -->
<Entity id="104a99a0-3d3b-4542-a40d-ab0b9e1efe63" recommendedConfidence="85" patternsProximity="300">
<Pattern confidenceLevel="95">
<IdMatch idRef="Func_australian_medical_account_number"/>
<Any minMatches="1">
<Match idRef="Keyword_Australia_Medical_Account_Number"/>
</Any>
</Pattern>
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_australian_medical_account_number"/>
<Any minMatches="0" maxMatches="0">
<Match idRef="Keyword_Australia_Medical_Account_Number"/>
</Any>
</Pattern>
</Entity>
Keywords
Keyword_Australia_Medical_Account_Number
- bank account details
- medicare payments
- mortgage account
- bank payments
- information branch
- credit card loan
- department of human services
- local service
- medicare
Australia Passport Number
Format
A letter followed by seven digits
Pattern
A letter (not case sensitive) followed by seven digits
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_australia_passport_number
finds content that matches the pattern. - A keyword from Keyword_passport or Keyword_australia_passport_number is found.
<!-- Australia Passport Number -->
<Entity id="29869db6-602d-4853-ab93-3484f905df50" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_australia_passport_number" />
<Any minMatches="1">
<Match idRef="Keyword_passport" />
<Match idRef="Keyword_australia_passport_number" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_passport
- Passport Number
- Passport No
- Passport #
- Passport#
- PassportID
- Passportno
- passportnumber
- パスポート
- パスポート番号
- パスポートのNum
- パスポート #
- Numéro de passeport
- Passeport n °
- Passeport Non
- Passeport #
- Passeport#
- PasseportNon
- Passeportn °
Keyword_australia_passport_number
- passport
- passport details
- immigration and citizenship
- commonwealth of australia
- department of immigration
- residential address
- department of immigration and citizenship
- visa
- national identity card
- passport number
- travel document
- issuing authority
Australia Tax File Number
Format
8-9 digits
Pattern
8-9 digits typically presented with spaces as follows:
- Three digits
- An optional space
- Three digits
- An optional space
- 2-3 digits where the last digit is a check digit
Checksum
Yes
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_australian_tax_file_number
finds content that matches the pattern. - No keyword from Keyword_Australia_Tax_File_Number or Keyword_number_exclusions is found.
- The checksum passes.
<!-- Australia Tax File Number -->
<Entity id="e29bc95f-ff70-4a37-aa01-04d17360a4c5" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="95">
<IdMatch idRef="Func_australian_tax_file_number" />
<Any minMatches="1">
<Match idRef="Keyword_Australia_Tax_File_Number" />
</Any>
<Any minMatches="0" maxMatches="0">
<Match idRef="Keyword_number_exclusions" />
</Any>
</Pattern>
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_australian_tax_file_number" />
<Any minMatches="0" maxMatches="0">
<Match idRef="Keyword_Australia_Tax_File_Number" />
<Match idRef="Keyword_number_exclusions" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_Australia_Tax_File_Number
- australian business number
- marginal tax rate
- medicare levy
- portfolio number
- service veterans
- withholding tax
- individual tax return
- tax file number
Keyword_number_exclusions
- 00000000
- 11111111
- 22222222
- 33333333
- 44444444
- 55555555
- 66666666
- 77777777
- 88888888
- 99999999
- 000000000
- 111111111
- 222222222
- 333333333
- 444444444
- 555555555
- 666666666
- 777777777
- 888888888
- 999999999
- 0000000000
- 1111111111
- 2222222222
- 3333333333
- 4444444444
- 5555555555
- 6666666666
- 7777777777
- 8888888888
- 9999999999
Canada Bank Account Number
Format
Seven or 12 digits
Pattern
A Canada Bank Account Number is 7 or 12 digits.
A Canada bank account transit number is:
- Five digits
- A hyphen
- Three digits OR
- A zero "0"
- Eight digits
Checksum
No
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_canada_bank_account_number
finds content that matches the pattern. - A keyword from Keyword_canada_bank_account_number is found.
- The regular expression
Regex_canada_bank_account_transit_number
finds content that matches the pattern.
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_canada_bank_account_number
finds content that matches the pattern. - A keyword from Keyword_canada_bank_account_number is found.
<!-- Canada Bank Account Number -->
<Entity id="552e814c-cb50-4d94-bbaa-bb1d1ffb34de" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="85">
<IdMatch idRef="Regex_canada_bank_account_number" />
<Match idRef="Keyword_canada_bank_account_number" />
<Match idRef="Regex_canada_bank_account_transit_number" />
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_canada_bank_account_number" />
<Match idRef="Keyword_canada_bank_account_number" />
</Pattern>
</Entity>
Keywords
Keyword_canada_bank_account_number
- canada savings bonds
- canada revenue agency
- canadian financial institution
- direct deposit form
- canadian citizen
- legal representative
- notary public
- commissioner for oaths
- child care benefit
- universal child care
- canada child tax benefit
- income tax benefit
- harmonized sales tax
- social insurance number
- income tax refund
- child tax benefit
- territorial payments
- institution number
- deposit request
- banking information
- direct deposit
Canada driver's license number
Format
Varies by province
Pattern
Various patterns covering Alberta, British Columbia, Manitoba, New Brunswick, Newfoundland/Labrador, Nova Scotia, Ontario, Prince Edward Island, Quebec, and Saskatchewan
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_[province_name]_drivers_license_number
finds content that matches the pattern. - A keyword from Keyword_[province_name]_drivers_license_name is found.
- A keyword from Keyword_canada_drivers_license is found.
<!-- Canada Driver's License Number -->
<Entity id="37186abb-8e48-4800-ad3c-e3d1610b3db0" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_alberta_drivers_license_number" />
<Match idRef="Keyword_alberta_drivers_license_name" />
<Match idRef="Keyword_canada_drivers_license" />
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_british_columbia_drivers_license_number" />
<Match idRef="Keyword_british_columbia_drivers_license_name" />
<Match idRef="Keyword_canada_drivers_license" />
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_manitoba_drivers_license_number" />
<Match idRef="Keyword_manitoba_drivers_license_name" />
<Match idRef="Keyword_canada_drivers_license" />
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_new_brunswick_drivers_license_number" />
<Match idRef="Keyword_new_brunswick_drivers_license_name" />
<Match idRef="Keyword_canada_drivers_license" />
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_newfoundland_labrador_drivers_license_number" />
<Match idRef="Keyword_newfoundland_labrador_drivers_license_name" />
<Match idRef="Keyword_canada_drivers_license" />
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_nova_scotia_drivers_license_number" />
<Match idRef="Keyword_nova_scotia_drivers_license_name" />
<Match idRef="Keyword_canada_drivers_license" />
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_ontario_drivers_license_number" />
<Match idRef="Keyword_ontario_drivers_license_name" />
<Match idRef="Keyword_canada_drivers_license" />
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_prince_edward_island_drivers_license_number" />
<Match idRef="Keyword_prince_edward_island_drivers_license_name" />
<Match idRef="Keyword_canada_drivers_license" />
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_quebec_drivers_license_number" />
<Match idRef="Keyword_quebec_drivers_license_name" />
<Match idRef="Keyword_canada_drivers_license" />
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_saskatchewan_drivers_license_number" />
<Match idRef="Keyword_saskatchewan_drivers_license_name" />
<Match idRef="Keyword_canada_drivers_license" />
</Pattern>
</Entity>
Keywords
Keyword_[province_name]_drivers_license_name
- The province abbreviation, for example AB
- The province name, for example Alberta
Keyword_canada_drivers_license
- DL
- DLS
- CDL
- CDLS
- DriverLic
- DriverLics
- DriverLicense
- DriverLicenses
- DriverLicence
- DriverLicences
- Driver Lic
- Driver Lics
- Driver License
- Driver Licenses
- Driver Licence
- Driver Licences
- DriversLic
- DriversLics
- DriversLicence
- DriversLicences
- DriversLicense
- DriversLicenses
- Drivers Lic
- Drivers Lics
- Drivers License
- Drivers Licenses
- Drivers Licence
- Drivers Licences
- Driver'Lic
- Driver'Lics
- Driver'License
- Driver'Licenses
- Driver'Licence
- Driver'Licences
- Driver' Lic
- Driver' Lics
- Driver' License
- Driver' Licenses
- Driver' Licence
- Driver' Licences
- Driver'sLic
- Driver'sLics
- Driver'sLicense
- Driver'sLicenses
- Driver'sLicence
- Driver'sLicences
- Driver's Lic
- Driver's Lics
- Driver's License
- Driver's Licenses
- Driver's Licence
- Driver's Licences
- Permis de Conduire
- id
- ids
- idcard number
- idcard numbers
- idcard #
- idcard #s
- idcard card
- idcard cards
- idcard
- identification number
- identification numbers
- identification #
- identification #s
- identification card
- identification cards
- identification
- DL#
- DLS#
- CDL#
- CDLS#
- DriverLic#
- DriverLics#
- DriverLicense#
- DriverLicenses#
- DriverLicence#
- DriverLicences#
- Driver Lic#
- Driver Lics#
- Driver License#
- Driver Licenses#
- Driver License#
- Driver Licences#
- DriversLic#
- DriversLics#
- DriversLicense#
- DriversLicenses#
- DriversLicence#
- DriversLicences#
- Drivers Lic#
- Drivers Lics#
- Drivers License#
- Drivers Licenses#
- Drivers Licence#
- Drivers Licences#
- Driver'Lic#
- Driver'Lics#
- Driver'License#
- Driver'Licenses#
- Driver'Licence#
- Driver'Licences#
- Driver' Lic#
- Driver' Lics#
- Driver' License#
- Driver' Licenses#
- Driver' Licence#
- Driver' Licences#
- Driver'sLic#
- Driver'sLics#
- Driver'sLicense#
- Driver'sLicenses#
- Driver'sLicence#
- Driver'sLicences#
- Driver's Lic#
- Driver's Lics#
- Driver's License#
- Driver's Licenses#
- Driver's Licence#
- Driver's Licences#
- Permis de Conduire#
- id#
- ids#
- idcard card#
- idcard cards#
- idcard#
- identification card#
- identification cards#
- identification#
Canada Health Service Number
Format
10 digits
Pattern
10 digits
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_canada_health_service_number
finds content that matches the pattern. - A keyword from Keyword_canada_health_service_number is found.
<!-- Canada Health Service Number -->
<Entity id="59c0bf39-7fab-482c-af25-00faa4384c94" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_canada_health_service_number" />
<Any minMatches="1">
<Match idRef="Keyword_canada_health_service_number" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_canada_health_service_number
- personal health number
- patient information
- health services
- speciality services
- automobile accident
- patient hospital
- psychiatrist
- workers compensation
- disability
Canada passport number
Format
Two uppercase letters followed by six digits
Pattern
Two uppercase letters followed by six digits
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_canada_passport_number
finds content that matches the pattern. - A keyword from Keyword_canada_passport_number or Keyword_passport is found.
<!-- Canada Passport Number -->
<Entity id="14d0db8b-498a-43ed-9fca-f6097ae687eb" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_canada_passport_number" />
<Any minMatches="1">
<Match idRef="Keyword_canada_passport_number" />
<Match idRef="Keyword_passport" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_canada_passport_number
- canadian citizenship
- canadian passport
- passport application
- passport photos
- certified translator
- canadian citizens
- processing times
- renewal application
Keyword_passport
- Passport Number
- Passport No
- Passport #
- Passport#
- PassportID
- Passportno
- passportnumber
- パスポート
- パスポート番号
- パスポートのNum
- パスポート#
- Numéro de passeport
- Passeport n °
- Passeport Non
- Passeport #
- Passeport#
- PasseportNon
- Passeportn °
Canada Personal Health Identification Number (PHIN)
Format
Nine digits
Pattern
Nine digits
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_canada_phin
finds content that matches the pattern. - At least two keywords from Keyword_canada_phin or Keyword_canada_provinces are found.
<!-- Canada PHIN -->
<Entity id="722e12ac-c89a-4ec8-a1b7-fea3469f89db" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_canada_phin" />
<Any minMatches="2">
<Match idRef="Keyword_canada_phin" />
<Match idRef="Keyword_canada_provinces" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_canada_phin
- social insurance number
- health information act
- income tax information
- manitoba health
- health registration
- prescription purchases
- benefit eligibility
- personal health
- power of attorney
- registration number
- personal health number
- practitioner referral
- wellness professional
- patient referral
- health and wellness
Keyword_canada_provinces
- Nunavut
- Quebec
- Northwest Territories
- Ontario
- British Columbia
- Alberta
- Saskatchewan
- Manitoba
- Yukon
- Newfoundland and Labrador
- New Brunswick
- Nova Scotia
- Prince Edward Island
- Canada
Canada Social Insurance Number
Format
Nine digits with optional hyphens or spaces
Pattern
Formatted:
- Three digits
- A hyphen or space
- Three digits
- A hyphen or space
- Three digits
Unformatted:
- Nine digits
Checksum
Yes
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function
Func_canadian_sin
finds content that matches the pattern.At least two of any combination of the following items:
- A keyword from Keyword_sin is found.
- A keyword from Keyword_sin_collaborative is found.
- The function
Func_eu_date
finds a date in the right date format.
The checksum passes.
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_unformatted_canadian_sin
finds content that matches the pattern. - A keyword from Keyword_sin is found.
- The checksum passes.
<!-- Canada Social Insurance Number -->
<Entity id="a2f29c85-ecb8-4514-a610-364790c0773e" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_canadian_sin" />
<Any minMatches="2">
<Match idRef="Keyword_sin" />
<Match idRef="Keyword_sin_collaborative" />
<Match idRef="Func_eu_date" />
</Any>
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_unformatted_canadian_sin" />
<Match idRef="Keyword_sin" />
</Pattern>
</Entity>
Keywords
Keyword_sin
- sin
- social insurance
- numero d'assurance sociale
- sins
- ssn
- ssns
- social security
- numero d'assurance social
- national identification number
- national id
- sin#
- soc ins
- social ins
Keyword_sin_collaborative
- driver's license
- drivers license
- driver's licence
- drivers licence
- DOB
- Birthdate
- Birthday
- Date of Birth
Drug Enforcement Agency (DEA) Number
Format
Two letters followed by seven digits
Pattern
Pattern must include all of the following items:
- One letter (not case sensitive) from this set of possible letters: abcdefghjklmnprstux, which is a registrant code
- One letter (not case sensitive), which is the first letter of the registrant's last name
- Seven digits, the last of which is the check digit
Checksum
Yes
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_dea_number
finds content that matches the pattern. - The checksum passes.
<!-- DEA Number -->
<Entity id="9a5445ad-406e-43eb-8bd7-cac17ab6d0e4" recommendedConfidence="85" patternsProximity="300">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_dea_number"/>
</Pattern>
</Entity>
Keywords
None
EU Debit Card Number
Format
16 digits
Pattern
Complex and robust pattern
Checksum
Yes
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function
Func_eu_debit_card
finds content that matches the pattern.At least one of the following statements is true:
- A keyword from Keyword_eu_debit_card is found.
- A keyword from Keyword_card_terms_dict is found.
- A keyword from Keyword_card_security_terms_dict is found.
- A keyword from Keyword_card_expiration_terms_dict is found.
- The function
Func_expiration_date
finds a date in the right date format.
The checksum passes.
<!-- EU Debit Card Number -->
<Entity id="0e9b3178-9678-47dd-a509-37222ca96b42" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_eu_debit_card" />
<Any minMatches="1">
<Match idRef="Keyword_eu_debit_card" />
<Match idRef="Keyword_card_terms_dict" />
<Match idRef="Keyword_card_security_terms_dict" />
<Match idRef="Keyword_card_expiration_terms_dict" />
<Match idRef="Func_expiration_date" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_eu_debit_card
- account number
- card number
- card no.
- security number
- cc#
Keyword_card_terms_dict
- acct nbr
- acct num
- acct no
- american express
- americanexpress
- americano espresso
- amex
- atm card
- atm cards
- atm kaart
- atmcard
- atmcards
- atmkaart
- atmkaarten
- bancontact
- bank card
- bankkaart
- card holder
- card holders
- card num
- card number
- card numbers
- card type
- cardano numerico
- cardholder
- cardholders
- cardnumber
- cardnumbers
- carta bianca
- carta credito
- carta di credito
- cartao de credito
- cartao de crédito
- cartao de debito
- cartao de débito
- carte bancaire
- carte blanche
- carte bleue
- carte de credit
- carte de crédit
- carte di credito
- carteblanche
- cartão de credito
- cartão de crédito
- cartão de debito
- cartão de débito
- cb
- ccn
- check card
- check cards
- checkcard
- checkcards
- chequekaart
- cirrus
- cirrus-edc-maestro
- controlekaart
- controlekaarten
- credit card
- credit cards
- creditcard
- creditcards
- debetkaart
- debetkaarten
- debit card
- debit cards
- debitcard
- debitcards
- debito automatico
- diners club
- dinersclub
- discover
- discover card
- discover cards
- discovercard
- discovercards
- débito automático
- edc
- eigentümername
- european debit card
- hoofdkaart
- hoofdkaarten
- in viaggio
- japanese card bureau
- japanse kaartdienst
- jcb
- kaart
- kaart num
- kaartaantal
- kaartaantallen
- kaarthouder
- kaarthouders
- karte
- karteninhaber
- karteninhabers
- kartennr
- kartennummer
- kreditkarte
- kreditkarten-nummer
- kreditkarteninhaber
- kreditkarteninstitut
- kreditkartennummer
- kreditkartentyp
- maestro
- master card
- master cards
- mastercard
- mastercards
- mc
- mister cash
- n carta
- carta
- no de tarjeta
- no do cartao
- no do cartão
- no. de tarjeta
- no. do cartao
- no. do cartão
- nr carta
- nr. carta
- numeri di scheda
- numero carta
- numero de cartao
- numero de carte
- numero de cartão
- numero de tarjeta
- numero della carta
- numero di carta
- numero di scheda
- numero do cartao
- numero do cartão
- numéro de carte
- nº carta
- nº de carte
- nº de la carte
- nº de tarjeta
- nº do cartao
- nº do cartão
- nº. do cartão
- número de cartao
- número de cartão
- número de tarjeta
- número do cartao
- scheda dell'assegno
- scheda dell'atmosfera
- scheda dell'atmosfera
- scheda della banca
- scheda di controllo
- scheda di debito
- scheda matrice
- schede dell'atmosfera
- schede di controllo
- schede di debito
- schede matrici
- scoprono la scheda
- scoprono le schede
- solo
- supporti di scheda
- supporto di scheda
- switch
- tarjeta atm
- tarjeta credito
- tarjeta de atm
- tarjeta de credito
- tarjeta de debito
- tarjeta debito
- tarjeta no
- tarjetahabiente
- tipo della scheda
- ufficio giapponese della
- scheda
- v pay
- v-pay
- visa
- visa plus
- visa electron
- visto
- visum
- vpay
Keyword_card_security_terms_dict
- card identification number
- card verification
- cardi la verifica
- cid
- cod seg
- cod seguranca
- cod segurança
- cod sicurezza
- cod. seg
- cod. seguranca
- cod. segurança
- cod. sicurezza
- codice di sicurezza
- codice di verifica
- codigo
- codigo de seguranca
- codigo de segurança
- crittogramma
- cryptogram
- cryptogramme
- cv2
- cvc
- cvc2
- cvn
- cvv
- cvv2
- cód seguranca
- cód segurança
- cód. seguranca
- cód. segurança
- código
- código de seguranca
- código de segurança
- de kaart controle
- geeft nr uit
- issue no
- issue number
- kaartidentificatienummer
- kreditkartenprufnummer
- kreditkartenprüfnummer
- kwestieaantal
- no. dell'edizione
- no. di sicurezza
- numero de securite
- numero de verificacao
- numero dell'edizione
- numero di identificazione della
- scheda
- numero di sicurezza
- numero van veiligheid
- numéro de sécurité
- nº autorizzazione
- número de verificação
- perno il blocco
- pin block
- prufziffer
- prüfziffer
- security code
- security no
- security number
- sicherheits kode
- sicherheitscode
- sicherheitsnummer
- speldblok
- veiligheid nr
- veiligheidsaantal
- veiligheidscode
- veiligheidsnummer
- verfalldatum
Keyword_card_expiration_terms_dict
- ablauf
- data de expiracao
- data de expiração
- data del exp
- data di exp
- data di scadenza
- data em que expira
- data scad
- data scadenza
- date de validité
- datum afloop
- datum van exp
- de afloop
- espira
- espira
- exp date
- exp datum
- expiration
- expire
- expires
- expiry
- fecha de expiracion
- fecha de venc
- gultig bis
- gultigkeitsdatum
- gültig bis
- gültigkeitsdatum
- la scadenza
- scadenza
- valable
- validade
- valido hasta
- valor
- venc
- vencimento
- vencimiento
- verloopt
- vervaldag
- vervaldatum
- vto
- válido hasta
Finland National ID
This sensitive information type entity is included in the EU National Identification Number sensitive information type and is available as a stand-alone sensitive information type entity.
Format
Six digits plus a character indicating a century plus three digits plus a check digit
Pattern
Pattern must include all of the items:
- Six digits in the format DDMMYY, which are a date of birth
- Century marker (either '-', '+' or 'a')
- Three-digit personal identification number
- A digit or letter (case insensitive), which is a check digit
Checksum
Yes
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_finnish_national_id
finds content that matches the pattern. - A keyword from Keyword_finnish_national_id is found.
- The checksum passes.
<!-- Finnish National ID-->
<Entity id="338FD995-4CB5-4F87-AD35-79BD1DD926C1" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_finnish_national_id" />
<Match idRef="Keyword_finnish_national_id" />
</Pattern>
</Entity>
Keywords
Keyword_finnish_national_id
- ainutlaatuinen henkilökohtainen tunnus
- henkilökohtainen tunnus
- henkilötunnus
- henkilötunnusnumero#
- henkilötunnusnumero
- hetu
- id no
- id number
- identification number
- identiteetti numero
- identity number
- idnumber
- kansallinen henkilötunnus
- kansallisen henkilökortin
- national id card
- national id no.
- personal id
- personal identity code
- personalidnumber#
- personbeteckning
- personnummer
- social security number
- sosiaaliturvatunnus
- tax id
- tax identification no
- tax identification number
- tax no#
- tax no
- tax number
- tax registration number
- taxid#
- taxidno#
- taxidnumber#
- taxno#
- taxnumber#
- taxnumber
- tin id
- tin no
- tin#
- tunnistenumero
- tunnus numero
- tunnusluku
- tunnusnumero
- verokortti
- veronumero
- verotunniste
- verotunnus
France Driver's License Number
Format
12 digits
Pattern
12 digits with validation to discount similar patterns such as French telephone numbers
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function
Func_french_drivers_license
finds content that matches the pattern.At least one of the following statements is true:
- A keyword from Keyword_french_drivers_license is found.
- The function
Func_eu_date
finds a date in the right date format.
<!-- France Driver's License Number -->
<Entity id="18e55a36-a01b-4b0f-943d-dc10282a1824" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_french_drivers_license" />
<Any minMatches="1">
<Match idRef="Keyword_french_drivers_license" />
<Match idRef="Func_eu_date" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_french_drivers_license
- drivers licence
- drivers license
- driving licence
- driving license
- permis de conduire
- licence number
- license number
- licence numbers
- license numbers
France National ID Card (CNI)
Format
12 digits
Pattern
12 digits
Checksum
No
Definition
A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_france_cni
finds content that matches the pattern.
<!-- France CNI -->
<Entity id="f741ac74-1bc0-4665-b69b-f0c7f927c0c4" patternsProximity="300" recommendedConfidence="65">
<Pattern confidenceLevel="65">
<IdMatch idRef="Regex_france_cni" />
</Pattern>
</Entity>
Keywords
None
France Passport Number
Format
Nine digits and letters
Pattern
Nine digits and letters:
- Two digits
- Two letters (not case sensitive)
- Five digits
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_fr_passport
finds content that matches the pattern. - A keyword from Keyword_passport is found.
<!-- France Passport Number -->
<Entity id="3008b884-8c8c-4cd8-a289-99f34fc7ff5d" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_fr_passport" />
<Match idRef="Keyword_passport" />
</Pattern>
</Entity>
Keywords
Keyword_passport
- Passport Number
- Passport No
- Passport #
- Passport#
- PassportID
- Passportno
- passportnumber
- パスポート
- パスポート番号
- パスポートのNum
- パスポート #
- Numéro de passeport
- Passeport n °
- Passeport Non
- Passeport #
- Passeport#
- PasseportNon
- Passeportn °
France Social Security Number (INSEE)
Format
15 digits
Pattern
Must match one of two patterns:
- 13 digits followed by a space followed by two digits or
- 15 consecutive digits
Checksum
Yes
Definition
A DLP policy is 95% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_french_insee
orFunc_fr_insee
finds content that matches the pattern. - A keyword from Keyword_fr_insee is found.
- The checksum passes.
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_french_insee
orFunc_fr_insee
finds content that matches the pattern. - No keyword from Keyword_fr_insee is found.
- The checksum passes.
<!-- France INSEE -->
<Entity id="71f62b97-efe0-4aa1-aa49-e14de253619d" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="95">
<IdMatch idRef="Func_french_insee" />
<Match idRef="Func_fr_insee" />
<Any minMatches="1">
<Match idRef="Keyword_fr_insee" />
</Any>
</Pattern>
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_french_insee" />
<Match idRef="Func_fr_insee" />
<Any minMatches="0" maxMatches="0">
<Match idRef="Keyword_fr_insee" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_fr_insee
- insee
- securité sociale
- securite sociale
- national id
- national identification
- numéro d'identité
- no d'identité
- no. d'identité
- numero d'identite
- no d'identite
- no. d'identite
- social security number
- social security code
- social insurance number
- le numéro d'identification nationale
- d'identité nationale
- numéro de sécurité sociale
- le code de la sécurité sociale
- numéro d'assurance sociale
- numéro de sécu
- code sécu
German Driver's License Number
Format
Combination of 11 digits and letters
Pattern
11 digits and letters (not case sensitive):
- A digit or letter
- Two digits
- Six digits or letters
- A digit
- A digit or letter
Checksum
Yes
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function
Func_german_drivers_license
finds content that matches the pattern.At least one of the following statements is true:
- A keyword from Keyword_german_drivers_license_number is found.
- A keyword from Keyword_german_drivers_license_collaborative is found.
- A keyword from Keyword_german_drivers_license is found.
The checksum passes.
<!-- Germany Driver's License Number -->
<Entity id="91da9335-1edb-45b7-a95f-5fe41a16c63c" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_german_drivers_license" />
<Any minMatches="1">
<Match idRef="Keyword_german_drivers_license_number" />
<Match idRef="Keyword_german_drivers_license_collaborative" />
<Match idRef="Keyword_german_drivers_license" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_german_drivers_license_number
- Führerschein
- Fuhrerschein
- Fuehrerschein
- Führerscheinnummer
- Fuhrerscheinnummer
- Fuehrerscheinnummer
- Führerschein-
- Fuhrerschein-
- Fuehrerschein-
- FührerscheinnummerNr
- FuhrerscheinnummerNr
- FuehrerscheinnummerNr
- FührerscheinnummerKlasse
- FuhrerscheinnummerKlasse
- FuehrerscheinnummerKlasse
- Führerschein- Nr
- Fuhrerschein- Nr
- Fuehrerschein- Nr
- Führerschein- Klasse
- Fuhrerschein- Klasse
- Fuehrerschein- Klasse
- FührerscheinnummerNr
- FuhrerscheinnummerNr
- FuehrerscheinnummerNr
- FührerscheinnummerKlasse
- FuhrerscheinnummerKlasse
- FuehrerscheinnummerKlasse
- Führerschein- Nr
- Fuhrerschein- Nr
- Fuehrerschein- Nr
- Führerschein- Klasse
- Fuhrerschein- Klasse
- Fuehrerschein- Klasse
- DL
- DLS
- Driv Lic
- Driv Licen
- Driv License
- Driv Licenses
- Driv Licence
- Driv Licences
- Driv Lic
- Driver Licen
- Driver License
- Driver Licenses
- Driver Licence
- Driver Licences
- Drivers Lic
- Drivers Licen
- Drivers License
- Drivers Licenses
- Drivers Licence
- Drivers Licences
- Driver's Lic
- Driver's Licen
- Driver's License
- Driver's Licenses
- Driver's Licence
- Driver's Licences
- Driving Lic
- Driving Licen
- Driving License
- Driving Licenses
- Driving Licence
- Driving Licences
Keyword_german_drivers_license_collaborative
- Nr-Führerschein
- Nr-Fuhrerschein
- Nr-Fuehrerschein
- No-Führerschein
- No-Fuhrerschein
- No-Fuehrerschein
- N-Führerschein
- N-Fuhrerschein
- N-Fuehrerschein
- Nr-Führerschein
- Nr-Fuhrerschein
- Nr-Fuehrerschein
- No-Führerschein
- No-Fuhrerschein
- No-Fuehrerschein
- N-Führerschein
- N-Fuhrerschein
- N-Fuehrerschein
Keyword_german_drivers_license
- ausstellungsdatum
- ausstellungsort
- ausstellende behöde
- ausstellende behorde
- ausstellende behoerde
German Passport Number
Format
10 digits or letters
Pattern
Pattern must include all of the items:
- First character is a digit or a letter from this set (C, F, G, H, J, K)
- Three digits
- Five digits or letters from this set (C, -H, J-N, P, R, T, V-Z)
- A digit
Checksum
Yes
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_german_passport
finds content that matches the pattern. - A keyword from any of the five keyword lists is found.
- The checksum passes.
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_german_passport_data
finds content that matches the pattern. - A keyword from any of the five keyword lists is found.
- The checksum passes.
<!-- Germany Passport Number -->
<Entity id="2e3da144-d42b-47ed-b123-fbf78604e52c" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_german_passport" />
<Any minMatches="1">
<Match idRef="Keyword_german_passport" />
<Match idRef="Keyword_german_passport_collaborative" />
<Match idRef="Keyword_german_passport_number" />
<Match idRef="Keyword_german_passport1" />
<Match idRef="Keyword_german_passport2" />
</Any>
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_german_passport_data" />
<Any minMatches="1">
<Match idRef="Keyword_german_passport" />
<Match idRef="Keyword_german_passport_collaborative" />
<Match idRef="Keyword_german_passport_number" />
<Match idRef="Keyword_german_passport1" />
<Match idRef="Keyword_german_passport2" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_german_passport
- reisepass
- reisepasse
- reisepassnummer
- passport
- passports
Keyword_german_passport_collaborative
- geburtsdatum
- ausstellungsdatum
- ausstellungsort
Keyword_german_passport_number
No-Reisepass Nr-Reisepass
Keyword_german_passport1
Reisepass-Nr
Keyword_german_passport2
bnationalit.t
International banking account number (IBAN)
Format
Country code (two letters) plus check digits (two digits) plus bban number (up to 30 characters)
Pattern
Pattern must include all of the items:
- Two-letter country code
- Two check digits (followed by an optional space)
- 1-7 groups of four letters or digits (can be separated by spaces)
- 1-3 letters or digits
The format for each country is slightly different. The IBAN sensitive information type covers these 60 countries:
ad, ae, al, at, az, ba, be, bg, bh, ch, cr, cy, cz, de, dk, do, ee, es, fi, fo, fr, gb, ge, gi, gl, gr, hr, hu, ie, il, is, it, kw, kz, lb, li, lt, lu, lv, mc, md, me, mk, mr, mt, mu, nl, no, pl, pt, ro, rs, sa, se, si, sk, sm, tn, tr, vg
Checksum
Yes
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_iban
finds content that matches the pattern. - The checksum passes.
<Entity id="e7dc4711-11b7-4cb0-b88b-2c394a771f0e" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_iban" />
</Pattern>
</Entity>
Keywords
None
IP address
Format
IPv4:
- Complex pattern, which accounts for formatted (periods) and unformatted (no periods) versions of the IPv4 addresses.
IPv6:
- Complex pattern, which accounts for formatted IPv6 numbers (which include colons).
Pattern
n/a
Checksum
No
Definition
For IPv6, a DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_ipv6_address
finds content that matches the pattern. - No keyword from Keyword_ipaddress is found.
For IPv4, a DLP policy is 95% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_ipv4_address
finds content that matches the pattern. - A keyword from Keyword_ipaddress is found.
For IPv6, a DLP policy is 95% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_ipv6_address
finds content that matches the pattern. - No keyword from Keyword_ipaddress is found.
<!-- IP Address -->
<Entity id="1daa4ad5-e2dd-4ca4-a788-54722c09efb2" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="85">
<IdMatch idRef="Regex_ipv6_address" />
<Any minMatches="0" maxMatches="0">
<Match idRef="Keyword_ipaddress" />
</Any>
</Pattern>
<Pattern confidenceLevel="95">
<IdMatch idRef="Regex_ipv4_address" />
<Any minMatches="1">
<Match idRef="Keyword_ipaddress" />
</Any>
</Pattern>
<Pattern confidenceLevel="95">
<IdMatch idRef="Regex_ipv6_address" />
<Any minMatches="1">
<Match idRef="Keyword_ipaddress" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_ipaddress
- IP (this keyword is case sensitive)
- ip address
- ip addresses
- internet protocol
- IP-כתובת ה
Israel bank account number
Format
13 digits
Pattern
Formatted:
- Two digits
- A dash
- Three digits
- A dash
- Eight digits
Unformatted:
- 13 consecutive digits
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression Regex_israel_bank_account_number finds content that matches the pattern.
- A keyword from Keyword_israel_bank_account_number is found.
<!-- Israel Bank Account Number -->
<Entity id="7d08b2ff-a0b9-437f-957c-aeddbf9b2b25" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_israel_bank_account_number" />
<Any minMatches="1">
<Match idRef="Keyword_israel_bank_account_number" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_israel_bank_account_number
- Bank Account Number
- Bank Account
- Account Number
- מספר חשבון בנק
Israel National ID
Format
Nine digits
Pattern
Nine consecutive digits
Checksum
Yes
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_israeli_national_id_number
finds content that matches the pattern. - A keyword from Keyword_Israel_National_ID is found.
- The checksum passes.
<!-- Israel National ID Number -->
<Entity id="e05881f5-1db1-418c-89aa-a3ac5c5277ee" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_israeli_national_id_number" />
<Any minMatches="1">
<Match idRef="Keyword_Israel_National_ID" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_Israel_National_ID
- מספר זהות
- National ID Number
Italy Ddriver's License Number
Format
A combination of 10 letters and digits
Pattern
A combination of 10 letters and digits:
- One letter (not case sensitive)
- The letter "A" or "V" (not case sensitive)
- Seven letters (not case sensitive), digits, or the underscore character
- One letter (not case sensitive)
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_italy_drivers_license_number
finds content that matches the pattern. - A keyword from Keyword_italy_drivers_license_number is found.
<!-- Italy Driver's license Number -->
<Entity id="97d6244f-9157-41bd-8e0c-9d669a5c4d71" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_italy_drivers_license_number" />
<Any minMatches="1">
<Match idRef="Keyword_italy_drivers_license_number" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_italy_drivers_license_number
- numero di patente di guida
- patente di guida
Japan Bank Account Number
Format
Seven or eight digits
Pattern
Bank account number:
- Seven or eight digits
Bank account branch code:
- Four digits
- A space or dash (optional)
- Three digits
Checksum
No
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function
Func_jp_bank_account
finds content that matches the pattern.A keyword from Keyword_jp_bank_accountis found.
One of the following statements is true:
- The function
Func_jp_bank_account_branch_code
finds content that matches the pattern. - A keyword from Keyword_jp_bank_branch_code is found.
- The function
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_jp_bank_account
finds content that matches the pattern. - A keyword from Keyword_jp_bank_account is found.
<!-- Japan Bank Account Number -->
<Entity id="d354f95b-96ee-4b80-80bc-4377312b55bc" patternsProximity="300" recommendedConfidence="75">
<Version minEngineVersion="15.01.0131.000">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_jp_bank_account" />
<Match idRef="Keyword_jp_bank_account" />
<Any minMatches="1">
<Match idRef="Func_jp_bank_account_branch_code" />
<Match idRef="Keyword_jp_bank_branch_code" />
</Any>
</Pattern>
</Version>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_jp_bank_account" />
<Match idRef="Keyword_jp_bank_account" />
</Pattern>
</Entity>
Keywords
Keyword_jp_bank_account
- Checking Account Number
- Checking Account
- Checking Account #
- Checking Acct Number
- Checking Acct #
- Checking Acct No.
- Checking Account No.
- Bank Account Number
- Bank Account
- Bank Account #
- Bank Acct Number
- Bank Acct #
- Bank Acct No.
- Bank Account No.
- Savings Account Number
- Savings Account
- Savings Account #
- Savings Acct Number
- Savings Acct #
- Savings Acct No.
- Savings Account No.
- Debit Account Number
- Debit Account
- Debit Account #
- Debit Acct Number
- Debit Acct #
- Debit Acct No.
- Debit Account No.
- 口座番号を当座預金口座の確認
- #アカウントの確認、勘定番号の確認
- #勘定の確認
- 勘定番号の確認
- 口座番号の確認
- 銀行口座番号
- 銀行口座
- 銀行口座#
- 銀行の勘定番号
- 銀行のacct#
- 銀行の勘定いいえ
- 銀行口座番号
- 普通預金口座番号
- 預金口座
- 貯蓄口座#
- 貯蓄勘定の数
- 貯蓄勘定#
- 貯蓄勘定番号
- 普通預金口座番号
- 引き落とし口座番号
- 口座番号
- 口座番号#
- デビットのacct番号
- デビット勘定#
- デビットACCTの番号
- デビット口座番号
Keyword_jp_bank_branch_code
Otemachi
Japan Driver's License Number
Format
12 digits
Pattern
12 consecutive digits
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_jp_drivers_license_number
finds content that matches the pattern. - A keyword from Keyword_jp_drivers_license_number is found.
<!-- Japan Driver's License Number -->
<Entity id="c6011143-d087-451c-8313-7f6d4aed2270" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_jp_drivers_license_number" />
<Match idRef ="Keyword_jp_drivers_license_number" />
</Pattern>
</Entity>
Keywords
Keyword_jp_drivers_license_number
- dl#
- DL#
- dls#
- DLS#
- driver license
- driver licenses
- drivers license
- driver's license
- drivers licenses
- driver's licenses
- driving licence
- lic#
- LIC#
- lics#
- state id
- state identification
- state identification number
- 低所得国#
- 免許証
- 状態ID
- 状態の識別
- 状態の識別番号
- 運転免許
- 運転免許証
- 運転免許証番号
Japan passport number
Format
Two letters followed by seven digits
Pattern
Two letters (not case sensitive) followed by seven digits
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_jp_passport
finds content that matches the pattern. - A keyword from Keyword_jp_passport is found.
<!-- Japan Passport Number -->
<Entity id="75177310-1a09-4613-bf6d-833aae3743f8" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_jp_passport" />
<Match idRef="Keyword_jp_passport" />
</Pattern>
</Entity>
Keywords
Keyword_jp_passport
- パスポート
- パスポート番号
- パスポートのNum
- パスポート#
Japan Resident Registration Number
Format
11 digits
Pattern
11 consecutive digits
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_jp_resident_registration_number
finds content that matches the pattern. - A keyword from Keyword_jp_resident_registration_number is found.
<!-- Japan Resident Registration Number -->
<Entity id="01c1209b-6389-4faf-a5f8-3f7e13899652" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_jp_resident_registration_number" />
<Match idRef ="Keyword_jp_resident_registration_number" />
</Pattern>
</Entity>
Keywords
Keyword_jp_resident_registration_number
- Resident Registration Number
- Resident Register Number
- Residents Basic Registry Number
- Resident Registration No.
- Resident Register No.
- Residents Basic Registry No.
- Basic Resident Register No.
- 住民登録番号、登録番号をレジデント
- 住民基本登録番号、登録番号
- 住民基本レジストリ番号を常駐
- 登録番号を常駐住民基本台帳登録番号
Japan Social Insurance Number (SIN)
Format
7-12 digits
Pattern
7-12 digits:
- Four digits
- A hyphen (optional)
- Six digits
OR
- 7-12 consecutive digits
Checksum
No
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_jp_sin
finds content that matches the pattern. - A keyword from Keyword_jp_sin is found.
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_jp_sin_pre_1997
finds content that matches the pattern. - A keyword from Keyword_jp_sin is found.
<!-- Japan Social Insurance Number -->
<Entity id="c840e719-0896-45bb-84fd-1ed5c95e45ff" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_jp_sin" />
<Match idRef="Keyword_jp_sin" />
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_jp_sin_pre_1997" />
<Match idRef="Keyword_jp_sin" />
</Pattern>
</Entity>
Keywords
Keyword_jp_sin
- Social Insurance No.
- Social Insurance Num
- Social Insurance Number
- 社会保険のテンキー
- 社会保険番号
New Zealand Ministry of Health Number
Format
Three letters, a space (optional), and four digits
Pattern
Three letters (not case sensitive) a space (optional) four digits
Checksum
Yes
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_new_zealand_ministry_of_health_number
finds content that matches the pattern. - A keyword from Keyword_nz_terms is found.
- The checksum passes.
<!-- New Zealand Health Number -->
<Entity id="2b71c1c8-d14e-4430-82dc-fd1ed6bf05c7" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_new_zealand_ministry_of_health_number" />
<Any minMatches="1">
<Match idRef="Keyword_nz_terms" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_nz_terms
- NHI
- New Zealand
- Health
- treatment
Poland Identity Card
Format
Three letters and six digits
Pattern
Three letters (not case sensitive) followed by six digits
Checksum
Yes
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_polish_national_id
finds content that matches the pattern. - A keyword from Keyword_polish_national_id_passport_number is found.
- The checksum passes.
<!-- Poland Identity Card-->
<Entity id="25E64989-ED5D-40CA-A939-6C14183BB7BF" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_polish_national_id" />
<Match idRef="Keyword_polish_national_id_passport_number" />
</Pattern>
</Entity>
Keywords
Keyword_polish_national_id_passport_number
- Dowód osobisty
- Numer dowodu osobistego
- Nazwa i numer dowodu osobistego
- Nazwa i nr dowodu osobistego
- Nazwa i nr dowodu tożsamości
- Dowód Tożsamości
- dow. os.
Poland National ID (PESEL)
Format
11 digits
Pattern
11 consecutive digits
Checksum
Yes
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_pesel_identification_number
finds content that matches the pattern. - A keyword from Keyword_pesel_identification_number is found.
- The checksum passes.
<!-- Poland National ID (PESEL) -->
<Entity id="E3AAF206-4297-412F-9E06-BA8487E22456" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_pesel_identification_number" />
<Match idRef="Keyword_pesel_identification_number" />
</Pattern>
</Entity>
Keywords
Keyword_pesel_identification_number
- Nr PESEL
- PESEL
Poland Passport
Format
Two letters and seven digits
Pattern
Two letters (not case sensitive) followed by seven digits
Checksum
Yes
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_polish_passport_number
finds content that matches the pattern. - A keyword from Keyword_polish_national_id_passport_number is found.
- The checksum passes.
<!-- Poland Passport Number -->
<Entity id="03937FB5-D2B6-4487-B61F-0F8BFF7C3517" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_polish_passport_number" />
<Match idRef="Keyword_polish_national_id_passport_number" />
</Pattern>
</Entity>
</Version>
Keywords
Keyword_polish_national_id_passport_number
- Nazwa i nr dowodu tożsamości
- Dowód Tożsamości
- dow. os.
Saudi Arabia National ID
Format
10 digits
Pattern
10 consecutive digits
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_saudi_arabia_national_id
finds content that matches the pattern. - A keyword from Keyword_saudi_arabia_national_id is found.
<!-- Saudi Arabia National ID -->
<Entity id="8c5a0ba8-404a-41a3-8871-746aa21ee6c0" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_saudi_arabia_national_id" />
<Any minMatches="1">
<Match idRef="Keyword_saudi_arabia_national_id" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_saudi_arabia_national_id
- Identification Card
- I card number
- ID number
- الوطنية الهوية بطاقة رقم
Spain Social Security Number (SSN)
Format
11-12 digits
Pattern
11-12 digits:
- Two digits
- A forward slash (optional)
- 7-8 digits
- A forward slash (optional)
- Two digits
Checksum
Yes
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_spanish_social_security_number
finds content that matches the pattern. - The checksum passes.
<!-- Spain SSN -->
<Entity id="5df987c0-8eae-4bce-ace7-b316347f3070" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_spanish_social_security_number" />
</Pattern>
</Entity>
Keywords
None
Sweden National ID
Format
10 or 12 digits and an optional delimiter
Pattern
10 or 12 digits and an optional delimiter:
- 2-4 digits (optional)
- Six digits in date format YYMMDD
- Delimiter of "-" or "+" (optional), plus
- Four digits
Checksum
Yes
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_swedish_national_identifier
finds content that matches the pattern. - The checksum passes.
<!-- Sweden National ID -->
<Entity id="f69aaf40-79be-4fac-8f05-fd1910d272c8" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_swedish_national_identifier" />
</Pattern>
</Entity>
Keywords
None
Sweden Passport Number
Format
Eight digits
Pattern
Eight consecutive digits
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The regular expression Regex_sweden_passport_number finds content that matches the pattern.
One of the following statements is true:
- A keyword from Keyword_passport is found.
- A keyword from Keyword_sweden_passport is found.
<!-- Sweden Passport Number -->
<Entity id="ba4e7456-55a9-4d89-9140-c33673553526" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_sweden_passport_number" />
<Any minMatches="1">
<Match idRef="Keyword_passport" />
<Match idRef="Keyword_sweden_passport" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_sweden_passport
- visa requirements
- Alien Registration Card
- Schengen visas
- Schengen visa
- Visa Processing
- Visa Type
- Single Entry
- Multiple Entry
- G3 Processing Fees
Keyword_passport
- Passport Number
- Passport No
- Passport #
- Passport#
- PassportID
- Passportno
- passportnumber
- パスポート
- パスポート番号
- パスポートのNum
- パスポート#
- Numéro de passeport
- Passeport n °
- Passeport Non
- Passeport #
- Passeport#
- PasseportNon
- Passeportn °
SWIFT Code
Format
Four letters followed by 5-31 letters or digits
Pattern
Four letters followed by 5-31 letters or digits:
- Four-letter bank code (not case sensitive)
- An optional space
- 4-28 letters or digits (the Basic Bank Account Number (BBAN))
- An optional space
- 1-3 letters or digits (remainder of the BBAN)
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_swift
finds content that matches the pattern. - A keyword from Keyword_swift is found.
<Entity id="cb2ab58c-9cb8-4c81-baf8-a4e106791df4" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_swift" />
<Match idRef="Keyword_swift" />
</Pattern>
</Entity>
Keywords
Keyword_swift
- international organization for standardization 9362
- iso 9362
- iso9362
- swift#
- swiftcode
- swiftnumber
- swiftroutingnumber
- swift code
- swift number #
- swift routing number
- bic number
- bic code
- bic #
- bic#
- bank identifier code
- 標準化9362
- 迅速#
- SWIFTコード
- SWIFT番号
- 迅速なルーティング番号
- BIC番号
- BICコード
- 銀行識別コードのための国際組織
- Organisation internationale de normalisation 9362
- rapide #
- code SWIFT
- le numéro de swift
- swift numéro d'acheminement
- le numéro BIC
- # BIC
- code identificateur de banque
Taiwanese ID
Format
One letter (in English) followed by nine digits
Pattern
One letter (in English) followed by nine digits:
- One letter (in English, not case sensitive)
- The digit "1" or "2"
- Eight digits
Checksum
Yes
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_taiwanese_national_id
finds content that matches the pattern. - A keyword from Keyword_taiwanese_national_id is found.
- The checksum passes.
<!-- Taiwanese National ID -->
<Entity id="4C7BFC34-8DD1-421D-8FB7-6C6182C2AF03" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_taiwanese_national_id" />
<Match idRef="Keyword_taiwanese_national_id" />
</Pattern>
</Entity>
Keywords
Keyword_taiwanese_national_id
- 身份證字號
- 身份證
- 身份證號碼
- 身份證號
- 身分證字號
- 身分證
- 身分證號碼
- 身份證號
- 身分證統一編號
- 國民身分證統一編號
- 簽名
- 蓋章
- 簽名或蓋章
- 簽章
U.K. Driver's License Number
Format
Combination of 18 letters and digits in the specified format
Pattern
18 letters and digits:
- Five letters (not case sensitive) or the digit "9" in place of a letter
- One digit
- Five digits in the date format MMDDY for date of birth (seventh character is incremented by 50 if driver is female, that is, 51 to 62 instead of 01 to 12)
- Two letters (not case sensitive) or the digit "9" in place of a letter
- Five digits
Checksum
Yes
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_uk_drivers_license
finds content that matches the pattern. - A keyword from Keyword_uk_drivers_license is found.
- The checksum passes.
<!-- U.K. Driver's License Number -->
<Entity id="f93de4be-d94c-40df-a8be-461738047551" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_uk_drivers_license" />
<Match idRef="Keyword_uk_drivers_license" />
</Pattern>
</Entity>
Keywords
Keyword_uk_drivers_license
- DVLA
- light vans
- quadbikes
- motor cars
- 125cc
- sidecar
- tricycles
- motorcycles
- photocard licence
- learner drivers
- licence holder
- licence holders
- driving licences
- driving licence
- dual control car
U.K. Electoral Roll Number
Format
Two letters followed by 1-4 digits
Pattern
Two letters (not case sensitive) followed by 1-4 numbers
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_uk_electoral
finds content that matches the pattern. - A keyword from Keyword_uk_electoral is found.
<!-- U.K. Electoral Number -->
<Entity id="a3eea206-dc0c-4f06-9e22-aa1be3059963" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_uk_electoral" />
<Any minMatches="1">
<Match idRef="Keyword_uk_electoral" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_uk_electoral
- council nomination
- nomination form
- electoral register
- electoral roll
U.K. National Health Service Number
Format
10-17 digits separated by spaces
Pattern
10-17 digits:
- Either 3 or 10 digits
- A space
- Three digits
- A space
- Four digits
Checksum
Yes
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function
Func_uk_nhs_number
finds content that matches the pattern.One of the following statements is true:
- A keyword from Keyword_uk_nhs_number is found.
- A keyword from Keyword_uk_nhs_number1 is found.
- A keyword from Keyword_uk_nhs_number_dob is found.
The checksum passes.
<!-- U.K. NHS Number -->
<Entity id="3192014e-2a16-44e9-aa69-4b20375c9a78" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_uk_nhs_number" />
<Any minMatches="1">
<Match idRef="Keyword_uk_nhs_number" />
<Match idRef="Keyword_uk_nhs_number1" />
<Match idRef="Keyword_uk_nhs_number_dob" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_uk_nhs_number
- national health service
- nhs
- health services authority
- health authority
Keyword_uk_nhs_number1
- patient id
- patient identification
- patient no
- patient number
Keyword_uk_nhs_number_dob
- GP
- DOB
- D.O.B
- Date of Birth
- Birth Date
U.K. National Insurance Number (NINO)
Format
Seven characters or nine characters separated by spaces or dashes
Pattern
Two possible patterns:
- Two letters (valid NINOs use only certain characters in this prefix, which this pattern validates; not case sensitive)
- Six digits
- Either 'A', 'B', 'C', or 'D' (like the prefix, only certain characters are allowed in the suffix; not case sensitive)
OR
- Two letters
- A space or dash
- Two digits
- A space or dash
- Two digits
- A space or dash
- Two digits
- A space or dash
- Either 'A', 'B', 'C', or 'D'
Checksum
No
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_uk_nino
finds content that matches the pattern. - A keyword from Keyword_uk_nino is found.
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_uk_nino
finds content that matches the pattern. - No keyword from Keyword_uk_nino is found.
<!-- U.K. NINO -->
<Entity id="16c07343-c26f-49d2-a987-3daf717e94cc" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_uk_nino" />
<Any minMatches="1">
<Match idRef="Keyword_uk_nino" />
</Any>
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_uk_nino" />
<Any minMatches="0" maxMatches="0">
<Match idRef="Keyword_uk_nino" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_uk_nino
- national insurance number
- national insurance contributions
- protection act
- insurance
- social security number
- insurance application
- medical application
- social insurance
- medical attention
- social security
- great britain
- insurance
U.S. / U.K. Passport Number
Format
Nine digits
Pattern
Nine consecutive digits
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_usa_uk_passport
finds content that matches the pattern. - A keyword from Keyword_passport is found.
<Entity id="178ec42a-18b4-47cc-85c7-d62c92fd67f8" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_usa_uk_passport" />
<Match idRef="Keyword_passport" />
</Pattern>
</Entity>
Keywords
Keyword_passport
- Passport Number
- Passport No
- Passport #
- Passport#
- PassportID
- Passportno
- passportnumber
- パスポート
- パスポート番号
- パスポートのNum
- パスポート#
- Numéro de passeport
- Passeport n °
- Passeport Non
- Passeport #
- Passeport#
- PasseportNon
- Passeportn °
U.S. Bank Account Number
Format
8-17 digits
Pattern
8-17 consecutive digits
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The regular expression
Regex_usa_bank_account_number
finds content that matches the pattern. - A keyword from Keyword_usa_Bank_Account is found.
<!-- U.S. Bank Account Number -->
<Entity id="a2ce32a8-f935-4bb6-8e96-2a5157672e2c" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_usa_bank_account_number" />
<Match idRef="Keyword_usa_Bank_Account" />
</Pattern>
</Entity>
Keywords
Keyword_usa_Bank_Account
- Checking Account Number
- Checking Account
- Checking Account #
- Checking Acct Number
- Checking Acct #
- Checking Acct No.
- Checking Account No.
- Bank Account Number
- Bank Account #
- Bank Acct Number
- Bank Acct #
- Bank Acct No.
- Bank Account No.
- Savings Account Number
- Savings Account.
- Savings Account #
- Savings Acct Number
- Savings Acct #
- Savings Acct No.
- Savings Account No.
- Debit Account Number
- Debit Account
- Debit Account #
- Debit Acct Number
- Debit Acct #
- Debit Acct No.
- Debit Account No.
U.S. Driver's License Number
Format
Depends on the state
Pattern
Depends on the state—for example, New York:
- Nine digits formatted like ddd ddd ddd will match.
- Nine digits like ddddddddd won't match.
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_new_york_drivers_license_number
finds content that matches the pattern. - A keyword from Keyword_[state_name]_drivers_license_name is found.
- A keyword from Keyword_us_drivers_license is found.
A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_new_york_drivers_license_number
finds content that matches the pattern. - A keyword from Keyword_[state_name]_drivers_license_name is found.
- A keyword from Keyword_us_drivers_license_abbreviations is found.
- No keyword from Keyword_us_drivers_license is found.
<Entity id="dfeb356f-61cd-459e-bf0f-7c6d28b458c6 patternsProximity="300">
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_new_york_drivers_license_number" />
<Match idRef="Keyword_new_york_drivers_license_name" />
<Match idRef="Keyword_us_drivers_license" />
</Pattern>
<Pattern confidenceLevel="65">
<IdMatch idRef="Func_new_york_drivers_license_number" />
<Match idRef="Keyword_new_york_drivers_license_name" />
<Match idRef="Keyword_us_drivers_license_abbreviations" />
<Any minMatches="0" maxMatches="0">
<Match idRef="Keyword_us_drivers_license" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_us_drivers_license_abbreviations
- DL
- DLS
- CDL
- CDLS
- ID
- IDs
- DL#
- DLS#
- CDL#
- CDLS#
- ID#
- IDs#
- ID number
- ID numbers
- LIC
- LIC#
Keyword_us_drivers_license
- DriverLic
- DriverLics
- DriverLicense
- DriverLicenses
- Driver Lic
- Driver Lics
- Driver License
- Driver Licenses
- DriversLic
- DriversLics
- DriversLicense
- DriversLicenses
- Drivers Lic
- Drivers Lics
- Drivers License
- Drivers Licenses
- Driver'Lic
- Driver'Lics
- Driver'License
- Driver'Licenses
- Driver' Lic
- Driver' Lics
- Driver' License
- Driver' Licenses
- Driver'sLic
- Driver'sLics
- Driver'sLicense
- Driver'sLicenses
- Driver's Lic
- Driver's Lics
- Driver's License
- Driver's Licenses
- identification number
- identification numbers
- identification #
- id card
- id cards
- identification card
- identification cards
- DriverLic#
- DriverLics#
- DriverLicense#
- DriverLicenses#
- Driver Lic#
- Driver Lics#
- Driver License#
- Driver Licenses#
- DriversLic#
- DriversLics#
- DriversLicense#
- DriversLicenses#
- Drivers Lic#
- Drivers Lics#
- Drivers License#
- Drivers Licenses#
- Driver'Lic#
- Driver'Lics#
- Driver'License#
- Driver'Licenses#
- Driver' Lic#
- Driver' Lics#
- Driver' License#
- Driver' Licenses#
- Driver'sLic#
- Driver'sLics#
- Driver'sLicense#
- Driver'sLicenses#
- Driver's Lic#
- Driver's Lics#
- Driver's License#
- Driver's Licenses#
- id card#
- id cards#
- identification card#
- identification cards#
Keyword_[state_name]_drivers_license_name
- State abbreviation (for example, "NY")
- State name (for example, "New York")
U.S. Individual Taxpayer Identification Number (ITIN)
Format
Nine digits that start with a "9" and include a "7" or "8" as the fourth digit, optionally formatted with spaces or dashes
Pattern
Formatted:
- The digit "9"
- Two digits
- A space or dash
- A "7" or "8"
- A digit
- A space, or dash
- Four digits
Unformatted:
- The digit "9"
- Two digits
- A "7" or "8"
- Five digits
Checksum
No
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function
Func_formatted_itin
finds content that matches the pattern.At least one of the following statements is true:
- A keyword from Keyword_itin is found.
- The function
Func_us_address
finds an address in the right date format. - The function
Func_us_date finds
a date in the right date format. - A keyword from Keyword_itin_collaborative is found.
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function
Func_unformatted_itin
finds content that matches the pattern.At least one of the following statements is true:
- A keyword from Keyword_itin_collaborative is found.
- The function
Func_us_address
finds an address in the right date format. - The function
Func_us_date
finds a date in the right date format.
<!-- U.S. Individual Taxpayer Identification Number (ITIN) -->
<Entity id="e55e2a32-f92d-4985-a35d-a0b269eb687b" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_formatted_itin" />
<Any minMatches="1">
<Match idRef="Keyword_itin" />
<Match idRef="Func_us_address" />
<Match idRef="Func_us_date" />
<Match idRef="Keyword_itin_collaborative" />
</Any>
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_unformatted_itin" />
<Match idRef="Keyword_itin" />
<Any minMatches="1">
<Match idRef="Keyword_itin_collaborative" />
<Match idRef="Func_us_address" />
<Match idRef="Func_us_date" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_itin
- taxpayer
- tax id
- tax identification
- itin
- ssn
- tin
- social security
- tax payer
- itins
- taxid
- individual taxpayer
Keyword_itin_collaborative
- License
- DL
- DOB
- Birthdate
- Birthday
- Date of Birth
U.S. Social Security Number (SSN)
Format
Nine digits, which may be in a formatted or unformatted pattern
Note
If issued before mid-2011, an SSN has strong formatting where certain parts of the number must fall within certain ranges to be valid (but there's no checksum).
Pattern
Four functions look for SSNs in four different patterns:
Func_ssn
finds SSNs with pre-2011 strong formatting that are formatted with dashes or spaces (ddd-dd-dddd OR ddd dd dddd)Func_unformatted_ssn
finds SSNs with pre-2011 strong formatting that are unformatted as nine consecutive digits (ddddddddd)Func_randomized_formatted_ssn
finds post-2011 SSNs that are formatted with dashes or spaces (ddd-dd-dddd OR ddd dd dddd)Func_randomized_unformatted_ssn
finds post-2011 SSNs that are unformatted as nine consecutive digits (ddddddddd)
Checksum
No
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_ssn
finds content that matches the pattern. - A keyword from Keyword_ssn is found.
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_unformatted_ssn
finds content that matches the pattern. - A keyword from Keyword_ssn is found.
A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_randomized_formatted_ssn
finds content that matches the pattern. - A keyword from Keyword_ssn is found.
A DLP policy is 55% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
- The function
Func_randomized_unformatted_ssn
finds content that matches the pattern. - A keyword from Keyword_ssn is found.
<!-- U.S. Social Security Number (SSN) -->
<Entity id="a44669fe-0d48-453d-a9b1-2cc83f2cba77" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_ssn" />
<Match idRef="Keyword_ssn" />
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_unformatted_ssn" />
<Match idRef="Keyword_ssn" />
</Pattern>
<Pattern confidenceLevel="65">
<IdMatch idRef="Func_randomized_formatted_ssn" />
<Match idRef="Keyword_ssn" />
</Pattern>
<Pattern confidenceLevel="55">
<IdMatch idRef="Func_randomized_unformatted_ssn" />
<Match idRef="Keyword_ssn" />
</Pattern>
</Entity>
Keywords
Keyword_ssn
- Social Security
- Social Security#
- Soc Sec
- SSN
- SSNS
- SSN#
- SS#
- SSID