Data loss prevention (DLP) in SharePoint Server 2016 includes ten sensitive information types that are ready for you to use in your DLP policies. This topic lists all of these sensitive information types and shows what a DLP policy looks for when it detects each type. A sensitive information type is defined by a pattern that can be identified by a regular expression or a function. In addition, corroborative evidence such as keywords and checksums can be used to identify a sensitive information type. Confidence level and proximity are also used in the evaluation process.
ABA Routing Number
Format
Nine digits, which may be in a formatted or unformatted pattern
Pattern
Formatted:
Four digits beginning with 0, 1, 2, 3, 6, 7, or 8
A hyphen
Four digits
A hyphen
A digit
Unformatted:
9 consecutive digits beginning with 0, 1, 2, 3, 6, 7, or 8
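The two layouts above can be approximated with a regular expression. The Python sketch below is illustrative only: the product relies on the internal function Func_aba_routing, whose implementation is not shown in this topic, and the names ABA_PATTERN and find_aba_candidates are hypothetical.

```python
import re

# Hypothetical approximation of the layouts described above; the real
# Func_aba_routing implementation is not documented in this topic.
ABA_PATTERN = re.compile(
    r"(?<!\d)"                      # not preceded by another digit
    r"(?:[0123678]\d{3}-\d{4}-\d"   # formatted: dddd-dddd-d
    r"|[0123678]\d{8})"             # unformatted: nine consecutive digits
    r"(?!\d)"                       # not followed by another digit
)


def find_aba_candidates(text: str) -> list[str]:
    """Return substrings that look like ABA routing numbers."""
    return ABA_PATTERN.findall(text)
```

A candidate found this way is only the primary match. The definition that follows still requires a keyword nearby before the policy reports a detection.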
Checksum
No
Definition
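A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function Func_aba_routing finds content that matches the pattern.
A keyword from Keyword_ABA_Routing is found.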
<!-- ABA Routing Number -->
<Entity id="cb353f78-2b72-4c3c-8827-92ebe4f69fdf" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_aba_routing" />
<Match idRef="Keyword_ABA_Routing" />
</Pattern>
</Entity>
Keywords
Keyword_ABA_Routing
aba
aba #
aba routing #
aba routing number
aba#
abarouting#
aba number
abaroutingnumber
american bank association routing #
american bank association routing number
americanbankassociationrouting#
americanbankassociationroutingnumber
bank routing number
bankrouting#
bankroutingnumber
routing transit number
RTN
Credit Card Number
Format
16 digits which can be formatted or unformatted (dddddddddddddddd) and must pass the Luhn test.
Pattern
A complex pattern that detects cards from all major brands worldwide, including Visa, MasterCard, Discover Card, JCB, American Express, gift cards, and Diners Club cards.
Checksum
Yes, the Luhn checksum
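The Luhn checksum is a public algorithm, so it can be sketched independently of the product. The Python function below is a minimal illustration, not the code behind Func_credit_card; the name passes_luhn is hypothetical.

```python
def passes_luhn(number: str) -> bool:
    """Return True if the digits in `number` satisfy the Luhn checksum."""
    # Ignore spaces, dashes, and any other non-digit separators.
    digits = [int(ch) for ch in number if ch.isdigit()]
    if not digits:
        return False
    total = 0
    # Walk from the rightmost digit and double every second digit.
    for index, digit in enumerate(reversed(digits)):
        if index % 2 == 1:
            digit *= 2
            if digit > 9:
                digit -= 9  # same as adding the two digits of the product
        total += digit
    return total % 10 == 0
```

For example, the widely used test number 4111 1111 1111 1111 passes this check, while changing its last digit to 2 makes it fail.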
Definition
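A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function Func_credit_card finds content that matches the pattern.
At least one of the following is true:
A keyword from Keyword_cc_verification is found.
A keyword from Keyword_cc_name is found.
The function Func_expiration_date finds a date in the right date format.
A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function Func_credit_card finds content that matches the pattern.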
<!-- Credit Card Number -->
<Entity id="50842eb7-edc8-4019-85dd-5a5c1f2bb085" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_credit_card" />
<Any minMatches="1">
<Match idRef="Keyword_cc_verification" />
<Match idRef="Keyword_cc_name" />
<Match idRef="Func_expiration_date" />
</Any>
</Pattern>
<Pattern confidenceLevel="65">
<IdMatch idRef="Func_credit_card" />
</Pattern>
</Entity>
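The two patterns in this entity can be read as a fallback chain: the 85% pattern needs supporting evidence near the primary match, and the 65% pattern needs only the primary match. The Python sketch below illustrates that reading. The regular expression, the short keyword tuple, and the way the 300-character window is measured are all assumptions made for illustration; Func_credit_card (including its Luhn check) and the full keyword lists are not reproduced.

```python
import re

PROXIMITY = 300  # patternsProximity from the Entity definition

# Hypothetical stand-ins for Func_credit_card and the keyword lists.
CARD_RE = re.compile(r"(?<!\d)\d{16}(?!\d)")
KEYWORDS = ("cvv2", "security code", "credit card", "visa")


def confidence_for_matches(text: str) -> list[tuple[str, int]]:
    """Return (candidate, confidence) pairs using the two-pattern rule."""
    results = []
    for match in CARD_RE.finditer(text):
        # Assumption: evidence counts only if it lies entirely within
        # PROXIMITY characters on either side of the primary match.
        window_start = max(0, match.start() - PROXIMITY)
        window_end = match.end() + PROXIMITY
        window = text[window_start:window_end].lower()
        has_evidence = any(keyword in window for keyword in KEYWORDS)
        # With evidence the 85% pattern applies; otherwise only the 65% one.
        results.append((match.group(), 85 if has_evidence else 65))
    return results
```

In this sketch a keyword that sits more than 300 characters away from the number does not raise the confidence above 65.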
Keywords
Keyword_cc_verification
Keyword_cc_name
card verification
card identification number
cvn
cid
cvc2
cvv2
pin block
security code
security number
security no
issue number
issue no
cryptogramme
numéro de sécurité
numero de securite
kreditkartenprüfnummer
kreditkartenprufnummer
prüfziffer
prufziffer
sicherheits Kode
sicherheitscode
sicherheitsnummer
verfalldatum
codice di verifica
cod. sicurezza
cod sicurezza
n autorizzazione
código
codigo
cod. seg
cod seg
código de segurança
codigo de seguranca
codigo de segurança
código de seguranca
cód. segurança
cod. seguranca
cod. segurança
cód. seguranca
cód segurança
cod seguranca
cod segurança
cód seguranca
número de verificação
numero de verificacao
ablauf
gültig bis
gültigkeitsdatum
gultig bis
gultigkeitsdatum
scadenza
data scad
fecha de expiracion
fecha de venc
vencimiento
válido hasta
valido hasta
vto
data de expiração
data de expiracao
data em que expira
validade
valor
vencimento
Venc
amex
american express
americanexpress
Visa
mastercard
master card
mc
mastercards
master cards
diner's Club
diners club
dinersclub
discover card
discovercard
discover cards
JCB
japanese card bureau
carte blanche
carteblanche
credit card
cc#
cc#:
expiration date
exp date
expiry date
date d’expiration
date d'exp
date expiration
bank card
bankcard
card number
card num
cardnumber
cardnumbers
card numbers
creditcard
credit cards
creditcards
ccn
card holder
cardholder
card holders
cardholders
check card
checkcard
check cards
checkcards
debit card
debitcard
debit cards
debitcards
atm card
atmcard
atm cards
atmcards
enroute
en route
card type
carte bancaire
carte de crédit
carte de credit
numéro de carte
numero de carte
nº de la carte
nº de carte
kreditkarte
karte
karteninhaber
karteninhabers
kreditkarteninhaber
kreditkarteninstitut
kreditkartentyp
eigentümername
kartennr
kartennummer
kreditkartennummer
kreditkarten-nummer
carta di credito
carta credito
n. carta
n carta
nr. carta
nr carta
numero carta
numero della carta
numero di carta
tarjeta credito
tarjeta de credito
tarjeta crédito
tarjeta de crédito
tarjeta de atm
tarjeta atm
tarjeta debito
tarjeta de debito
tarjeta débito
tarjeta de débito
nº de tarjeta
no. de tarjeta
no de tarjeta
numero de tarjeta
número de tarjeta
tarjeta no
tarjetahabiente
cartão de crédito
cartão de credito
cartao de crédito
cartao de credito
cartão de débito
cartao de débito
cartão de debito
cartao de debito
débito automático
debito automatico
número do cartão
numero do cartão
número do cartao
numero do cartao
número de cartão
numero de cartão
número de cartao
numero de cartao
nº do cartão
nº do cartao
nº. do cartão
no do cartão
no do cartao
no. do cartão
no. do cartao
EU Debit Card Number
Format
16 digits
Pattern
Very complex and robust pattern
Checksum
Yes
Definition
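A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function Func_eu_debit_card finds content that matches the pattern.
At least one of the following is true:
A keyword from Keyword_eu_debit_card is found.
A keyword from Keyword_card_terms_dict is found.
A keyword from Keyword_card_security_terms_dict is found.
A keyword from Keyword_card_expiration_terms_dict is found.
The function Func_expiration_date finds a date in the right date format.
The function Func_eu_date finds a date in the right date format.
The function Func_eu_date1 finds a date in the right date format.
The function Func_eu_date2 finds a date in the right date format.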
<!-- EU Debit Card Number -->
<Entity id="0e9b3178-9678-47dd-a509-37222ca96b42" patternsProximity="300" recommendedConfidence="85">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_eu_debit_card" />
<Any minMatches="1">
<Match idRef="Keyword_eu_debit_card" />
<Match idRef="Keyword_card_terms_dict" />
<Match idRef="Keyword_card_security_terms_dict" />
<Match idRef="Keyword_card_expiration_terms_dict" />
<Match idRef="Func_expiration_date" />
<Match idRef="Func_eu_date" />
<Match idRef="Func_eu_date1" />
<Match idRef="Func_eu_date2" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_eu_debit_card
Keyword_card_terms_dict
Keyword_card_security_terms_dict
Keyword_card_expiration_terms_dict
account number
card number
card no.
security number
cc#
acct nbr
acct num
acct no
american express
americanexpress
americano espresso
amex
atm card
atm cards
atm kaart
atmcard
atmcards
atmkaart
atmkaarten
bancontact
bank card
bankkaart
card holder
card holders
card num
card number
card numbers
card type
cardano numerico
cardholder
cardholders
cardnumber
cardnumbers
carta bianca
carta credito
carta di credito
cartao de credito
cartao de crédito
cartao de debito
cartao de débito
carte bancaire
carte blanche
carte bleue
carte de credit
carte de crédit
carte di credito
carteblanche
cartão de credito
cartão de crédito
cartão de debito
cartão de débito
cb
ccn
check card
check cards
checkcard
checkcards
chequekaart
cirrus
cirrus-edc-maestro
controlekaart
controlekaarten
credit card
credit cards
creditcard
creditcards
debetkaart
debetkaarten
debit card
debit cards
debitcard
debitcards
debito automatico
diners club
dinersclub
discover
discover card
discover cards
discovercard
discovercards
débito automático
edc
eigentümername
european debit card
hoofdkaart
hoofdkaarten
in viaggio
japanese card bureau
japanse kaartdienst
jcb
kaart
kaart num
kaartaantal
kaartaantallen
kaarthouder
kaarthouders
karte
karteninhaber
karteninhabers
kartennr
kartennummer
kreditkarte
kreditkarten-nummer
kreditkarteninhaber
kreditkarteninstitut
kreditkartennummer
kreditkartentyp
maestro
master card
master cards
mastercard
mastercards
mc
mister cash
n carta
n. carta
no de tarjeta
no do cartao
no do cartão
no. de tarjeta
no. do cartao
no. do cartão
nr carta
nr. carta
numeri di scheda
numero carta
numero de cartao
numero de carte
numero de cartão
numero de tarjeta
numero della carta
numero di carta
numero di scheda
numero do cartao
numero do cartão
numéro de carte
nº carta
nº de carte
nº de la carte
nº de tarjeta
nº do cartao
nº do cartão
nº. do cartão
número de cartao
número de cartão
número de tarjeta
número do cartao
scheda dell'assegno
scheda dell'atmosfera
scheda dell'atmosfera
scheda della banca
scheda di controllo
scheda di debito
scheda matrice
schede dell'atmosfera
schede di controllo
schede di debito
schede matrici
scoprono la scheda
scoprono le schede
solo
supporti di scheda
supporto di scheda
switch
tarjeta atm
tarjeta credito
tarjeta de atm
tarjeta de credito
tarjeta de debito
tarjeta debito
tarjeta no
tarjetahabiente
tipo della scheda
ufficio giapponese della scheda
v pay
v-pay
visa
visa plus
visa electron
visto
visum
vpay
card identification number
card verification
cardi la verifica
cid
cod seg
cod seguranca
cod segurança
cod sicurezza
cod. seg
cod. seguranca
cod. segurança
cod. sicurezza
codice di sicurezza
codice di verifica
codigo
codigo de seguranca
codigo de segurança
crittogramma
cryptogram
cryptogramme
cv2
cvc
cvc2
cvn
cvv
cvv2
cód seguranca
cód segurança
cód. seguranca
cód. segurança
código
código de seguranca
código de segurança
de kaart controle
geeft nr uit
issue no
issue number
kaartidentificatienummer
kreditkartenprufnummer
kreditkartenprüfnummer
kwestieaantal
no. dell'edizione
no. di sicurezza
numero de securite
numero de verificacao
numero dell'edizione
numero di identificazione della scheda
numero di sicurezza
numero van veiligheid
numéro de sécurité
nº autorizzazione
número de verificação
perno il blocco
pin block
prufziffer
prüfziffer
security code
security no
security number
sicherheits kode
sicherheitscode
sicherheitsnummer
speldblok
veiligheid nr
veiligheidsaantal
veiligheidscode
veiligheidsnummer
verfalldatum
ablauf
data de expiracao
data de expiração
data del exp
data di exp
data di scadenza
data em que expira
data scad
data scadenza
date de validité
datum afloop
datum van exp
de afloop
espira
espira
exp date
exp datum
expiration
expire
expires
expiry
fecha de expiracion
fecha de venc
gultig bis
gultigkeitsdatum
gültig bis
gültigkeitsdatum
la scadenza
scadenza
valable
validade
valido hasta
valor
venc
vencimento
vencimiento
verloopt
vervaldag
vervaldatum
vto
válido hasta
SWIFT Code
Format
Four letters followed by 5-31 letters or digits
Pattern
Four letters followed by 5-31 letters or digits:
Four-letter bank code (not case sensitive)
An optional space
4-28 letters or digits (the Basic Bank Account Number (BBAN))
An optional space
1-3 letters or digits (remainder of the BBAN)
Checksum
No
Definition
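A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The regular expression Regex_swift finds content that matches the pattern.
A keyword from Keyword_swift is found.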
<Entity id="cb2ab58c-9cb8-4c81-baf8-a4e106791df4" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_swift" />
<Match idRef="Keyword_swift" />
</Pattern>
</Entity>
Keywords
Keyword_swift
international organization for standardization 9362
iso 9362
iso9362
swift#
swiftcode
swiftnumber
swiftroutingnumber
swift code
swift number #
swift routing number
bic number
bic code
bic #
bic#
bank identifier code
標準化9362
迅速#
SWIFTコード
SWIFT番号
迅速なルーティング番号
BIC番号
BICコード
銀行識別コードのための国際組織
Organisation internationale de normalisation 9362
rapide #
code SWIFT
le numéro de swift
swift numéro d'acheminement
le numéro BIC
# BIC
code identificateur de banque
U.K. National Insurance Number (NINO)
Format
7 characters or 9 characters separated by spaces or dashes
Pattern
Two possible patterns:
Two letters (valid NINOs use only certain characters in this prefix, which this pattern validates; not case sensitive)
Six digits
Either 'A', 'B', 'C', or 'D' (like the prefix, only certain characters are allowed in the suffix; not case sensitive)
OR
Checksum
No
Definition
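A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function Func_uk_nino finds content that matches the pattern.
A keyword from Keyword_uk_nino is found.
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function Func_uk_nino finds content that matches the pattern.
No keyword from Keyword_uk_nino is found.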
<!-- U.K. NINO -->
<Entity id="16c07343-c26f-49d2-a987-3daf717e94cc" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_uk_nino" />
<Any minMatches="1">
<Match idRef="Keyword_uk_nino" />
</Any>
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_uk_nino" />
<Any minMatches="0" maxMatches="0">
<Match idRef="Keyword_uk_nino" />
</Any>
</Pattern>
</Entity>
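The second pattern uses an Any element with minMatches="0" and maxMatches="0", which reads as "none of these may be found". That makes the two patterns mutually exclusive. The Python sketch below illustrates this reading of the XML; the function names are hypothetical, and treating an omitted maxMatches as "no upper bound" is an assumption.

```python
from typing import Optional


def any_group_satisfied(
    found: int, min_matches: int, max_matches: Optional[int] = None
) -> bool:
    """Evaluate an <Any> group given how many of its children matched."""
    if found < min_matches:
        return False
    # Assumption: an omitted maxMatches means there is no upper bound.
    if max_matches is not None and found > max_matches:
        return False
    return True


def nino_confidence(primary_found: bool, keyword_found: bool) -> Optional[int]:
    """Return the confidence level for the U.K. NINO entity, or None."""
    if not primary_found:
        return None  # Func_uk_nino is the IdMatch in both patterns
    found = 1 if keyword_found else 0
    if any_group_satisfied(found, min_matches=1):
        return 85  # keyword present
    if any_group_satisfied(found, min_matches=0, max_matches=0):
        return 75  # keyword must be absent
    return None
```

Because the 75% pattern requires zero keyword matches, a NINO with a keyword nearby is reported only at 85% in this sketch, never at both levels.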
Keywords
Keyword_uk_nino
national insurance number
national insurance contributions
protection act
insurance
social security number
insurance application
medical application
social insurance
medical attention
social security
great britain
insurance
U.S. / U.K. Passport Number
Format
Nine digits
Pattern
Nine consecutive digits
Checksum
No
Definition
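A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function Func_usa_uk_passport finds content that matches the pattern.
A keyword from Keyword_passport is found.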
<Entity id="178ec42a-18b4-47cc-85c7-d62c92fd67f8" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_usa_uk_passport" />
<Match idRef="Keyword_passport" />
</Pattern>
</Entity>
Keywords
Keyword_passport
Passport Number
Passport No
Passport #
Passport#
PassportID
Passportno
passportnumber
パスポート
パスポート番号
パスポートのNum
パスポート#
Numéro de passeport
Passeport n °
Passeport Non
Passeport #
Passeport#
PasseportNon
Passeportn °
U.S. Bank Account Number
Format
4-17 digits
Pattern
4-17 consecutive digits
Checksum
No
Definition
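A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The regular expression Regex_usa_bank_account_number finds content that matches the pattern.
A keyword from Keyword_usa_Bank_Account is found.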
<!-- U.S. Bank Account Number -->
<Entity id="a2ce32a8-f935-4bb6-8e96-2a5157672e2c" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
<IdMatch idRef="Regex_usa_bank_account_number" />
<Match idRef="Keyword_usa_Bank_Account" />
</Pattern>
</Entity>
Keywords
Keyword_usa_Bank_Account
Checking Account Number
Checking Account
Checking Account #
Checking Acct Number
Checking Acct #
Checking Acct No.
Checking Account No.
Bank Account Number
Bank Account #
Bank Acct Number
Bank Acct #
Bank Acct No.
Bank Account No.
Savings Account Number
Savings Account.
Savings Account #
Savings Acct Number
Savings Acct #
Savings Acct No.
Savings Account No.
Debit Account Number
Debit Account
Debit Account #
Debit Acct Number
Debit Acct #
Debit Acct No.
Debit Account No.
U.S. Driver's License Number
Format
Depends on the state
Pattern
Depends on the state -- for example, New York:
Checksum
No
Definition
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function Func_new_york_drivers_license_number finds content that matches the pattern.
A keyword from Keyword_[state_name]_drivers_license_name is found.
A keyword from Keyword_us_drivers_license is found.
A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function Func_new_york_drivers_license_number finds content that matches the pattern.
A keyword from Keyword_[state_name]_drivers_license_name is found.
A keyword from Keyword_us_drivers_license_abbreviations is found.
No keyword from Keyword_us_drivers_license is found.
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_new_york_drivers_license_number" />
<Match idRef="Keyword_new_york_drivers_license_name" />
<Match idRef="Keyword_us_drivers_license" />
</Pattern>
<Pattern confidenceLevel="65">
<IdMatch idRef="Func_new_york_drivers_license_number" />
<Match idRef="Keyword_new_york_drivers_license_name" />
<Match idRef="Keyword_us_drivers_license_abbreviations" />
<Any minMatches="0" maxMatches="0">
<Match idRef="Keyword_us_drivers_license" />
</Any>
</Pattern>
Keywords
Keyword_us_drivers_license_abbreviations
Keyword_us_drivers_license
Keyword_[state_name]_drivers_license_name
DL
DLS
CDL
CDLS
ID
IDs
DL#
DLS#
CDL#
CDLS#
ID#
IDs#
ID number
ID numbers
LIC
LIC#
DriverLic
DriverLics
DriverLicense
DriverLicenses
Driver Lic
Driver Lics
Driver License
Driver Licenses
DriversLic
DriversLics
DriversLicense
DriversLicenses
Drivers Lic
Drivers Lics
Drivers License
Drivers Licenses
Driver'Lic
Driver'Lics
Driver'License
Driver'Licenses
Driver' Lic
Driver' Lics
Driver' License
Driver' Licenses
Driver'sLic
Driver'sLics
Driver'sLicense
Driver'sLicenses
Driver's Lic
Driver's Lics
Driver's License
Driver's Licenses
identification number
identification numbers
identification #
id card
id cards
identification card
identification cards
DriverLic#
DriverLics#
DriverLicense#
DriverLicenses#
Driver Lic#
Driver Lics#
Driver License#
Driver Licenses#
DriversLic#
DriversLics#
DriversLicense#
DriversLicenses#
Drivers Lic#
Drivers Lics#
Drivers License#
Drivers Licenses#
Driver'Lic#
Driver'Lics#
Driver'License#
Driver'Licenses#
Driver' Lic#
Driver' Lics#
Driver' License#
Driver' Licenses#
Driver'sLic#
Driver'sLics#
Driver'sLicense#
Driver'sLicenses#
Driver's Lic#
Driver's Lics#
Driver's License#
Driver's Licenses#
id card#
id cards#
identification card#
identification cards#
State abbreviation (for example, "NY")
State name (for example, "New York")
U.S. Individual Taxpayer Identification Number (ITIN)
Format
Nine digits that start with a "9" and contain a "7" or "8" as the fourth digit, optionally formatted with spaces or dashes
Pattern
Formatted:
The digit "9"
Two digits
A space or dash
A "7" or "8"
A digit
A space or dash
Four digits
Unformatted:
The digit "9"
Two digits
A "7" or "8"
Five digits
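The formatted and unformatted layouts above translate directly into two regular expressions. The Python sketch below is an approximation for illustration; Func_formatted_itin and Func_unformatted_itin are internal functions whose implementations are not shown in this topic, and the names used here are hypothetical.

```python
import re
from typing import Optional

# Hypothetical approximations of Func_formatted_itin and Func_unformatted_itin.
FORMATTED_ITIN = re.compile(r"9\d{2}[ -][78]\d[ -]\d{4}")
UNFORMATTED_ITIN = re.compile(r"9\d{2}[78]\d{5}")


def classify_itin(candidate: str) -> Optional[str]:
    """Return 'formatted', 'unformatted', or None for a candidate string."""
    if FORMATTED_ITIN.fullmatch(candidate):
        return "formatted"
    if UNFORMATTED_ITIN.fullmatch(candidate):
        return "unformatted"
    return None
```

The distinction matters because the definition that follows assigns the two layouts to different confidence levels.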
Checksum
No
Definition
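A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function Func_formatted_itin finds content that matches the pattern.
At least one of the following is true:
A keyword from Keyword_itin is found.
The function Func_us_address finds an address in the right address format.
The function Func_us_date finds a date in the right date format.
A keyword from Keyword_itin_collaborative is found.
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function Func_unformatted_itin finds content that matches the pattern.
A keyword from Keyword_itin is found.
At least one of the following is true:
A keyword from Keyword_itin_collaborative is found.
The function Func_us_address finds an address in the right address format.
The function Func_us_date finds a date in the right date format.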
<!-- U.S. Individual Taxpayer Identification Number (ITIN) -->
<Entity id="e55e2a32-f92d-4985-a35d-a0b269eb687b" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_formatted_itin" />
<Any minMatches="1">
<Match idRef="Keyword_itin" />
<Match idRef="Func_us_address" />
<Match idRef="Func_us_date" />
<Match idRef="Keyword_itin_collaborative" />
</Any>
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_unformatted_itin" />
<Match idRef="Keyword_itin" />
<Any minMatches="1">
<Match idRef="Keyword_itin_collaborative" />
<Match idRef="Func_us_address" />
<Match idRef="Func_us_date" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_itin
taxpayer
tax id
tax identification
itin
ssn
tin
social security
tax payer
itins
taxid
individual taxpayer
Keyword_itin_collaborative
License
DL
DOB
Birthdate
Birthday
Date of Birth
U.S. Social Security Number (SSN)
Format
9 digits, which may be in a formatted or unformatted pattern
Note: If issued before mid-2011, an SSN has strong formatting where certain parts of the number must fall within certain ranges to be valid (but there's no checksum).
Pattern
Four functions look for SSNs in four different patterns:
Func_ssn finds SSNs with pre-2011 strong formatting that are formatted with dashes or spaces (ddd-dd-dddd OR ddd dd dddd)
Func_unformatted_ssn finds SSNs with pre-2011 strong formatting that are unformatted as nine consecutive digits (ddddddddd)
Func_randomized_formatted_ssn finds post-2011 SSNs that are formatted with dashes or spaces (ddd-dd-dddd OR ddd dd dddd)
Func_randomized_unformatted_ssn finds post-2011 SSNs that are unformatted as nine consecutive digits (ddddddddd)
Checksum
No
Definition
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function Func_ssn finds content that matches the pattern.
At least one of the following is true:
A keyword from Keyword_ssn is found.
The function Func_us_date finds a date in the right date format.
The function Func_us_address finds an address in the right address format.
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function Func_unformatted_ssn finds content that matches the pattern.
A keyword from Keyword_ssn is found.
At least one of the following is true:
The function Func_us_date finds a date in the right date format.
The function Func_us_address finds an address in the right address format.
A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function Func_randomized_formatted_ssn finds content that matches the pattern.
The function Func_ssn does not find content that matches the pattern.
At least one of the following is true:
A keyword from Keyword_ssn is found.
The function Func_us_date finds a date in the right date format.
The function Func_us_address finds an address in the right address format.
A DLP policy is 55% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
The function Func_randomized_unformatted_ssn finds content that matches the pattern.
A keyword from Keyword_ssn is found.
The function Func_unformatted_ssn does not find content that matches the pattern.
At least one of the following is true:
The function Func_us_date finds a date in the right date format.
The function Func_us_address finds an address in the right address format.
<!-- U.S. Social Security Number (SSN) -->
<Entity id="a44669fe-0d48-453d-a9b1-2cc83f2cba77" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="85">
<IdMatch idRef="Func_ssn" />
<Any minMatches="1">
<Match idRef="Keyword_ssn" />
<Match idRef="Func_us_date" />
<Match idRef="Func_us_address" />
</Any>
</Pattern>
<Pattern confidenceLevel="75">
<IdMatch idRef="Func_unformatted_ssn" />
<Match idRef="Keyword_ssn" />
<Any minMatches="1">
<Match idRef="Func_us_date" />
<Match idRef="Func_us_address" />
</Any>
</Pattern>
<Pattern confidenceLevel="65">
<IdMatch idRef="Func_randomized_formatted_ssn" />
<Any minMatches="0" maxMatches="0">
<Match idRef="Func_ssn" />
</Any>
<Any minMatches="1">
<Match idRef="Keyword_ssn" />
<Match idRef="Func_us_date" />
<Match idRef="Func_us_address" />
</Any>
</Pattern>
<Pattern confidenceLevel="55">
<IdMatch idRef="Func_randomized_unformatted_ssn" />
<Match idRef="Keyword_ssn" />
<Any minMatches="0" maxMatches="0">
<Match idRef="Func_unformatted_ssn" />
</Any>
<Any minMatches="1">
<Match idRef="Func_us_date" />
<Match idRef="Func_us_address" />
</Any>
</Pattern>
</Entity>
Keywords
Keyword_ssn
Social Security
Social Security#
Soc Sec
SSN
SSNS
SSN#
SS#
SSID