Data loss prevention (DLP) in SharePoint Server 2016 includes ten sensitive information types that are ready for you to use in your DLP policies. This topic lists all of these sensitive information types and shows what a DLP policy looks for when it detects each type. A sensitive information type is defined by a pattern that can be identified by a regular expression or a function. In addition, corroborative evidence such as keywords and checksums can be used to identify a sensitive information type. Confidence level and proximity are also used in the evaluation process.
Format |
9 digits which may be in a formatted or unformatted pattern |
||
---|---|---|---|
Pattern |
Formatted:
Unformatted:
|
||
Checksum |
No |
||
Definition |
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
|
||
Keywords |
|
Format |
16 digits which can be formatted or unformatted (dddddddddddddddd) and must pass the Luhn test. |
||||
---|---|---|---|---|---|
Pattern |
Very complex and robust pattern that detects cards from all major brands worldwide, including Visa, MasterCard, Discover Card, JCB, American Express, gift cards, and diner cards. |
||||
Checksum |
Yes, the Luhn checksum |
||||
Definition |
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
|
||||
Keywords |
|
Format |
16 digits |
||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Pattern |
Very complex and robust pattern |
||||||||||
Checksum |
Yes |
||||||||||
Definition |
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
|
||||||||||
Keywords |
|
Format |
Four letters followed by 5-31 letters or digits |
||
---|---|---|---|
Pattern |
Four letters followed by 5-31 letters or digits:
|
||
Checksum |
No |
||
Definition |
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
|
||
Keywords |
|
Format |
7 characters or 9 characters separated by spaces or dashes |
||
---|---|---|---|
Pattern |
Two possible patterns:
OR
|
||
Checksum |
No |
||
Definition |
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
|
||
Keywords |
|
Format |
Nine digits |
||
---|---|---|---|
Pattern |
Nine consecutive digits |
||
Checksum |
No |
||
Definition |
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
|
||
Keywords |
|
Format |
4-17 digits |
||
---|---|---|---|
Pattern |
4-17 consecutive digits |
||
Checksum |
No |
||
Definition |
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
|
||
Keywords |
|
Format |
Depends on the state |
||||||||
---|---|---|---|---|---|---|---|---|---|
Pattern |
Depends on the state -- for example, New York:
|
||||||||
Checksum |
No |
||||||||
Definition |
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
|
||||||||
Keywords |
|
Format |
Nine digits that start with a "9" and contain a "7" or "8" as the fourth digit, optionally formatted with spaces or dashes |
||||||
---|---|---|---|---|---|---|---|
Pattern |
Formatted:
Unformatted:
|
||||||
Checksum |
No |
||||||
Definition |
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
|
||||||
Keywords |
|
Format |
9 digits, which may be in a formatted or unformatted pattern Note: If issued before mid-2011, an SSN has strong formatting where certain parts of the number must fall within certain ranges to be valid (but there's no checksum). |
||
---|---|---|---|
Pattern |
Four functions look for SSNs in four different patterns:
|
||
Checksum |
No |
||
Definition |
A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
A DLP policy is 55% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:
|
||
Keywords |
|