What the sensitive information types in SharePoint Server 2016 look for

Data loss prevention (DLP) in SharePoint Server 2016 includes ten sensitive information types that are ready for you to use in your DLP policies. This topic lists all of these sensitive information types and shows what a DLP policy looks for when it detects each type. A sensitive information type is defined by a pattern that can be identified by a regular expression or a function. In addition, corroborative evidence such as keywords and checksums can be used to identify a sensitive information type. Confidence level and proximity are also used in the evaluation process.

Format

9 digits which may be in a formatted or unformatted pattern

Pattern

Formatted:

  • Four digits beginning with 0, 1, 2, 3, 6, 7, or 8

  • A hyphen

  • Four digits

  • A hyphen

  • A digit

Unformatted:

  • 9 consecutive digits beginning with 0, 1, 2, 3, 6, 7, or 8

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_aba_routing finds content that matches the pattern.

  • A keyword from Keyword_ABA_Routing is found.

<!-- ABA Routing Number -->
<Entity id="cb353f78-2b72-4c3c-8827-92ebe4f69fdf" patternsProximity="300" recommendedConfidence="75">
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_aba_routing" />
        <Match idRef="Keyword_ABA_Routing" />
      </Pattern>
 </Entity>

Keywords

Keyword_ABA_Routing

aba

aba #

aba routing #

aba routing number

aba#

abarouting#

aba number

abaroutingnumber

american bank association routing #

american bank association routing number

americanbankassociationrouting#

americanbankassociationroutingnumber

bank routing number

bankrouting#

bankroutingnumber

routing transit number

RTN

Format

14 digits which can be formatted or unformatted (dddddddddddddd) and must pass the Luhn test.

Pattern

Very complex and robust pattern that detects cards from all major brands worldwide, including Visa, MasterCard, Discover Card, JCB, American Express, gift cards, and diner cards.

Checksum

Yes, the Luhn checksum

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_credit_card finds content that matches the pattern.

  • One of the following is true:

    • A keyword from Keyword_cc_verification is found.

    • A keyword from Keyword_cc_name is found.

    • The function Func_expiration_date finds a date in the right date format.

  • The checksum passes.

A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_credit_card finds content that matches the pattern.

  • The checksum passes.

<!-- Credit Card Number -->
<Entity id="50842eb7-edc8-4019-85dd-5a5c1f2bb085" patternsProximity="300" recommendedConfidence="85">
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_credit_card" />
        <Any minMatches="1">
          <Match idRef="Keyword_cc_verification" />
          <Match idRef="Keyword_cc_name" />
          <Match idRef="Func_expiration_date" />
        </Any>
  </Pattern>
  <Pattern confidenceLevel="65">
        <IdMatch idRef="Func_credit_card" />
  </Pattern>
</Entity>

Keywords

Keyword_cc_verification

Keyword_cc_name

card verification

card identification number

cvn

cid

cvc2

cvv2

pin block

security code

security number

security no

issue number

issue no

cryptogramme

numéro de sécurité

numero de securite

kreditkartenprüfnummer

kreditkartenprufnummer

prüfziffer

prufziffer

sicherheits Kode

sicherheitscode

sicherheitsnummer

verfalldatum

codice di verifica

cod. sicurezza

cod sicurezza

n autorizzazione

código

codigo

cod. seg

cod seg

código de segurança

codigo de seguranca

codigo de segurança

código de seguranca

cód. segurança

cod. seguranca cod. segurança

cód. seguranca

cód segurança

cod seguranca cod segurança

cód seguranca

número de verificação

numero de verificacao

ablauf

gültig bis

gültigkeitsdatum

gultig bis

gultigkeitsdatum

scadenza

data scad

fecha de expiracion

fecha de venc

vencimiento

válido hasta

valido hasta

vto

data de expiração

data de expiracao

data em que expira

validade

valor

vencimento

Venc

amex

american express

americanexpress

Visa

mastercard

master card

mc

mastercards

master cards

diner's Club

diners club

dinersclub

discover card

discovercard

discover cards

JCB

japanese card bureau

carte blanche

carteblanche

credit card

cc#

cc#:

expiration date

exp date

expiry date

date d’expiration

date d'exp

date expiration

bank card

bankcard

card number

card num

cardnumber

cardnumbers

card numbers

creditcard

credit cards

creditcards

ccn

card holder

cardholder

card holders

cardholders

check card

checkcard

check cards

checkcards

debit card

debitcard

debit cards

debitcards

atm card

atmcard

atm cards

atmcards

enroute

en route

card type

carte bancaire

carte de crédit

carte de credit

numéro de carte

numero de carte

nº de la carte

nº de carte

kreditkarte

karte

karteninhaber

karteninhabers

kreditkarteninhaber

kreditkarteninstitut

kreditkartentyp

eigentümername

kartennr

kartennummer

kreditkartennummer

kreditkarten-nummer

carta di credito

carta credito

n. carta

n carta

nr. carta

nr carta

numero carta

numero della carta

numero di carta

tarjeta credito

tarjeta de credito

tarjeta crédito

tarjeta de crédito

tarjeta de atm

tarjeta atm

tarjeta debito

tarjeta de debito

tarjeta débito

tarjeta de débito

nº de tarjeta

no. de tarjeta

no de tarjeta

numero de tarjeta

número de tarjeta

tarjeta no

tarjetahabiente

cartão de crédito

cartão de credito

cartao de crédito

cartao de credito

cartão de débito

cartao de débito

cartão de debito

cartao de debito

débito automático

debito automatico

número do cartão

numero do cartão

número do cartao

numero do cartao

número de cartão

numero de cartão

número de cartao

numero de cartao

nº do cartão

nº do cartao

nº. do cartão

no do cartão

no do cartao

no. do cartão

no. do cartao

Format

16 digits

Pattern

Very complex and robust pattern

Checksum

Yes

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_eu_debit_card finds content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_eu_debit_card is found.

    • A keyword from Keyword_card_terms_dict is found.

    • A keyword from Keyword_card_security_terms_dict is found.

    • A keyword from Keyword_card_expiration_terms_dict is found.

    • The function Func_eu_date1 finds a date in the right date format.

    • The function Func_eu_date2 finds a date in the right date format.

  • The checksum passes.

<!-- EU Debit Card Number -->
<Entity id="0e9b3178-9678-47dd-a509-37222ca96b42" patternsProximity="300" recommendedConfidence="85">
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_eu_debit_card" />
        <Any minMatches="1">
          <Match idRef="Keyword_eu_debit_card" />
          <Match idRef="Keyword_card_terms_dict" />
          <Match idRef="Keyword_card_security_terms_dict" />
          <Match idRef="Keyword_card_expiration_terms_dict" />
          <Match idRef="Func_expiration_date" />
          <Match idRef="Func_eu_date" />
          <Match idRef="Func_eu_date1" />
          <Match idRef="Func_eu_date2" />
        </Any>
  </Pattern>
</Entity>

Keywords

Keyword_eu_debit_card

Keyword_card_terms_dict

Keyword_card_security_terms_dict

Keyword_card_expiration_terms_dict

account number

card number

card no.

security number

cc#

acct nbr

acct num

acct no

american express

americanexpress

americano espresso

amex

atm card

atm cards

atm kaart

atmcard

atmcards

atmkaart

atmkaarten

bancontact

bank card

bankkaart

card holder

card holders

card num

card number

card numbers

card type

cardano numerico

cardholder

cardholders

cardnumber

cardnumbers

carta bianca

carta credito

carta di credito

cartao de credito

cartao de crédito

cartao de debito

cartao de débito

carte bancaire

carte blanche

carte bleue

carte de credit

carte de crédit

carte di credito

carteblanche

cartão de credito

cartão de crédito

cartão de debito

cartão de débito

cb

ccn

check card

check cards

checkcard

checkcards

chequekaart

cirrus

cirrus-edc-maestro

controlekaart

controlekaarten

credit card

credit cards

creditcard

creditcards

debetkaart

debetkaarten

debit card

debit cards

debitcard

debitcards

debito automatico

diners club

dinersclub

discover

discover card

discover cards

discovercard

discovercards

débito automático

edc

eigentümername

european debit card

hoofdkaart

hoofdkaarten

in viaggio

japanese card bureau

japanse kaartdienst

jcb

kaart

kaart num

kaartaantal

kaartaantallen

kaarthouder

kaarthouders

karte

karteninhaber

karteninhabers

kartennr

kartennummer

kreditkarte

kreditkarten-nummer

kreditkarteninhaber

kreditkarteninstitut

kreditkartennummer

kreditkartentyp

maestro

master card

master cards

mastercard

mastercards

mc

mister cash

n carta

n. carta

no de tarjeta

no do cartao

no do cartão

no. de tarjeta

no. do cartao

no. do cartão

nr carta

nr. carta

numeri di scheda

numero carta

numero de cartao

numero de carte

numero de cartão

numero de tarjeta

numero della carta

numero di carta

numero di scheda

numero do cartao

numero do cartão

numéro de carte

nº carta

nº de carte

nº de la carte

nº de tarjeta

nº do cartao

nº do cartão

nº. do cartão

número de cartao

número de cartão

número de tarjeta

número do cartao

scheda dell'assegno

scheda dell'atmosfera

scheda dell'atmosfera

scheda della banca

scheda di controllo

scheda di debito

scheda matrice

schede dell'atmosfera

schede di controllo

schede di debito

schede matrici

scoprono la scheda

scoprono le schede

solo

supporti di scheda

supporto di scheda

switch

tarjeta atm

tarjeta credito

tarjeta de atm

tarjeta de credito

tarjeta de debito

tarjeta debito

tarjeta no

tarjetahabiente

tipo della scheda

ufficio giapponese della

scheda

v pay

v-pay

visa

visa plus

visa electron

visto

visum

vpay

card identification number

card verification

cardi la verifica

cid

cod seg

cod seguranca

cod segurança

cod sicurezza

cod. seg

cod. seguranca

cod. segurança

cod. sicurezza

codice di sicurezza

codice di verifica

codigo

codigo de seguranca

codigo de segurança

crittogramma

cryptogram

cryptogramme

cv2

cvc

cvc2

cvn

cvv

cvv2

cód seguranca

cód segurança

cód. seguranca

cód. segurança

código

código de seguranca

código de segurança

de kaart controle

geeft nr uit

issue no

issue number

kaartidentificatienummer

kreditkartenprufnummer

kreditkartenprüfnummer

kwestieaantal

no. dell'edizione

no. di sicurezza

numero de securite

numero de verificacao

numero dell'edizione

numero di identificazione della

scheda

numero di sicurezza

numero van veiligheid

numéro de sécurité

nº autorizzazione

número de verificação

perno il blocco

pin block

prufziffer

prüfziffer

security code

security no

security number

sicherheits kode

sicherheitscode

sicherheitsnummer

speldblok

veiligheid nr

veiligheidsaantal

veiligheidscode

veiligheidsnummer

verfalldatum

ablauf

data de expiracao

data de expiração

data del exp

data di exp

data di scadenza

data em que expira

data scad

data scadenza

date de validité

datum afloop

datum van exp

de afloop

espira

espira

exp date

exp datum

expiration

expire

expires

expiry

fecha de expiracion

fecha de venc

gultig bis

gultigkeitsdatum

gültig bis

gültigkeitsdatum

la scadenza

scadenza

valable

validade

valido hasta

valor

venc

vencimento

vencimiento

verloopt

vervaldag

vervaldatum

vto

válido hasta

Format

Four letters followed by 5-31 letters or digits

Pattern

Four letters followed by 5-31 letters or digits:

  • Four-letter bank code (not case sensitive)

  • An optional space

  • 4-28 letters or digits (the Basic Bank Account Number (BBAN))

  • An optional space

  • 1-3 letters or digits (remainder of the BBAN)

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_swift finds content that matches the pattern.

  • A keyword from Keyword_swift is found.

<Entity id="cb2ab58c-9cb8-4c81-baf8-a4e106791df4" patternsProximity="300" recommendedConfidence="75">
<Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_swift" />
        <Match idRef="Keyword_swift" />
    </Pattern>
</Entity>

Keywords

Keyword_swift

international organization for standardization 9362

iso 9362

iso9362

swift#

swiftcode

swiftnumber

swiftroutingnumber

swift code

swift number #

swift routing number

bic number

bic code

bic #

bic#

bank identifier code

標準化9362

迅速#

SWIFTコード

SWIFT番号

迅速なルーティング番号

BIC番号

BICコード

銀行識別コードのための国際組織

Organisation internationale de normalisation 9362

rapide #

code SWIFT

le numéro de swift

swift numéro d'acheminement

le numéro BIC

# BIC

code identificateur de banque

Format

14 or 15 characters separated by spaces

Pattern

14 or 15 characters separated by spaces:

  • Two-letter code (not case sensitive)

  • Six digits

  • A space or dash

  • Two digits

  • A space or dash

  • Two digits

  • A space or dash

  • Two digits

  • A letter or a space

Checksum

No

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_uk_nino finds content that matches the pattern.

  • A keyword from Keyword_uk_nino is found.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_uk_nino finds content that matches the pattern.

  • No keyword from Keyword_uk_nino is found.

<!-- U.K. NINO -->
<Entity id="16c07343-c26f-49d2-a987-3daf717e94cc" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_uk_nino" />
        <Any minMatches="1">
          <Match idRef="Keyword_uk_nino" />
        </Any>
    </Pattern>    
     <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_uk_nino" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_uk_nino" />
        </Any>
    </Pattern>
</Entity>

Keywords

Keyword_uk_nino

national insurance number

national insurance contributions

protection act

insurance

social security number

insurance application

medical application

social insurance

medical attention

social security

great britain

insurance

Format

Nine digits

Pattern

Nine consecutive digits

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_usa_uk_passport finds content that matches the pattern.

  • A keyword from Keyword_passport is found.

<Entity id="178ec42a-18b4-47cc-85c7-d62c92fd67f8" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_usa_uk_passport" />
        <Match idRef="Keyword_passport" />
    </Pattern>
</Entity>

Keywords

Keyword_passport

Passport Number

Passport No

Passport #

Passport#

PassportID

Passportno

passportnumber

パスポート

パスポート番号

パスポートのNum

パスポート#

Numéro de passeport

Passeport n °

Passeport Non

Passeport #

Passeport#

PasseportNon

Passeportn °

Format

4-17 digits

Pattern

4-17 consecutive digits

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_usa_bank_account_number finds content that matches the pattern.

  • A keyword from Keyword_usa_Bank_Account is found.

<!-- U.S. Bank Account Number -->
<Entity id="a2ce32a8-f935-4bb6-8e96-2a5157672e2c" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_usa_bank_account_number" />
        <Match idRef="Keyword_usa_Bank_Account" />
    </Pattern>
</Entity>

Keywords

Keyword_usa_Bank_Account

Checking Account Number

Checking Account

Checking Account #

Checking Acct Number

Checking Acct #

Checking Acct No.

Checking Account No.

Bank Account Number

Bank Account #

Bank Acct Number

Bank Acct #

Bank Acct No.

Bank Account No.

Savings Account Number

Savings Account.

Savings Account #

Savings Acct Number

Savings Acct #

Savings Acct No.

Savings Account No.

Debit Account Number

Debit Account

Debit Account #

Debit Acct Number

Debit Acct #

Debit Acct No.

Debit Account No.

Format

Depends on the state

Pattern

Depends on the state -- for example, New York:

  • Nine digits formatted like ddd ddd ddd will match.

  • Nine digits like ddddddddd will not match.

Checksum

No

Definition

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_new_york_drivers_license_number finds content that matches the pattern.

  • A keyword from Keyword_[state_name]_drivers_license_name is found.

  • A keyword from Keyword_us_drivers_license is found.

A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_new_york_drivers_license_number finds content that matches the pattern.

  • A keyword from Keyword_[state_name]_drivers_license_name is found.

  • A keyword from Keyword_us_drivers_license_abbreviations is found.

  • No keyword from Keyword_us_drivers_license is found.

    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_new_york_drivers_license_number" />
        <Match idRef="Keyword_new_york_drivers_license_name" />
        <Match idRef="Keyword_us_drivers_license" />
    </Pattern>
    <Pattern confidenceLevel="65">
        <IdMatch idRef="Func_new_york_drivers_license_number" />
        <Match idRef="Keyword_new_york_drivers_license_name" />
        <Match idRef="Keyword_us_drivers_license_abbreviations" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_us_drivers_license" />
        </Any>
    </Pattern>

Keywords

Keyword_us_drivers_license_abbreviations

Keyword_us_drivers_license

Keyword_[state_name]_drivers_license_name

DL

DLS

CDL

CDLS

ID

IDs

DL#

DLS#

CDL#

CDLS#

ID#

IDs#

ID number

ID numbers

LIC

LIC#

DriverLic

DriverLics

DriverLicense

DriverLicenses

Driver Lic

Driver Lics

Driver License

Driver Licenses

DriversLic

DriversLics

DriversLicense

DriversLicenses

Drivers Lic

Drivers Lics

Drivers License

Drivers Licenses

Driver'Lic

Driver'Lics

Driver'License

Driver'Licenses

Driver' Lic

Driver' Lics

Driver' License

Driver' Licenses

Driver'sLic

Driver'sLics

Driver'sLicense

Driver'sLicenses

Driver's Lic

Driver's Lics

Driver's License

Driver's Licenses

identification number

identification numbers

identification #

id card

id cards

identification card

identification cards

DriverLic#

DriverLics#

DriverLicense#

DriverLicenses#

Driver Lic#

Driver Lics#

Driver License#

Driver Licenses#

DriversLic#

DriversLics#

DriversLicense#

DriversLicenses#

Drivers Lic#

Drivers Lics#

Drivers License#

Drivers Licenses#

Driver'Lic#

Driver'Lics#

Driver'License#

Driver'Licenses#

Driver' Lic#

Driver' Lics#

Driver' License#

Driver' Licenses#

Driver'sLic#

Driver'sLics#

Driver'sLicense#

Driver'sLicenses#

Driver's Lic#

Driver's Lics#

Driver's License#

Driver's Licenses#

id card#

id cards#

identification card#

identification cards#

State abbreviation (for example, "NY")

State name (for example, "New York")

Format

Nine digits that start with a "9" and contain a "7" or "8" as the fourth digit, optionally formatted with spaces or dashes

Pattern

Formatted:

  • The digit "9"

  • Two digits

  • A space or dash

  • A "7" or "8"

  • A digit

  • A space, or dash

  • Four digits

Unformatted:

  • The digit "9"

  • Two digits

  • A "7" or "8"

  • Five digits

Checksum

No

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_formatted_itin finds content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_itin is found.

    • The function Func_us_address finds an address in the right date format.

    • The function Func_us_date finds a date in the right date format.

    • A keyword from Keyword_itin_collaborative is found.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_unformatted_itin finds content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_itin_collaborative is found.

    • The function Func_us_address finds an address in the right date format.

    • The function Func_us_date finds a date in the right date format.

<!-- U.S. Individual Taxpayer Identification Number (ITIN) -->
<Entity id="e55e2a32-f92d-4985-a35d-a0b269eb687b" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_formatted_itin" />
        <Any minMatches="1">
          <Match idRef="Keyword_itin" />
          <Match idRef="Func_us_address" />
          <Match idRef="Func_us_date" />
          <Match idRef="Keyword_itin_collaborative" />
        </Any>
    </Pattern>
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_unformatted_itin" />
        <Match idRef="Keyword_itin" />
        <Any minMatches="1">
          <Match idRef="Keyword_itin_collaborative" />
          <Match idRef="Func_us_address" />
          <Match idRef="Func_us_date" />
        </Any>
    </Pattern>
</Entity>

Keywords

Keyword_itin

Keyword_itin_collaborative

taxpayer

tax id

tax identification

itin

ssn

tin

social security

tax payer

itins

taxid

individual taxpayer

License

DL

DOB

Birthdate

Birthday

Date of Birth

Format

9 digits, which may be in a formatted or unformatted pattern

Note: If issued before mid-2011, an SSN has strong formatting where certain parts of the number must fall within certain ranges to be valid (but there's no checksum).

Pattern

Four functions look for SSNs in four different patterns:

  • Func_ssn finds SSNs with pre-2011 strong formatting that are formatted with dashes or spaces (ddd-dd-dddd OR ddd dd dddd)

  • Func_unformatted_ssn finds SSNs with pre-2011 strong formatting that are unformatted as nine consecutive digits (ddddddddd)

  • Func_randomized_formatted_ssn finds post-2011 SSNs that are formatted with dashes or spaces (ddd-dd-dddd OR ddd dd dddd)

  • Func_randomized_unformatted_ssn finds post-2011 SSNs that are unformatted as nine consecutive digits (ddddddddd)

Checksum

No

Definition

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_ssn finds content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_ssn is found.

    • The function Func_us_date finds a date in the right date format.

    • The function Func_us_address finds an address in the right date format.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_unformatted_ssn finds content that matches the pattern.

  • A keyword from Keyword_ssn is found.

  • At least one of the following is true:

    • The function Func_us_date finds a date in the right date format.

    • The function Func_us_address finds an address in the right date format.

A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_randomized_formatted_ssn finds content that matches the pattern.

  • The function Func_ssn does not find content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_ssn is found.

    • The function Func_us_date finds a date in the right date format.

    • The function Func_us_address finds an address in the right date format.

A DLP policy is 55% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_randomized_unformatted_ssn finds content that matches the pattern.

  • A keyword from Keyword_ssn is found.

  • The function Func_unformatted_ssn does not find content that matches the pattern.

  • At least one of the following is true:

    • The function Func_us_date finds a date in the right date format.

    • The function Func_us_address finds an address in the right date format.

<!-- U.S. Social Security Number (SSN) -->
<Entity id="a44669fe-0d48-453d-a9b1-2cc83f2cba77" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_ssn" />
        <Any minMatches="1">
          <Match idRef="Keyword_ssn" />
          <Match idRef="Func_us_date" />
          <Match idRef="Func_us_address" />
        </Any>
    </Pattern>
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_unformatted_ssn" />
        <Match idRef="Keyword_ssn" />
        <Any minMatches="1">
          <Match idRef="Func_us_date" />
          <Match idRef="Func_us_address" />
        </Any>
    </Pattern>
    <Pattern confidenceLevel="65">
        <IdMatch idRef="Func_randomized_formatted_ssn" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Func_ssn" />
        </Any>
        <Any minMatches="1">
          <Match idRef="Keyword_ssn" />
          <Match idRef="Func_us_date" />
          <Match idRef="Func_us_address" />
        </Any>
    </Pattern>
    <Pattern confidenceLevel="55">
        <IdMatch idRef="Func_randomized_unformatted_ssn" />
        <Match idRef="Keyword_ssn" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Func_unformatted_ssn" />
        </Any>
        <Any minMatches="1">
          <Match idRef="Func_us_date" />
          <Match idRef="Func_us_address" />
        </Any>
    </Pattern>
</Entity>

Keywords

Keyword_ssn

Social Security

Social Security#

Soc Sec

SSN

SSNS

SSN#

SS#

SSID

Share Facebook Facebook Twitter Twitter Email Email

Was this information helpful?

Great! Any other feedback?

How can we improve it?

Thank you for your feedback!

×