Date post: | 30-Dec-2015 |
© 2005 IBM Corporation
DB2 Entity Analytic Solutions
Answering the Questions “Who is Who?”, “Who Knows Who?” and “Who Knows Who Anonymously?”
Darren Baldwin, World Wide Manager, Entity Analytics
How do we shorten the distance between detect and preempt?
2 © 2003 IBM Corporation on demand operating environment
2003 IBM Analyst Briefing
ID theft loss = $221 billion
Growing at 300% ($2 trillion by EOY 05)
Aberdeen Group
Inability to uniquely identify clients is the leading cause for failure of CRM applications. Oddly
enough, almost the same information and analysis is needed for AML and CRM – Name,
Address, structure and names of family members and business relationships…
Gartner Group, BCS Banking Analytics Paper
Money Laundering = $600 billion to $1.8 trillion of the world's annual economic activity
Gannett News Service
Criminals are using “front” businesses and “ghost” employees to appear normal - IBM BCS
42% of all ID theft is credit card fraud
MasterCard International
25% of Americans say it’s okay to defraud insurers
Accenture
Collusion between employees and third parties accounts for 48% of all fraudulent activities.
American Banker
Financial Services Companies have some BIG issues!
Information Management
3
What Does the Entity Analytic Solutions Family Do?
1. Disambiguate and aggregate Identities
2. Relate Disambiguated Identities, reveal obvious and non-obvious relationships, generate real-time alerts.
3. “Anonymize” all identities to protect privacy.
[Figure: identity records shown as unreadable character strings, illustrating anonymized data]
EAS’s Powerful Identity Recognition Platform
Some of our Clients
Ultimate Who’s Who and Who Knows Who Systems
Subject based Analysis – Many Degrees
Know Your Customer Compliance and Customer Disambiguation
Hundreds of systems and 100 million records
Employee, Vendor and Customer Relationship & Collusion Detection
Actual picture of one of our Customers
Complaint Department!
When You Don’t Know “Who is Who?”, You Don’t Have a Complete Picture of an Individual!
Identity Resolution detects and identifies a single person from multiple sources even if the data is insufficient, incorrect or fraudulent.
The four source profiles (Profile 1 through Profile 4):
Mark Smith, 11 Burmuda Rd, Hudson MA 01510, Tel# 978-368-5312, SSN# 027-70-3732, EIN# 097376156, DOB 01/03/64
Randy Smith, 11 Burrmuda, Hudson MA 01510, Tel# 978-368-6423, LIC# 1702188364, EIN# 097376156, DOB 01/03/65
Mark Randall Smith, 101 First St, APT 3689, Little Rock, AR 72202, Tel# 501-654-5404, Cell# 978-368-3555, LIC# 027703732, DOB 01/03/64
Mark R Smith, 10 Bermuda, Hutson MA 01510, Tel# 978-368-5312, Cell# 978-368-3555, LIC# 1702188346, PPN# 086588345
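As a rough illustration of how the four profiles could collapse into one person, the sketch below links any two records that share an identifier value (compared on digits only) and groups them with a union-find structure. The field names, the choice of which fields count as identifiers, and the decision to compare values across different field types are assumptions made for this example; the deck does not describe the product's actual matching rules, which would also have to handle misspelled names and addresses.

```python
import re
from collections import defaultdict

# The four profiles from the slide; field names are illustrative assumptions.
profiles = [
    {"name": "Mark Smith", "tel": "978-368-5312", "ssn": "027-70-3732",
     "ein": "097376156", "dob": "01/03/64"},
    {"name": "Randy Smith", "tel": "978-368-6423", "lic": "1702188364",
     "ein": "097376156", "dob": "01/03/65"},
    {"name": "Mark Randall Smith", "tel": "501-654-5404", "cell": "978-368-3555",
     "lic": "027703732", "dob": "01/03/64"},
    {"name": "Mark R Smith", "tel": "978-368-5312", "cell": "978-368-3555",
     "lic": "1702188346", "ppn": "086588345"},
]

# Assumption: only these fields are treated as identifying on their own.
STRONG_FIELDS = ("tel", "cell", "ssn", "ein", "lic", "ppn")


def digits(value):
    """Keep digits only so '027-70-3732' and '027703732' compare equal."""
    return re.sub(r"\D", "", value)


def resolve(records):
    """Group records that share at least one identifier value."""
    parent = list(range(len(records)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    first_seen = {}  # identifier value -> index of first record holding it
    for idx, rec in enumerate(records):
        for field in STRONG_FIELDS:
            if field in rec:
                key = digits(rec[field])
                if key in first_seen:
                    parent[find(idx)] = find(first_seen[key])
                else:
                    first_seen[key] = idx

    groups = defaultdict(list)
    for idx, rec in enumerate(records):
        groups[find(idx)].append(rec["name"])
    return list(groups.values())


entities = resolve(profiles)
print(entities)
```

With this data the records chain together through the shared EIN, the shared telephone number, the shared cell number, and the license number that equals another record's SSN, so all four names end up in a single group.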
Master Consolidated
Unique Entity ID/Unique person Identifier
Step 1: Name Standardization
Step 2: Address Standardization
Step 3: Data Quality
Step 4: Data Enhancement
Entity Resolution
Enterprise Entity ID Resolution
Consolidates Multiple Systems Entity/ID’s
Generates Unique Entity ID
Full Data Attribution Retention & Resolution
Enterprise Entity Analytics
Continuous, real-time, validation and correction of ingested data
Data Silo 1
NAMES
Mark Smith, Mark R Smith, Randall Smith, Mark Randy Smith
ADDRESSES
11 Bermuda, Hudson, MA 02334
10 Burmuda St., Hutson, MA 01512
P.O. Box 12743, Clinton, MA 01510
ADDITIONAL
DOB: 12/13/71
Phone: (508)278-6019, (978)365-6631, (501)661-8044
Work: Zycast Int., Silverback, Kinear
Entity #14465
PII = Personally Identifiable Information
Data Silo 2
Data Silo 3
IBM Software Group | DB2 Information Management Software
IBM Confidential
Identity Resolution – Example Data Points
Name: Last name, First name, Middle name, Other name parts, Generation, Organization name, Aliases, Gender
Location: Address 1, Address 2, Address 3, City, State/province, Postal code, Country, Latitude/longitude
Identifiers: Driver’s license, Grantors, Beneficiaries, References, Emergency contacts, Account Number, Person ID, Tax ID, Business ID, Phone, Email address, … user defined
Attributes: Acct Number Interactions, Date of birth, Circa date of birth, Nationality, Place of birth, Height, Weight, Eye color, Hair color, Dependents, … user defined
The EAS Identity Repository – Identity Folder
Attribute  Value              Source record
Names      Marc R Smith       A-#70001
           Randal Smith       B-#009102
           Mark Randy Smith   C-#6251
Address    123 Main St.       A-#70001
           456 First Street   C-#6251
Phones     (713) 730-5769     A-#70001
           (713) 731-5577     B-#009102
           (713) 731-5577     C-#6251
SSN        537-27-6402        A-#70001
DL         0001133107         A-#70001
           1133107            C-#6251
DOB        06/17/1974         B-#009103
Resolved entity: Mark Smith, Entity #144465
Names in the folder: Mark R Smith, Randall Smith, Mark Randy Smith, Mark Smith
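One way to picture an identity folder is as a mapping from each attribute value to the source records that contributed it, so nothing is lost when records are merged. The sketch below is a minimal version of that idea; the tuple layout, the `build_folder` helper, and the attribute keys are assumptions for illustration, and the values are transcribed from the slide as printed.

```python
from collections import defaultdict

# (source record, attribute, value) rows transcribed from the slide.
observations = [
    ("A-#70001", "name", "Marc R Smith"),
    ("B-#009102", "name", "Randal Smith"),
    ("C-#6251", "name", "Mark Randy Smith"),
    ("A-#70001", "address", "123 Main St."),
    ("C-#6251", "address", "456 First Street"),
    ("A-#70001", "phone", "(713) 730-5769"),
    ("B-#009102", "phone", "(713) 731-5577"),
    ("C-#6251", "phone", "(713) 731-5577"),
    ("A-#70001", "ssn", "537-27-6402"),
    ("A-#70001", "dl", "0001133107"),
    ("C-#6251", "dl", "1133107"),
    ("B-#009103", "dob", "06/17/1974"),
]


def build_folder(entity_id, rows):
    """Hypothetical helper: attribute -> value -> list of contributing sources."""
    attributes = defaultdict(lambda: defaultdict(list))
    for source, attribute, value in rows:
        attributes[attribute][value].append(source)
    return {"entity_id": entity_id, "attributes": attributes}


folder = build_folder("144465", observations)

# Every value keeps its attribution, so a shared phone shows both sources.
phone_sources = folder["attributes"]["phone"]["(713) 731-5577"]
print(phone_sources)
```

Keeping the source list per value is what makes it possible to undo or re-evaluate a merge later, since each fact can still be traced to the record that supplied it.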
Introducing Relationship Resolution
Identity Deltas and Contagious Data Association
Degrees of separation for Investigation
Identify Relationships between Entities
Relationship Resolution
Address Hygiene
Data Quality
Data Enhancement
Name Standardization
Identity Resolution
NAMES
Jack Smith, Jack Black, Jack B Nimber, Jack B Quick, Jack Rabbitt
ADDRESSES
P.O. Box 1227, Denver, CO 80112
415 Lancaster St, Worcester, MA 01609
ADDITIONAL
DOB: 07/03/69
Phone: (303)778-1210, (508)278-6019
Work: U.S. Air Force
Wife: Kendra Clark
Step 6: Knowing “Who Knows Who?” Extends Identities Beyond 360°
Identifies links and identity relationships
Enables powerful case management & analytics
Built on Entity Analytics platform
NAMES
Katherine D. Green, Kate Mills-Green, Katie Green, Kate Mills, Kate Green, Kate M. Green
ADDRESSES
4737 Cimarron Dr., Easton, MA 02334
1 Bourne St., Bolton, MA 01512
P.O. Box 12743, Clinton, MA 01510
ADDITIONAL
DOB: 12/13/71
Phone: (508)278-6019, (978)365-6631, (501)661-8044
Work: Zycast Int., Silverback, Kinear
Step 1: Name Standardization
Step 2: Address Standardization
Step 3: Data Quality
Step 4: Data Enhancement
Entity Resolution
Relationship Resolution and File Folders
Kate Jones, Entity #144465
Account: 200898
Phone1: (978) 365-5312
Addr1: 123 Main St.
Addr2: 456 First Street
Grantor: Joe Albert
Kate Green
Phone: (614) 507-5312
Account: 200898
Addr: 984 Mango
Tom Jones
Addr: 123 Main St.
Phone: (978) 365-5312
Joe Albert
Addr: 456 First Street
Phone: (714) 721-4848
Fraud File, OFAC, Bad List & Unauthorized Use Data
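The file folders above can be related to each other by looking for attribute values that appear in more than one folder. The following sketch builds an inverted index from each (attribute, value) pair to the people holding it and reports every pair of people with something in common. The dictionary layout and the `find_relationships` helper are assumptions for this example, and matching is exact string equality only.

```python
from collections import defaultdict
from itertools import combinations

# Folder contents transcribed from the slide; the layout is illustrative.
folders = {
    "Kate Jones": {"account": ["200898"], "phone": ["(978) 365-5312"],
                   "address": ["123 Main St.", "456 First Street"]},
    "Kate Green": {"account": ["200898"], "phone": ["(614) 507-5312"],
                   "address": ["984 Mango"]},
    "Tom Jones": {"phone": ["(978) 365-5312"], "address": ["123 Main St."]},
    "Joe Albert": {"phone": ["(714) 721-4848"], "address": ["456 First Street"]},
}


def find_relationships(data):
    """Return {(person_a, person_b): set of shared 'kind: value' strings}."""
    index = defaultdict(set)  # (kind, value) -> people holding that value
    for person, attrs in data.items():
        for kind, values in attrs.items():
            for value in values:
                index[(kind, value)].add(person)

    links = defaultdict(set)
    for (kind, value), people in index.items():
        for a, b in combinations(sorted(people), 2):
            links[(a, b)].add(f"{kind}: {value}")
    return dict(links)


links = find_relationships(folders)
for pair, reasons in sorted(links.items()):
    print(pair, sorted(reasons))
```

On this data Kate Jones is tied to Kate Green through the shared account, to Tom Jones through a shared phone and address, and to Joe Albert through a shared address. The same index could also be loaded with watch-list entries so that a match against one of those lists surfaces as just another relationship.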
Address Hygiene
Data Quality
Data Enhancement
Name Standardization
Identity Resolution
OFAC Heat
Lists
World Check
Relationship Resolution
Business Directory
Lists of Interest
PEPs
Relationship Resolution Pipeline Process Association Lists &
Enterprise Interactions
Relationships
Customer Database
IBM Entity Analytics: Banking Example
WHO KNOWS WHO? WHO IS WHO?
Checking: Mrs. Kate Greene, 1 Bourne St, Clinton MA 01510, Tel# 978-365-5312, DOB 07/08/64, LIC# 1702188364
Wire Transfers, Banking: Mrs. Kathleen Greenwood, 10 Sycamor St, Clifton MA 01510, Tel# 614-389-6412, LIC# 7102188364
Credit Card: Ms. Cat Greenspan, 1067 6th Ave, Clinton MA 01510, Tel# 614-389-6412, LIC# 170UYRE-8364, DOB 07/09/66
Savings, Banking: Ms. Kathy Greenwall, 1 Wallace Ave, Clinton MA 01510, Cell# 788-365-4431, LIC# 170UYRE-8364, DOB 07/09/63
Branch Manager: Mr. Earl Easygoing, 3458 Waggoner Rd, Clinton MA 01510, Cell# 788-365-4431, LIC# ATB8576-873, DOB 07/09/78
Recently Added, Suspected Fraudster: Mrs. Linda Sweetheart, 1067 6th Ave, Clinton, MA 01510, Tel: 376-557-5050, LIC# ATB8576-873, Incident happened: 07/09/04
Enterprise Multi-Channel
Trust Verification
Employee Collusion
Branch Phone Logs (07/09/04)
978-365-6631
614-876-8456
376-557-5050
949-657-8128
513-876-2853
217-314-2127
Fraud Networks
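The banking example can be read as a graph in which two parties are connected whenever they share an address, phone number or license value. The sketch below, a simplified assumption rather than the product's method, computes degrees of separation from the suspected fraudster with a breadth-first search. The branch phone log is modelled as one extra node, and only exact value matches count as links.

```python
from collections import defaultdict, deque

# Values transcribed from the slide; each party is reduced to a set of values.
parties = {
    "Kate Greene": {"1 Bourne St", "978-365-5312", "1702188364"},
    "Kathleen Greenwood": {"10 Sycamor St", "614-389-6412", "7102188364"},
    "Cat Greenspan": {"1067 6th Ave", "614-389-6412", "170UYRE-8364"},
    "Kathy Greenwall": {"1 Wallace Ave", "788-365-4431", "170UYRE-8364"},
    "Earl Easygoing": {"3458 Waggoner Rd", "788-365-4431", "ATB8576-873"},
    "Linda Sweetheart": {"1067 6th Ave", "376-557-5050", "ATB8576-873"},
    "Branch phone log": {"978-365-6631", "614-876-8456", "376-557-5050",
                         "949-657-8128", "513-876-2853", "217-314-2127"},
}


def degrees_from(start, records):
    """Breadth-first search over 'shares a value with' links."""
    holders = defaultdict(set)  # value -> parties holding it
    for party, values in records.items():
        for value in values:
            holders[value].add(party)

    distance = {start: 0}
    queue = deque([start])
    while queue:
        current = queue.popleft()
        for value in records[current]:
            for neighbor in holders[value]:
                if neighbor not in distance:
                    distance[neighbor] = distance[current] + 1
                    queue.append(neighbor)
    return distance


degrees = degrees_from("Linda Sweetheart", parties)
for party, hops in sorted(degrees.items(), key=lambda item: item[1]):
    print(hops, party)
```

Under exact matching, the branch manager (shared license value), Cat Greenspan (shared address) and the branch phone log (a call to the fraudster's number) sit one step from the suspected fraudster, while Kathy Greenwall and Kathleen Greenwood sit two steps away. Kate Greene is not reached at all, because her license number differs from Kathleen Greenwood's by two transposed digits, which is the kind of near-match that identity resolution is meant to catch first.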
Have dinner at Great Restaurant
Transactions
However, no one knew that the waiter used a “skimmer” to steal their credit card information.
He spent a lot with their credit cards...
FDIC Member
Bank
WHERE DID THESE CHARGES COME FROM?!
Excellent question, let’s find out!
Fraudulent Merchant Co-location
Joint Investigation
Same Waiter
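Merchant co-location can be approximated by asking which merchant appears in the purchase history of most of the cards that later reported fraud. The sketch below counts merchants across a set of compromised cards and keeps those seen on at least a chosen share of them. The card histories, the merchant names other than Great Restaurant, and the 75% threshold are all hypothetical values for illustration.

```python
from collections import Counter

# Hypothetical purchase histories for cards that later reported fraud.
card_history = {
    "card-1": ["Great Restaurant", "Gas Station", "Bookstore"],
    "card-2": ["Grocery", "Great Restaurant"],
    "card-3": ["Great Restaurant", "Pharmacy", "Gas Station"],
    "card-4": ["Cinema", "Great Restaurant"],
}


def common_merchants(history, min_share=0.75):
    """Merchants visited by at least min_share of the compromised cards."""
    counts = Counter()
    for merchants in history.values():
        counts.update(set(merchants))  # count each merchant once per card
    total = len(history)
    return [(merchant, seen) for merchant, seen in counts.most_common()
            if seen / total >= min_share]


suspects = common_merchants(card_history)
print(suspects)
```

Here only Great Restaurant is common to every compromised card, which is the signal that would point investigators toward a single location and, from there, to the staff who handled the cards.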
Introducing Anonymous Resolution
Original/Clear-Text data remains with owner
Protect individuals’ PII*
Share data safely from enterprise-to-enterprise
*PII - Personally Identifiable Information
Anonymous Resolution Determines “Who is Who & Who Knows Who… Anonymously”
DB2 Anonymous Resolution allows multiple parties to safely and securely share & compare “anonymized” information assets.
Kate Green, 1 Bourne St, Clinton Mass, 01510, VIN# 585789543, Frequent Flyer: 5678965, Tel: 501-247-6645, PPN: 995027890
Tom Sinclair, 4909 Battery Lane, Bethesda MD 20814, Acct# 97836553122, Acct# 00303450009, Tel# 501-603-0882, Frequent Flyer: 5678965
[Figure: each record shown after anonymization as unreadable character strings]
The One-Way Hash
Result: An irreversible digital code (not encryption)
Use: Fundamental building block used in modern cryptography
INPUT VALUE: Mark Smith
One-Way Hash Function (e.g., MD-5 or SHA-1)
OUTPUT VALUE: One-Way Hash Value (MD-5) cbd034409c22929518fa494f99dc9964
The One-Way Hash – Infinitely Sensitive
INPUT VALUE: Mark Smith
One-Way Hash Function
OUTPUT VALUE: One-Way Hash Value (MD-5) cbd034409c22929518fa494f99dc9964
INPUT VALUE: Mark R Smith
One-Way Hash Function
OUTPUT VALUE: One-Way Hash Value (MD-5) 56429da8c660e5b1f35e2b2f8ad27c91
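The sensitivity of a one-way hash is easy to observe with Python's standard `hashlib` module. The sketch below hashes the two names with MD5 and counts how many of the 128 output bits differ. The helper names are made up for this example, and the digests it prints are not claimed to match the ones on the slide, since the exact input encoding and any preprocessing used there are not stated.

```python
import hashlib


def md5_hex(text):
    """MD5 digest of the UTF-8 encoded text, as 32 hex characters."""
    return hashlib.md5(text.encode("utf-8")).hexdigest()


def differing_bits(hex_a, hex_b):
    """Number of bit positions in which two equal-length hex digests differ."""
    return bin(int(hex_a, 16) ^ int(hex_b, 16)).count("1")


a = md5_hex("Mark Smith")
b = md5_hex("Mark R Smith")

print(a)
print(b)
print("bits that differ out of 128:", differing_bits(a, b))
print("same input, same digest:", md5_hex("Mark Smith") == a)
```

The same input always yields the same digest, while adding a single middle initial produces an unrelated-looking value. That is why exact hash comparison cannot tolerate spelling variation, and why variation has to be handled before hashing.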
What’s in this Database?
Record    Field          Variant        One-way hash value
A-100031  First Name     Original       cbd034409c22929518fa494f99dc9964
A-100031  First Name     Standardized   9269bb3bc60366245144cbd5e960cfd8
A-100031  Last Name      Original       b835b521c29f399c78124c4b59341691
A-100031  Date of Birth  Original       799709b2e5f26f796078fd815bebf724
A-100031  Date of Birth  Swapped mm/dd  40ddba83c22acc2acaddff12c66d7adf
A-100031  Date of Birth  Circa DOB      e4310b75f2fa9595f8154411924b19b1
No identity data is in the clear; nothing is discarded.
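Because a hash only matches on identical input, the variants in the table have to be generated before hashing. The sketch below produces the same six kinds of rows for one record and shows two differently entered records still matching on several hashed variants. The nickname table, the second record ID, the sample names and dates, the uppercase standardization rule, and the reading of “Circa DOB” as year only are all assumptions made for this illustration.

```python
import hashlib

# Hypothetical nickname table used for first-name standardization.
NICKNAMES = {"RANDY": "RANDALL", "KATE": "KATHERINE"}


def digest(value, algorithm="md5"):
    """One-way hash of a text value; MD5 mirrors the slide's example."""
    return hashlib.new(algorithm, value.encode("utf-8")).hexdigest()


def anonymize(record_id, first_name, last_name, dob):
    """Return (record, field, variant, hash) rows; dob is mm/dd/yyyy."""
    month, day, year = dob.split("/")
    upper = first_name.strip().upper()
    variants = [
        ("First Name", "Original", first_name),
        ("First Name", "Standardized", NICKNAMES.get(upper, upper)),
        ("Last Name", "Original", last_name),
        ("Date of Birth", "Original", dob),
        ("Date of Birth", "Swapped mm/dd", f"{day}/{month}/{year}"),
        ("Date of Birth", "Circa DOB", year),  # assumption: year only
    ]
    return [(record_id, field, variant, digest(value))
            for field, variant, value in variants]


rows_a = anonymize("A-100031", "Randy", "Smith", "01/03/1964")
rows_b = anonymize("B-200017", "Randall", "Smith", "03/01/1964")

# Which hashed variants of record A also occur among record B's hashes?
digests_b = {row[3] for row in rows_b}
matched = [(field, variant) for _, field, variant, d in rows_a if d in digests_b]
print(matched)
```

Record A's original first name does not match anything in record B, but its standardized first name does, and its original date of birth matches record B's swapped one. All comparisons are made on hash values, so neither side needs to see the other's clear text.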
Data Source Anonymization
Bank A Suspects, Bank B Suspects and Bank C Suspects each pass through an Anonymizer, which produces that bank’s Output.
Anonymized Output processed by the Resolver
Anonymous Resolution
Ingest: the anonymized Output from each bank
Real-time ALERTS
Banks A, B and C are all suspicious of the same guy!
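The flow from anonymizer to resolver can be sketched in a few lines: each bank hashes its own suspect list locally, only the hashes are sent on, and the resolver raises an alert when the same hash arrives from more than one bank. The suspect lists, the normalization rule (uppercase, collapsed whitespace) and the function names below are hypothetical. MD5 is used only to mirror the slides, and any other `hashlib` algorithm could be substituted.

```python
import hashlib
from collections import defaultdict

# Hypothetical suspect lists; each stays in clear text only at its own bank.
bank_suspects = {
    "Bank A": ["Mark Smith", "Kate Green"],
    "Bank B": ["Tom Sinclair", "mark  smith"],
    "Bank C": ["MARK SMITH", "Joe Albert"],
}


def anonymizer(names):
    """Runs at the data owner: normalize each name, then hash it."""
    hashed = set()
    for name in names:
        normalized = " ".join(name.upper().split())
        hashed.add(hashlib.md5(normalized.encode("utf-8")).hexdigest())
    return hashed


def resolver(outputs, min_sources=2):
    """Runs centrally on hashes only: report values seen at several banks."""
    seen = defaultdict(set)
    for bank, hashes in outputs.items():
        for value in hashes:
            seen[value].add(bank)
    return {value: sorted(banks) for value, banks in seen.items()
            if len(banks) >= min_sources}


outputs = {bank: anonymizer(names) for bank, names in bank_suspects.items()}
alerts = resolver(outputs)
for value, banks in alerts.items():
    print("ALERT", value, "reported by", ", ".join(banks))
```

The resolver learns that one anonymized identity is on all three lists without ever holding a name. Each bank can then look up the alerted hash in its own records to see who it refers to.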