+ All Categories
Home > Documents > Understanding the Challenges with Medical Data Segmentation

Understanding the Challenges with Medical Data Segmentation

Date post: 23-Feb-2016
Category:
Upload: nodin
View: 39 times
Download: 0 times
Share this document with a friend
Description:
Understanding the Challenges with Medical Data Segmentation. Ellick M. Chan, Peifung E. Lam, and John C. Mitchell Stanford University. Health Information Exchange (HIE). Health Information Exchange Cloud. Federal HIPAA HITECH State laws on Mental Health Substance Abuse STDs - PowerPoint PPT Presentation
Popular Tags:
36
TRUST: Team for Research in Ubiquitous Secure Technology TRUST Autumn 2013 Conference October 9-10, 2013 | Washington, DC Understanding the Challenges with Medical Data Segmentation Ellick M. Chan, Peifung E. Lam, and John C. Mitchell Stanford University
Transcript
Page 1: Understanding the Challenges with Medical Data Segmentation

TRUST: Team for Research in Ubiquitous Secure Technology

TRUST Autumn 2013 ConferenceOctober 9-10, 2013 | Washington, DC

Understanding the Challenges with Medical Data SegmentationEllick M. Chan, Peifung E. Lam, and John C. MitchellStanford University

Page 2: Understanding the Challenges with Medical Data Segmentation

2TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Federal– HIPAA– HITECH

State laws on– Mental Health– Substance Abuse– STDs– Genetic testing

Organizational

2

Health Information Exchange Cloud

Health Information Exchange (HIE)

Page 3: Understanding the Challenges with Medical Data Segmentation

3TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Compliance approaches Automated Policy3

>|%%Standard rules for "uses and disclosures"permitted_by_164_502_a(A) :- is_from_coveredEntity(A), is_phi(A), (permitted_by_160_C(A); permitted_by_164_502_a_1(A); required_by_164_502_a_2(A)).

permitted_by_164_502_a_1(A):- permitted_by_164_502_a_1_i(A); permitted_by_164_502_a_1_ii(A); permitted_by_164_502_a_1_iii(A); permitted_by_164_502_a_1_iv(A); permitted_by_164_502_a_1_v(A); permitted_by_164_502_a_1_vi(A).

§ 164.502 Uses and disclosures of protected health information: general rules.(a) Standard. A covered entity may not use or disclose protected health information, except as permitted or required by this subpart or by subpart C of part 160 of this subchapter. (1) Permitted uses and disclosures. A covered entity is permitted to use or disclose protected health information as follows: (i) To the individual; (ii) For treatment, payment, or health care operations, as permitted by and in compliance with §164.506; (iii) Incident to a use or disclosure otherwise permitted or required by this subpart, provided that the covered entity has complied with the applicable requirements of §164.502(b), §164.514(d), and §164.530(c) with respect to such otherwise permitted

HIPAA Law

…010110...

Data segmentation

compliantWithALaw( A )

permittedBySomeClause( A ) notForbiddenByAnyClause( A )

AND

permittedByClause1( A

)

clause1Applicable( A )

meetReqClause1( A )

permittedBySomeRefOfClause1( A )

permittedByClauseRef_I,J( A )

AND

notForbiddenByClause1( A )

notForbiddenByClauseM( A )

clauseMNotApplicable( A )

AND

IHI 2012

According to research by the California HealthCare Foundation, 15 percent

of patients who know their information will be shared would hide

information from their doctor, and another 33 percent would consider hiding

information[1].

Health Record

• Medications• Previous

diagnoses• Labs

Sensitive conditions

Page 4: Understanding the Challenges with Medical Data Segmentation

4TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Example

Medications1.  Tylenol2.  Sudafed3.  AZT4.  Bactrim

Problem List1.  Headache2.  Sinus Infection3.  HIV positive4.  UTI

4

Adapted from J. Halamka, 2012

LetterI hope you and your partner had a great weekend in Provincetown and that the thrush has improved with the mouthwash sample I gave you.

Page 5: Understanding the Challenges with Medical Data Segmentation

5TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Example

Medications1.  Tylenol2.  Sudafed3.  AZT4.  Bactrim

Problem List1.  Headache2.  Sinus Infection3.  HIV positive4.  UTI

5

LetterI hope you and your partner had a great weekend in Provincetown and that the thrush has improved with the mouthwash sample I gave you.

Hide HIV/AIDS ICD-9 code 042

Adapted from J. Halamka, 2012

Page 6: Understanding the Challenges with Medical Data Segmentation

6TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Example

Medications1.  Tylenol2.  Sudafed3.  AZT4.  Bactrim

Problem List1.  Headache2.  Sinus Infection3.  HIV positive4.  UTI

6

LetterI hope you and your partner had a great weekend in Provincetown and that the thrush has improved with the mouthwash sample I gave you.

Zidovudine (INN) or azidothymidine (AZT) is a type of antiretroviral drug used for the treatment of HIV/AIDS.

Side effects: anemia, neutropenia, hepatotoxicity, cardiomyopathy, and

myopathy

+side effects

Adapted from J. Halamka, 2012

Page 7: Understanding the Challenges with Medical Data Segmentation

7TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Example

Medications1.  Tylenol2.  Sudafed3.  AZT4.  Bactrim

Problem List1.  Headache2.  Sinus Infection3.  HIV positive4.  UTI

7

LetterI hope you and your partner had a great weekend in Provincetown and that the thrush has improved with the mouthwash sample I gave you.

Trimethoprim/sulfamethoxazole or co-trimoxazole (abbreviated SXT,

TMP-SMX, TMP-SMZ or TMP-sulfa) is a sulfonamide antibiotic combination of trimethoprim and sulfamethoxazole, in

the ratio of 1 to 5, used in the treatment of a variety of bacterial infections.

+side effects

?

Adapted from J. Halamka, 2012

Page 8: Understanding the Challenges with Medical Data Segmentation

8TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Example

Medications1.  Tylenol2.  Sudafed3.  AZT4.  Bactrim

Problem List1.  Headache2.  Sinus Infection3.  HIV positive4.  UTI

8

LetterI hope you and your partner had a great weekend in Provincetown and that the thrush has improved with the mouthwash sample I gave you.

Prophylaxis (preventative med) for immunocompromised patient?

Patient has urinary tract infection (UTI), plausibly deniable case.

+side effects

Adapted from J. Halamka, 2012

Page 9: Understanding the Challenges with Medical Data Segmentation

9TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Example

Medications1.  Tylenol2.  Sudafed3.  AZT4.  Bactrim

Problem List1.  Headache2.  Sinus Infection3.  HIV positive4.  UTI

9

LetterI hope you and your partner had a great weekend in Provincetown and that the thrush has improved with the mouthwash sample I gave you.

Highest rate of same-sec couples in Provincetown, MA.

Karen Christel Krahulik, Provincetown: From Pilgrim Landing to Gay Resort

, NYU Press, 2007, p. 51.

+side effects

Adapted from J. Halamka, 2012

Page 10: Understanding the Challenges with Medical Data Segmentation

10TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Example

Medications1.  Tylenol2.  Sudafed3.  AZT4.  Bactrim

Problem List1.  Headache2.  Sinus Infection3.  HIV positive4.  UTI

10

LetterI hope you and your partner had a great weekend in Provincetown and that the thrush has improved with the mouthwash sample I gave you.

Candidiasis (thrush) - Candidiasis or thrush is a fungal infection (mycosis).

Commonly causes mouth yeast infections, which manifest as white

patches in the mouth. 15% of immuno-compromised patients may develop

this.

+side effects

Adapted from J. Halamka, 2012

Page 11: Understanding the Challenges with Medical Data Segmentation

11TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Example

Medications1.  Tylenol2.  Sudafed3.  AZT4.  Bactrim

Problem List1.  Headache2.  Sinus Infection3.  HIV positive4.  UTI

11

LetterI hope you and your partner had a great weekend in Provincetown and that the thrush has improved with the mouthwash sample I gave you.

Headaches & HIV: 24/535 patients – 4.5% CDC NHDS 2010 dataset.

115,000 patients.

Mononucleosis-like symptoms

+side effects

Adapted from J. Halamka, 2012

Page 12: Understanding the Challenges with Medical Data Segmentation

12TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Disorders

Cause Manifestations

Cause

Treatments

C a u s eEffects

Page 13: Understanding the Challenges with Medical Data Segmentation

13TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Threat Model

Attacker has direct access to redacted health record, medical literature

Attacker does not have the computational capability to circumvent security mechanisms that protect the primary sensitive codes

Page 14: Understanding the Challenges with Medical Data Segmentation

14TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Treatments

14

Concept Description Related Links Notes

Risperidone Treats schizophrenia, bipolar disorder, and autism.

schizophrenia, bipolar disorder, autism, weight gain, insomnia, alopecia

Use of Risperidone usually implies treatment of a mental health disorder.

Carbamazepine Anti-convulsant and mood-stabilizing drug. Treats epilepsy and bipolar disorder.

epilepsy, bipolar disorder, headaches, drowsiness

Primarily used to treat mental health disorders. Could be used off-label to treat Complex regional pain syndrome(ICD9: 337.21)

Citalopram Primarily used as an SSRI to treat depression. Can also be used to treat hot flashes.

depression, hot flashes, anorgasmia, nausea, diarrhea

Can treat both sensitive and non-sensitive conditions.

Lamotrigine Primarily used as an anticonvulsant drug to treat epilepsy and bipolar disorder. Can also treat migraines.

epilepsy, bipolar disorder, migraines

Can be used to treat mental health disorders or migraines.

Page 15: Understanding the Challenges with Medical Data Segmentation

15TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Treatments

15

Concept Description Related Links Notes

Risperidone Treats schizophrenia, bipolar disorder, and autism.

schizophrenia, bipolar disorder, autism, weight gain, insomnia, alopecia

Use of Risperidone usually implies treatment of a mental health disorder.

Carbamazepine Anti-convulsant and mood-stabilizing drug. Treats epilepsy and bipolar disorder.

epilepsy, bipolar disorder, headaches, drowsiness

Primarily used to treat mental health disorders. Could be used off-label to treat Complex regional pain syndrome(ICD9: 337.21)

Citalopram Primarily used as an SSRI to treat depression. Can also be used to treat hot flashes.

depression, hot flashes, anorgasmia, nausea, diarrhea

Can treat both sensitive and non-sensitive conditions.

Lamotrigine Primarily used as an anticonvulsant drug to treat epilepsy and bipolar disorder. Can also treat migraines.

epilepsy, bipolar disorder, migraines

Can be used to treat mental health disorders or migraines.

Page 16: Understanding the Challenges with Medical Data Segmentation

16TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Formal model

Hypothesis{d1, d3}

{d2, d3}

{d1, d2, d3}

{d1, d2}

16

d1

D

C

m1

M

d2 d3

m2 m3 m4

Reggia’s set cover model• Plausibility – set cover• Likelihood – Occam’s razor and fitness

Page 17: Understanding the Challenges with Medical Data Segmentation

17TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Formal model

Hypothesis{d1, d3}

{d2, d3}

{d1, d2, d3}

{d1, d2}

17

d1

D

C

m1

M

d2 d3

m2 m3 m4

Reggia’s set cover model• Plausibility – set cover• Likelihood – Occam’s razor and fitness

d4

m5

Page 18: Understanding the Challenges with Medical Data Segmentation

18TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Explanation of manifestations

Best explanation E of manifestations:– Covers all observed manifestations M+– Is the simplest (parsimonious) definition

Heuristics for “best cover”– Minimality - |E| is minimal– Criticism: minimal cardinality covers can be too restrictive

Occam’s razor vs Hickam’s dictum– Irredundancy – removing any disorder results in a

non-cover of M+– Relevancy – Every d in D must be causally associated with

some m in M+

18

Page 19: Understanding the Challenges with Medical Data Segmentation

19TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Medical concepts

SensitiveConcepts

AIDS

Psychosis

Alcohol Abuse

Diseases

Manifestations Kaposi’s Sarcoma

Cervical Cancer

Stroke

Delusions

Memory Loss

Rotavirus

Schizophrenia

Page 20: Understanding the Challenges with Medical Data Segmentation

20TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Delusion Hallucination Alcoholism HIV+ Stroke Memory loss

Manifestations

Psychosis Alzheimer’s Disease

Diseases

Source: PubMed, NIH.gov

Page 21: Understanding the Challenges with Medical Data Segmentation

21TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Predicate-Reducer definition

21

RhinovirusMental healthSTDsSubstance abuseX-ray[Free text] R

RhinovirusMental healthSTDsSubstance abuseX-ray[Free text]

A – Medical algorithm/Actionπ – Policy determines sensitive code sM – Medical recordPredicate P(M, π) – Determines if s MReducer R(M, π) – Removes s from M

Ideal reducer

Page 22: Understanding the Challenges with Medical Data Segmentation

22TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Inference approach

Input: Reduce(Diseases U Manifestations U Treatments)

Output: Inferred Diseases

1. For each input, evoke hypotheses2. Evaluate hypotheses3. Rank hypotheses according to fitness

Hypothesis fitness–Competing hypotheses, e.g. d1 or d2

22

Page 23: Understanding the Challenges with Medical Data Segmentation

23TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Algorithm overview

23

R(EHR)

Salient Concepts

Docs

Hypotheses

Extract Concepts

Retrieve Documents

Extract and rank Hypo

EHR: 042 (HIV), 112.0 (Thrush), 136.3 (Pneumocystosis)

Page 24: Understanding the Challenges with Medical Data Segmentation

24TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Algorithm overview

24

R(EHR)

Salient Concepts

Docs

Hypotheses

Extract Concepts

Retrieve Documents

Extract and rank Hypo

EHR: 042 (HIV), 112.0 (Thrush), 136.3 (Pneumocystosis)

EHR: 112.0 (Thrush), 136.3 (Pneumocystosis)

Page 25: Understanding the Challenges with Medical Data Segmentation

25TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Algorithm overview

25

R(EHR)

Salient Concepts

Docs

Hypotheses

Extract Concepts

Retrieve Documents

Extract and rank Hypo

EHR: 042 (HIV), 112.0 (Thrush), 136.3 (Pneumocystosis)

EHR: 112.0 (Thrush), 136.3 (Pneumocystosis)

EHR1, EHR2, …, EHR n

Page 26: Understanding the Challenges with Medical Data Segmentation

26TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Algorithm overview

26

R(EHR)

Salient Concepts

Docs

Hypotheses

Extract Concepts

Retrieve Documents

Extract and rank Hypo

EHR: 042 (HIV), 112.0 (Thrush), 136.3 (Pneumocystosis)

EHR: 112.0 (Thrush), 136.3 (Pneumocystosis)

EHR1, EHR2, …, EHR n

Page 27: Understanding the Challenges with Medical Data Segmentation

27TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Algorithm overview

27

Page 28: Understanding the Challenges with Medical Data Segmentation

28TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Concept Support Index

28

d1

{d1, d2}

Page 29: Understanding the Challenges with Medical Data Segmentation

29TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Hypothesis Fitness Index

29

{d1, d2}

Page 30: Understanding the Challenges with Medical Data Segmentation

30TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Results

30Condition Query Results Medical codes Notes

Rett Syndrome

“wringing” AND “female” AND “constipation” AND ”scoliosis”

3 articles suggest Rett Syndrome.

F84.2, R09.0, K59.0, 737.0

Pubmed

Rett Syndrome

“wringing” AND “female” AND “constipation” AND ”scoliosis”

1.73M results, 5 of top 10 results suggest Rett Syndrome, including NIH Medline.

F84.2, R09.0, K59.0, 737.0

Google

AIDS "Toxoplasmosis" AND "Hepatitis B" AND "Encephalopathy" AND "Progressive multifocal leukoencephalopathy" AND "Cryptococcosis”

140,000 results. 5 of top 10 suggest AIDS.

130, 070.2, 348.30, 046.3, 117.5

Google

AIDS ... 18,000 results. >8 of top 10 suggest AIDS.

130, 070.2, 348.30, 046.3, 117.5

Bing

Page 31: Understanding the Challenges with Medical Data Segmentation

31TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Empirical results

Title

Condition Query Results Notes

Depression V17.0 (Family hist of psych cond), 648.44 (MH postpartum)

#2 hypothesis after MH postpartum. 15,766 discharges.

HCup NIS 2011

Substance abuse, suicide

V60.0 (Lack of housing) Depression and substance abuse common in this group. 32,091 discharges. Top hypotheses: 305.1 (tobacco), V62.84 (suicidal ideation), 292.0 (drug withdrawal)

HCup NIS 2011

HIV 112.0 (Thrush), 136.3 (Pneumocystosis)

HIV is the second hypothesis after the query 136.3. Thrush 112.0 is the 3rd hypothesis. 52,472 discharges.

HCup NIS 2011

Page 32: Understanding the Challenges with Medical Data Segmentation

32TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Possible defenses

32

Deniability through relative strengths of hypotheses Hide non-sensitive EHR as well Enhance competing hypothesis, e.g. Citalopram or

immunosuppression Association rule hiding

d1

D

C

m1

M

d2 d3

m2 m3 m4

d4

m5

Page 33: Understanding the Challenges with Medical Data Segmentation

33TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Future work

Epidemiology

Title

CA Hidalgo, N Blumm, A-L Barabasi, NA ChristakisPLoS Computational Biology (2009).

Towards Precision Medicine. National Research Council, 2011.

Page 34: Understanding the Challenges with Medical Data Segmentation

34TRUST Conference – Nov. 15-16, 2012 – Washington, DC

A message from our sponsors…

34

We thank:

• Carl Gunter and Mike Berry - predicate-reducer model

• James Reggia - formalization of the hypothetico-deductive model

• Brad Malin - helpful resources

• Ivan Handler - health information exchange level

• Fisayo Ositelu - medical insight.

Page 35: Understanding the Challenges with Medical Data Segmentation

35TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Questions?

35

Page 36: Understanding the Challenges with Medical Data Segmentation

36TRUST Conference – Nov. 15-16, 2012 – Washington, DC

Ask your physician!

36


Recommended