+ All Categories
Home > Healthcare > Monitoring the quality of data in the clinical use of pathogen genomes

Monitoring the quality of data in the clinical use of pathogen genomes

Date post: 10-Feb-2017
Category:
Upload: health-informatics-new-zealand
View: 119 times
Download: 0 times
Share this document with a friend
21
Monitoring the quality of data in the clinical use of pathogen genomes Dr Tom Conway IBM Research Australia
Transcript
Page 1: Monitoring the quality of data in the clinical use of pathogen genomes

Monitoring the quality of data in the clinical use of pathogen genomes

Dr Tom ConwayIBM Research Australia

Page 2: Monitoring the quality of data in the clinical use of pathogen genomes

Uses of Microbial Genomes: Public Health

Infected Food Microbiological Investigation

Cluster Analysis

Page 3: Monitoring the quality of data in the clinical use of pathogen genomes

Uses of Microbial Genomes: Clinical

Antibiotic Resistance Test

Detecting Resistance Genes

Current Practice Genomics Methods

Page 4: Monitoring the quality of data in the clinical use of pathogen genomes

Typical Genomics WorkflowSample Prep

SequencingSequence Data

AnalyticsReporting InterventionSequence Data

Page 5: Monitoring the quality of data in the clinical use of pathogen genomes

Measuring Sequence Quality

Page 6: Monitoring the quality of data in the clinical use of pathogen genomes

Using a Reference Sequence

alignment

Reference

Sequencesthat failed to align to reference

xx

xx x o

x oxo

xx

Sequence fragment

Page 7: Monitoring the quality of data in the clinical use of pathogen genomes

Sequence Data as Words

1: imped, and shivered, and glafed, and growled; and

2: wind was rushing was the sea; and that the smwll

3: nd broad impression of thk identity of things seem

(from Great Expectations with apologies to Charles Dickens)

Page 8: Monitoring the quality of data in the clinical use of pathogen genomes

Sequence Data as WordsGTGGGTTTTTATCGGCTGGCACATGTGTTGGGGTGGGT TTTATC GCTGGC CATGTG TGGGTT TTATCG CTGGCA ATGTGT GGGTTT TATCGG TGGCAC TGTGTT GGTTTT ATCGGC GGCACA GTGTTG GTTTTT TCGGCT GCACAT TGTTGG TTTTTA CGGCTG CACATG GTTGGG TTTTAT GGCTGG ACATGT

Page 9: Monitoring the quality of data in the clinical use of pathogen genomes

Accumulating Fragments: 10,000

# di

ffere

nt w

ords

word frequency

Page 10: Monitoring the quality of data in the clinical use of pathogen genomes

Accumulating Fragments: 20,000

# di

ffere

nt w

ords

word frequency

Page 11: Monitoring the quality of data in the clinical use of pathogen genomes

Accumulating Fragments: 50,000

# di

ffere

nt w

ords

word frequency

Page 12: Monitoring the quality of data in the clinical use of pathogen genomes

Accumulating Fragments: 100,000

# di

ffere

nt w

ords

word frequency

Page 13: Monitoring the quality of data in the clinical use of pathogen genomes

Accumulating Fragments: 200,000

# di

ffere

nt w

ords

word frequency

Page 14: Monitoring the quality of data in the clinical use of pathogen genomes

Accumulating Fragments: 500,000

# di

ffere

nt w

ords

word frequency

Page 15: Monitoring the quality of data in the clinical use of pathogen genomes

Accumulating Fragments: 1,000,000

# di

ffere

nt w

ords

word frequency

Page 16: Monitoring the quality of data in the clinical use of pathogen genomes

Accumulating Fragments: 1,000,000

# di

ffere

nt w

ords

word frequency

Page 17: Monitoring the quality of data in the clinical use of pathogen genomes

Differentiating True and False Words

# di

ffere

nt w

ords

word frequency

Page 18: Monitoring the quality of data in the clinical use of pathogen genomes

Estimated Genome Size

Isolate Number

Estim

ated

Gen

ome

Size

(Mbp

)

Page 19: Monitoring the quality of data in the clinical use of pathogen genomes

Isolate Number

True

Wor

d Fr

actio

n

True Word Fraction

Page 20: Monitoring the quality of data in the clinical use of pathogen genomes

Why This is Valuable

Quantifiable Robust

Efficient

Species Independent

ActionableInterpretable

Page 21: Monitoring the quality of data in the clinical use of pathogen genomes

The TeamTomConway

JeremyWazny

JustinBedo

MahtabMirmomeni

BenGoudey

KellyWyres

NatalieGunn

HannahHuckstep


Recommended