Date post: | 14-Apr-2017 |
Category: |
Documents |
Upload: | troy-martin |
View: | 41 times |
Download: | 2 times |
turnkey data for healthcare heroes
2
data science today
3
problem areas
domain expertisePublic & proprietary datasets are spread
across many catalogs, not all online, so
finding the right dataset is time consuming
privacy & complianceDatasets have different owners, so
complying with multiple licenses, attribution
& reporting terms is an ongoing burden
data engineeringFormatting, optimizing and loading data into
your big data or data science platform of
choice requires substantial effort
data qualityEach dataset has different errors, missing
values, outliers, gaps, flurries, biases, typos –
requiring substantial manual effort to clean
data evolutionDatasets are updated on different schedules,
creating an operational burden to keep them
up to date
data integrationDatasets from different sources give different
meaning & assumptions to similarly named
concepts, making joins semantically wrong
4
DataOps defined
domain expertise◦ Find or create the right data sets
◦ Enrich by experts or by joining data sources
◦ Access to proprietary or hard-to-find data
privacy & compliance◦ Data license compliance support
◦ Ongoing reporting, attribution, disclosures
◦ Data monetization & secure licensing
data engineering◦ Format data optimally for target platform
◦ Automatically load the data & metadata
◦ Auto-update governance & lineage catalogs
data quality◦ Automated cleansing & validation rules
◦ Curated and auto-validated metadata
◦ Quality scores and beyond: Find outliers,
gaps, flurries, biases & provenance issues
data evolution◦ Track changes in all source data
◦ Deliver clean, versioned updates daily
◦ Support overwrite-on-update
◦ Mappings for terminology changes
data integration◦ Semantic inter-operability
◦ Unified type system & constraints
◦ Unified metadata specification
◦ Automated mappings to data platforms
5
how it works
tell us what you’re buildingWe have clinicians & data science experts who
speak your language. Just explain your goal,
your platform and what help you need.
we’ll prepare the dataWe will research, curate, clean, license,
format, load, update & document all the
datasets your project requires.
so you can go build itFocus on data science, and leave data
operations to us. We take care of updates,
integration, compliance and support you.
I want to predict patients at risk
for chronic kidney disease
I want to automatically generate
ICD-10 codes from clinical notes
I want to auto-recommend diets
that match patients’ treatment plan
I want to monitor and alert on
shifts in drug pricing & shortages
Continuously tested with the latest big data
& data science platform for one-step load
Ready to load
Up to 100x speedup on Hadoop & Spark
clusters thanks to Parquet serialization
Optimized
It’s as if all your health data came from one
clean source. Somewhere, pigs are flying.
Inter-operable
Get updates as they happen, not worrying
worry about broken schemas or identifiers.
Always up to date
Turnkey data
Clean30+ automated validation rule sets run
on the data and metadata to ensure
correct, complete, same representation
– including units, currencies, locations,
timestamps, dates and missing values
Problem specificData provenance (sampling method,
data collection methodology, publisher,
conflicts of interest, freshness, gaps)
documented and verified by a domain
expert against your project
CompliantKnow in advance you have the right
data license for your business model,
geographic target and team. Remain
proactively compliant with reporting,
audits, attribution and privacy terms.
Quality data
8
domain expertise
88%
On our team
% MSc or MA
36%
On our team
% MD or PhD
PharmaClinical Revenue
Cycle
Public
HealthCyber
9
let’s connect
address
16192 Coastal Highway
Lewes, DE 19958, USA
( +1 (302) 786-5227
online
www.JohnSnowLabs.com
twitter.com/johnsnowlabs
company/johnsnowlabs
thank you.
© 2016 John Snow Labs Inc. All rights reserved. The John Snow Labs logo is a trademarks of John Snow Labs Inc. The included information is for informational purposes only and represents the current
view of John Snow Labs as of the date of this presentation. Since John Snow Labs must respond to changing market conditions, it should not be interpreted to be a commitment on its part, and John Snow
Labs cannot guarantee the accuracy of any information provided after the date of this presentation. John Snow Labs makes no warranties, express or statutory, as to the information in this presentation.