+ All Categories
Home > Documents > Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application ontology

Date post: 29-Jan-2016
Category:
Upload: desma
View: 41 times
Download: 0 times
Share this document with a friend
Description:
Exploiting semantic technologies to build an application ontology. James Malone PhD, Helen Parkinson PhD, Tomasz Adamusiak Phd, MD. Overview. Motivation Our use cases Annotating HTP experimental data Integrating clinical data Methodology for creating the ontology - PowerPoint PPT Presentation
Popular Tags:
25
Exploiting semantic technologies to build an application ontology James Malone PhD, Helen Parkinson PhD, Tomasz Adamusiak Phd, MD
Transcript
Page 1: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application ontology

James Malone PhD, Helen Parkinson PhD, Tomasz Adamusiak Phd, MD

Page 2: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Overview

• Motivation• Our use cases• Annotating HTP experimental data • Integrating clinical data

• Methodology for creating the ontology• Semi-automated mapping and manual curation

• Current ontology usage• Future use

Page 3: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

22.04.233

Our Use Cases

• Query support (e.g, query for 'cancer' and get also 'leukemia')• Over-representation analysis in groups of samples (analogous to the

use of GO terms in over-representation analysis in groups of genes)• Data visualisation – e.g., presenting an ontology tree to the user of

what is in the database• Data integration by ontology terms – e.g., we assume that 'kidney' in

independent studies roughly means the same, so we can count how many kidney samples we have in the database

• Intelligent template generation for different experiment types in submission or data presentation

• Summary level data • Nonsense detection – e.g. telling us that something marked as

cancer can not be marked as healthy

Page 4: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

4

Scope of Experimental Factor Ontology (EFO)

• Modelling all of the experimental factors that are currently present in the ArrayExpress repository

• Experimental factors are variable aspects of an experiment design which can be used to describe an experiment

• Scope is primarily determined by data currently held in ArrayExpress

species levelsample leveldevelopmental

stage

clinical conditions level (e.g. disease)

Page 5: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Developing an Experimental Factor Ontology22.04.235

‘Experimental Factors’

Page 6: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

22.04.236

Annotating High Throughput Data

• Text mining at data acquisition• Ontology driven queries• Data mining

22.04.236

acquire

246,000

assays

Experiment queries > 200 species

Public/Private

Genes in ExptsRe-annotate

Gene level queries, 9 species

Public Only

ATLASSummarize

Ranked gene/

condition queries

Page 7: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Integrating Clinical Data

• Use cases include: • Homologizing clinical data for study designs (e.g. GWAS)• …

Page 8: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

22.04.238

Building the Experimental Factor Ontology• Position of EFO in the ‘bigger picture’• Key is orthogonal coverage, reuse of existing resources

and shared frameworks

EFO

Disease Ontology Anatomy Reference Ontology

Cell Type Ontology

Chemical Entities of Biological Interest

(ChEBI)

Various Species Anatomy

Ontologies

Relation Ontology

Text mining

Page 9: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Semi-automated mapping text to ontology

• Following an evaluation from Tim Rayner we selected Double Metaphone algorithm

• Perform matching of our values in database to ontology class labels and definitions.

• Also perform mappings from EFO to other ontologies, so that EFO: cancer = NCI: cancer, DO: cancer et al.

• Sanity checking over mappings before adding to ontology

Page 10: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Mapping using Agent Technology

User

Bioontologies

Search Engines

Repositories

Querying Mechanism

Component 2: ontology mappings

Component 3: ontology discovery

Component 1: MAS architecture

Page 11: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

What does agent technology buy us?

• Annotation consistency• EFO_1001214 is now inconsistent

because DO_15654 has new parent

• Richer mappings (hence annotations)• EFO_1000156 can have new mappings

because new cancer class found in MIT ontology

• New potentially relevant ontologies• New ontology found relating to molecular + pathways• Semantic web compatible (i.e. can be deployed as

standards compliant service)

Page 12: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

EFO Axes

EFO

process

material

information

material property

site

Page 13: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Process

process

Page 14: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Information

information

Page 15: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Material

material

Page 16: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Material Property

material property

Page 17: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Using the ontology: Querying

• Public repository of gene expression data• Multiple sources – direct submissions, external databases• >200 species• 8400 experiments, 246,000 assays

22.04.2317

Page 18: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

22.04.2318

Using the ontology: Atlas Querying

Page 19: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Using the ontology: Exposing data via external resources• NCBO Bioportal

Developing an Experimental Factor Ontology22.04.2319

Page 20: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Using the Ontology:Detecting Nonsense: Enforcing correctness

cell line (Hela)

organism part (cervix)

cell type (epithelial)

disease (cervical adenocarcinoma)

species (human)

Page 21: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Using the Ontology:Detecting Nonsense: Enforcing correctness

organism part (hair follicle)

disease (cardiovascular disease)

species (human)

Page 22: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Using Ontology: Integrating Clinical data for Study Design

Page 23: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Future Work for EFO

• Mapping in external ids on request – Snomed-CT, FMA, ChEBI, Brenda tissue ontology etc

• API development for serving external ids from AE• Working with external ontologies to produce cross products• Extensions for clinical data capture Gen2Phen, Engage• Extensions for mouse model of human disease queries• Addressing ‘temporal dimension’• Addition of units• Improving query implementation in ArrayExpress Atlas – GUI

changes• Addition of synonyms • Semantic clustering of experiments

Page 24: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Conclusion

• Ontology development for text mining, annotation, query Built with our needs in mind, however covers a wide range of experimental variables across a wide range of technologies, extensible, open source

• Xref’d to existing ontology resources when possible• Text mining works, reduces the workload• 1.0 is released on April 1st 2009• 0.10 version currently available in OLS and NCBO

bioportal• http://www.ebi.ac.uk/ontology-lookup/browse.do?ontName=EFO• http://www.ebi.ac.uk/microarray-srv/efo/

Page 25: Exploiting semantic technologies to build an application ontology

Exploiting semantic technologies to build an application [email protected]

Acknowledgments• Ontology creation:

• James Malone, Helen Parkinson, Tomasz Adamusiak, Ele Holloway

• Mapping tools and text mining evaluation:• Tim Rayner, Holly Zheng

• External Specialist Review:• Trish Whetzel, Jonathan Bard

• AE Team:• Anna Farne, Ele Holloway, Margus Lukk, Eleanor Williams, Tony

Burdet, Alvis Brazma, Misha Kapushesky• EBI Rebholz Group (Whatizit text mining tool)• EC (Gen2Phen,FELICS,MUGEN, EMERALD, ENGAGE, SLING),

EMBL, NIH

Developing an Experimental Factor Ontology22.04.2325


Recommended