+ All Categories
Home > Documents > An Overview of Intersect’s GDM System

An Overview of Intersect’s GDM System

Date post: 24-Jan-2016
Category:
Upload: sani
View: 17 times
Download: 0 times
Share this document with a friend
Description:
An Overview of Intersect’s GDM System. Joe Thurbon Intersect. Prelude. The Brief. The research activities and communities it supports  The data management and analysis issues it seeks to resolve The types and volumes of data/metadata it will manage  - PowerPoint PPT Presentation
Popular Tags:
16
An Overview of Intersect’s GDM System Joe Thurbon Intersect
Transcript
Page 1: An Overview of  Intersect’s GDM System

An Overview of Intersect’s GDM System

Joe Thurbon Intersect

Page 2: An Overview of  Intersect’s GDM System

Prelude

Page 3: An Overview of  Intersect’s GDM System

The Brief• The research activities and communities it supports

• The data management and analysis issues it seeks to resolve

• The types and volumes of data/metadata it will manage

• The philosophy behind its design (e.g. why use semantic web technologies, if you are)

Page 4: An Overview of  Intersect’s GDM System

Supported Communities

Page 5: An Overview of  Intersect’s GDM System

Issues Addressed

Page 6: An Overview of  Intersect’s GDM System

Issues Addressed

Page 7: An Overview of  Intersect’s GDM System

Data / Metadata

• We keep– All the short reads– All the bioanalyser output– All the trimmed reads– All the tertiary analysis output ‘that works’– All parameters used to generate all of the above• Chemistry versions• Command line parameters• Species, Locations, etc

Page 8: An Overview of  Intersect’s GDM System

Data CountsPlatform / Metric Tissue Samples

Per YearChunks of Data Per Year

Fields of Metadata Per Year

Illumina with standard multiplexing

~2050 > 12000 ~ 100000

454 with standard multiplexing

~3500 > 14000 ~ 120000

Data SizesPlatform / Metric Untrimmed / Year Trimmed / year

Illumina with standard multiplexing

1.2TB 300GB

454 with standard multiplexing

~2.5TB (?) 600GB

Page 9: An Overview of  Intersect’s GDM System

Philosophy

• Anthropology• Epistemology• History and Philosophy of Science• Bio-informatics

• GDM

Page 10: An Overview of  Intersect’s GDM System

What do people do?

Page 11: An Overview of  Intersect’s GDM System

What to Researchers do?

Page 12: An Overview of  Intersect’s GDM System

What do Scientists do?

Page 13: An Overview of  Intersect’s GDM System

What do Bio-Informaticians do?

Page 14: An Overview of  Intersect’s GDM System

What does GDM Do?

• Allows bio-informaticians to– Stand on the shoulders of giants (including

themselves)– Record their observations– Record their experimental parameters– Manage their data

• Iteratively

Page 15: An Overview of  Intersect’s GDM System

Demo

A results corresponds to a single experiment

• What experiment did I do? (steps)• What parameters did I set? (parameters)• What observations did I make? (outputs)• When did I end up with (data)• What did I start from? (parents)

Metadata + Data + Parents = One Results

Page 16: An Overview of  Intersect’s GDM System

GDM Repository

SCU Ramaciotti

ANU

VELVET

Tcoffeee

VELVET

AC3BLAST

BLASTBLASTVELVET

NCBI EMBLDDBJ


Recommended