Date post: | 17-Nov-2014 |
Category: |
Technology |
Upload: | tom-loughran |
View: | 478 times |
Download: | 0 times |
Jeff Sale and Diane BaxterSan Diego Supercomputer CenterUniversity of California, San Diego
Data: Evidence to unravel themysteries of our Universe
Data provide answers to our children’s single most persistent question:
How do you know that?
Data Come From Every Field . . .
Astronomy
Physics
Life Sciences
Modeling and Simulation
Data Managementand Mining
GAMESS
Geosciences
And are shared around the world.
SDSC
PRAGMA: Pacific Rim GridMiddleware Consortium TeraGrid: National
Research Resource Grid
GEON: GeosciencesGrid
BIRN: Biomedical Informatics Grid
Open Science Grid: Physics-driven Grid
infrastructureNEES: Earthquake
Engineering Grid
DisciplinaryDisciplinaryDatabasesDatabasesUsersUsers
Portals, Domain Portals, Domain Specific APIsSpecific APIs
provide accessprovide accessto datato data
MiddlewareMiddlewarefederates datafederates data
across disciplinaryacross disciplinaryvocabulariesvocabularies
OrganismsOrganisms
OrgansOrgans
CellsCells
AtomsAtoms
BiopolymersBiopolymers
OrganellesOrganelles
Cell BiologyCell Biology
AnatomyAnatomy
PhysiologyPhysiology
ProteomicsProteomics
Medicinal ChemistryMedicinal Chemistry
GenomicsGenomics
Life Sciences
How much data are we producing*?
Kilo 103
Mega 106
Giga 109
Tera 1012
Peta 1015
Exa 1018
1 human brain at the
micron level = 1 PetaByte
1 novel = 1 MegaByte
iPod Shuffle (up to 120 songs) = 512 MegaBytes
Printed materials in the Library of Congress = 10 TeraBytes
SDSC HPSS tape archive = 25 PetaBytes and growing
All worldwide information in one year = 2 ExaBytes
1 Low Resolution
Photo = 100 KiloBytes
* Rough/average estimates
1 DVD = 9.4 GigaBytes
Computational tools are essential to comprehend that much data!
Integrate vast data collections from a wide variety of collection points
Visualize empirical results
Create mathematical models based on complex, interconnected data
Computational ModelsExtend beyond data to:
Predict
Ask “what if” questions
Evaluate alternate hypotheses
Visualize vast, complex data
collectionsManipulate multiple variables
Why is Data Literacy So Essential?Data = the foundation of science
Data shared can solve problems
Data can bridge and connect fields, ideas and peopleComputation, the “third leg” of research, depends upon data
From atomic interaction data that form a model of a molecular dynamics
To light emission data from long-dead stars that explain the origins of the Universe
Teaching about data means teaching the “language and currency” of science.
A Window to Data
The Discover Data Education Portal