+ All Categories
Home > Documents > Copyright © 2011, Oracle and/or its affiliates. All rights reserved.

Copyright © 2011, Oracle and/or its affiliates. All rights reserved.

Date post: 27-Dec-2015
Category:
Upload: ursula-powell
View: 218 times
Download: 0 times
Share this document with a friend
14
Copyright © 2011, Oracle and/or its affiliates. All rights reserved.
Transcript

Copyright © 2011, Oracle and/or its affiliates. All rights reserved.

Copyright © 2011, Oracle and/or its affiliates. All rights reserved.

Big Ideas in Big Data?French-British Workshop on Big Data - London, November 2012

Monica MarinucciDirector of Research, Oracle Global Education & Research Industry Unit

Copyright © 2011, Oracle and/or its affiliates. All rights reserved.

0100020003000400050006000700080009000

10000110001200013000140001500016000170001800019000200002100022000

1986 1989 1993 1995 1998 2000 2003 2005 2007 2015 2020

Year

The volume of earth-observation data from European Space Agency’s satellites passed 3PB in 2007 and the projection for 2020 is seven-fold

The volume of worldwide climate data is expanding rapidly, creating challenges for both physical archiving and sharing, for ease of access of relevant information in a multidisciplinary environment

Big Data in Research: Volume

Exponential growth in data and the ability to access critical information

VolumeVery large

quantities of data

Copyright © 2011, Oracle and/or its affiliates. All rights reserved.

VelocityExtremely

fast streams of data

In high energy physics, the Large Hadron Collider generates 60TB of data per day

The LOFAR Radio-Interferometre is producing 1.6TB/sec setting new frontiers for radio-astronomy

Big Data in Research: Velocity

Rapid growth in speed of data generation

© CERN

Copyright © 2011, Oracle and/or its affiliates. All rights reserved.

VarietyWide range of

data type characteristics

The proposed Large Synoptic Survey Telescope will record 30 trillion bytes of image data every day

In genomics on average scientists can fully sequence 167 individuals per week, generating 250GB of images or 200 movie files

Big Data in Research: Variety

Enterprise infrastructure ability to quickly accommodate new data sources

© CERN

© CERN

Copyright © 2011, Oracle and/or its affiliates. All rights reserved.

ValueHigh potential value

if harnessed correctly

In genomics the cost of sequencing is dropping by 50% every 5 months

“… analysis, not sequencing, will be the main expense hurdle” (Chris Ponting , University of Oxford, UK in Feb 2011 Article “Will Computers crash Genomics?”)

Big Data in Research: Value

Ability to translate raw data into information and knowledge

Copyright © 2011, Oracle and/or its affiliates. All rights reserved.

New Frontiers in silico

http://compbio.cs.toronto.edu/l

http:// http://onlyhdwallpapers.com

Materials Science: Nanotube compositesNature 447

The Carleton Wind Turbine

http://www.bcu.ac.uk/elss

• (Extremely) Large Data VolumesStorageMetadata

AccessExascale computing

• Global CollaborationsData sets integrationLarge scale simulations & modeling

Context basedVisualisation

• Cross-Discipline ResearchCross-breeding of technology and innovative methods inspired by new

collaborations and exchange of methods and approaches

Copyright © 2011, Oracle and/or its affiliates. All rights reserved.

Oracle Labs

• To look for novel approaches and methodologies• To focus on real-world outcomes: to develop

technologies that will someday play a significant role in the evolution of technology and society. • 4 main areas:

•Exploratory research•Directed research•Consulting•Product incubation

Copyright © 2011, Oracle and/or its affiliates. All rights reserved.

Erasmus Medical Centre

Thanks to an Exadata-based solution, Erasmus Medical Centre achieved:

• For a 11 minute query, Exadata could improve it to 1 second, which is a major advantage for researchers to have immediate results

• Smart Scan and Flash Card : give performance in analyzing data.

• Hybrid Columnar Compression : gives performance in the ability to manipulate Tb of data (compression from 133 Gb to 11 Gb), with increased performance.

• Adding Oracle Database 11g features like partitioning gives more performance in manipulating, quantifying data obtained through the study of various genomes

Challenges

Results

• Complex data processing and analysis.• Ability to

• load huge data information in minimum time• store these data and their genomic DNA research results on storage disk• have an efficient system able to give them query performance

More information in the Press Release: Erasmus Medical Center employs Oracle Exadata for DNA researchhttps://emeapressoffice.oracle.com/Press-Releases/Erasmus-Medical-Center-employs-Oracle-Exadata-for-DNA-research-1a0e.aspx

Copyright © 2011, Oracle and/or its affiliates. All rights reserved.

PCA CLUSTERS HEATMAP CHEMICAL STRUCTURES CHROMOSOMES

BRAIN ATLAS PATIENT CORRELATION PATHWAY NETWORKS DNA, RNA & PROTEIN SEQUENCING DATA

Visualisation

Ref:

Allele1

Allele2

How is every record related to every other?

What is the range and distribution of values?

What is the range and distribution of values?

What is the range and distribution of values?

Courtesy of Prof. Peter van der Spek, Erasmus Medical Centre

What is the underlying natural sequence variation?

What are the supported regulatory relationships?

How are the numeric attributes correlated?

What are the major themes or concepts?

Copyright © 2011, Oracle and/or its affiliates. All rights reserved.

Innovating with …

© CERN

Copyright © 2011, Oracle and/or its affiliates. All rights reserved.

… however …

Copyright © 2011, Oracle and/or its affiliates. All rights reserved.

Q&AThank you

Copyright © 2011, Oracle and/or its affiliates. All rights reserved.


Recommended