Contents
1. Open Data and Data Sharing
2. Barriers to mainstreaming data sharing
3. Global Initiatives
4. CGIAR success stories
5. ICRISAT Data Management Strategy
6. Open Data promotional activities-ICRISAT
7. ICRISAT Data repositories
What is Open Data?
Open data is the idea that certain data should be freely available to everyone to use and republish as they wish, without restrictions from copyrights, patents or other mechanisms of control.
Characteristics:
• Non-proprietary
• Easy to access
• Easy to use
• Machine readable
• Reusable without license
• No cost
• Interoperability
• Redistributable
• Non-personal data
Easy Access as well as Open Access is required to ensure the most effective use of research results
Building Blocks to Open Data
• Leadership and bureaucratic support
• Datasets
• Licences
• Data standards
• Data portals
• Interpretations, interfaces and applications
• Capacity building
• Feedback loops
• Policy and legislative lock-in
Data Sharing
Research using public funds
Increases the impact and visibility of research
Avoiding replication
Leads to new collaborations and partnerships
Barriers to mainstreaming data sharing
• Issues of intellectual property rights
• Commercial use
• Data confidentiality (e.g., personal information)
• Data standards and the relationship between interdisciplinary data (metadata, RDBMS, curation, legacy data; more resources and time needed)
• Recognition and data authorship (ownership and the right to reproduce belong to the institute; authorship of secondary data)
• Data preservation beyond the project life cycle (long-term preservation for future use and project continuity; this has to be done during the project, as part of the project plan)
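The metadata and authorship points above can be made concrete with a small completeness check. This is a minimal sketch only: the field names in `REQUIRED_FIELDS` and the example record are hypothetical and do not represent an ICRISAT or CGIAR metadata standard.

```python
import json

# Hypothetical minimal field set, chosen for illustration only.
REQUIRED_FIELDS = ("title", "creator", "institute", "license",
                   "date_collected", "keywords")

def missing_fields(record):
    """Return the required fields that are absent or empty in a metadata record."""
    return [field for field in REQUIRED_FIELDS if not record.get(field)]

# Illustrative record; all values are made up.
record = {
    "title": "Example household survey (illustrative)",
    "creator": "A. Researcher",
    "institute": "Example Institute",
    "license": "",  # left empty on purpose to show the check
    "date_collected": "2013-06-01",
    "keywords": ["socio-economics", "survey"],
}

print("Missing fields:", missing_fields(record))
print(json.dumps(record, indent=2))
```

Running such a check when a dataset is deposited, rather than at the end of a project, is one way to keep authorship and licensing information from being lost.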
Open Agricultural Research Data Global Initiatives
GFAR : The Global Forum on Agricultural Research
CGIAR
CIARD : Coherence in Information for Agricultural Research for Development
World Bank
USAID : The United States Agency for International Development
GODAN : Global Open Data for Agriculture and Nutrition (G8 collaboration of US and UK Open Data)
Open Agricultural Research Data Global Initiatives
FAO : http://data.fao.org
USA : https://www.data.gov
UK : http://data.gov.uk/data
CGIAR open data initiatives
In November 2013, the CGIAR Consortium hosted a Data Standards Summit
Generation Challenge Program (GCP) : http://www.generationcp.org
AgTrials : http://www.agtrials.org
ASTI : www.asti.cgiar.org
Ethiopia Rural Household Surveys
Chronic Poverty and Long Term Impact Study in Bangladesh
Land Degradation Surveillance Framework : http://gsl.worldagroforestry.org/?q=node/239
Poverty Environmental Network : http://www.cifor.org/pen
VDSA : http://vdsa.icrisat.ac.in/vdsa-vls.htm
Cassavabase : www.cassavabase.org
CIAT Geonetwork
Intergenebank Potato Database
AGROVOC
OpenAGRIS
SINGER : System-wide Information Network for Genetic Resources
ICRISAT Data Management Strategy
• Establishing a process
• System availability
• Cultural change
• Supporting mechanism
• Working with the CGIAR Consortium Office
Decentralised data management platform with central data repository
Need for Data Management
• Centralized data repository
• Data backup/archiving
• Secured data
• Data sharing
• Store the data in different formats for future needs
• Data quality assurance and control
• Decision making by the leadership
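The backup/archiving and quality-control points above are often supported by fixity checks, where a checksum is recorded for every archived file and re-verified later. The following is a minimal sketch using only the Python standard library; the function names and the folder layout are illustrative assumptions, not a description of ICRISAT's actual archiving system.

```python
import hashlib
from pathlib import Path

def sha256_of(path, chunk_size=65536):
    """Compute the SHA-256 checksum of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def build_manifest(folder):
    """Map each file under `folder` (by relative path) to its checksum."""
    root = Path(folder)
    return {
        path.relative_to(root).as_posix(): sha256_of(path)
        for path in sorted(root.rglob("*"))
        if path.is_file()
    }

def changed_files(folder, manifest):
    """List files from an earlier manifest that are now missing or altered."""
    current = build_manifest(folder)
    return sorted(name for name, checksum in manifest.items()
                  if current.get(name) != checksum)
```

A manifest built at archiving time can be stored alongside the backup; calling `changed_files` on a later copy reports files that were lost or silently corrupted.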
Centre | Research data management policy | Data Management Unit | Geoinformatics Unit | Biometrics Unit | Centralized data archiving & sharing
Africa Rice YES YES YES YES Since July 2012
Bioversity In process YES Since Sept. 2013
CIAT YES YES YES In process
CIFOR YES In process
CIMMYT YES Recruiting YES YES In process
CIP YES YES YES YES YES
ICARDA In process YES YES YES In process
ICRAF YES YES YES YES Since 2011
ICRISAT YES YES YES YES YES
IFPRI YES Recruiting No Since 2005
IITA In process In process YES YES In process (Partial shared)
ILRI YES YES YES YES servers, data partial in development
IRRI (Currently being updated) YES YES YES In process
IWMI YES YES YES YES
World Fish YES YES YES YES
Research data infrastructure across the CGIAR centers
Open Data promotional activities @ ICRISAT
• Open Access Week
• Open access and Data Management policy at ICRISAT
• Capacity building activities
• Technology Infrastructure establishment
• Data loss prevention initiative at institute and individual researcher level
Data Repositories @ ICRISAT
• OAR and Dataverse
• Village Dynamics in South Asia (VDSA) data warehouse
• ICRISAT-aWhere Platform: cloud-based M&E and data sharing platform
• AGROBASE
• Genetic resources
• Integrated Breeding Platform (IBP)
• EXPLOREit @ ICRISAT
• ResourceSpace
TL2 & HOPE - Spatial Data management
• Online data storage and sharing capabilities
• Integrated system for baseline, adoption survey and trial data management
• Research analysis with spatial integration
• Cloud computing
VDSA - Socio Economics Data management
1. Integrate socio-economic data into warehouse system
2. Farm field level information to the users
3. Online analytical reports
4. Village Dynamics Database
VDSA Data Management Workflow (Village Level Studies)
[Flowchart. Elements: data collection, primary/raw data, data organization (identifier, schedule name, etc.), data digitization (data entry using CSPro, CSPro database), data cleansing and quality checks, decision "Is the data correct?" with [Yes] and [No] branches, data investigation, schedule, [Export]. Roles shown: Field Investigator, Data entry operators, Data Manager.]
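The "Is the data correct?" decision in the workflow above can be sketched as a routing step: records that pass the quality checks go to export, and the rest go back for investigation. This is an illustrative sketch only; the field names (`household_id`, `household_size`) and the range limits are hypothetical and are not taken from the VDSA schedules or from CSPro.

```python
def check_record(record):
    """Return a list of problems found in one record (empty list means correct)."""
    problems = []
    if not record.get("household_id"):
        problems.append("missing household_id")
    size = record.get("household_size")
    # Hypothetical plausibility range, for illustration only.
    if not isinstance(size, int) or not 1 <= size <= 50:
        problems.append("household_size missing or out of range")
    return problems

def route_records(records):
    """Split records into those ready for export and those needing investigation."""
    to_export, to_investigate = [], []
    for record in records:
        problems = check_record(record)
        if problems:
            to_investigate.append((record, problems))
        else:
            to_export.append(record)
    return to_export, to_investigate

# Made-up example records.
records = [
    {"household_id": "H001", "household_size": 5},
    {"household_id": "", "household_size": 4},
    {"household_id": "H003", "household_size": 120},
]
to_export, to_investigate = route_records(records)
print(len(to_export), "ready for export;", len(to_investigate), "need investigation")
```

Keeping the list of problems with each rejected record gives the field investigator something specific to verify against the paper schedule.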
AGROBASE- Breeding Data management
1. Good pedigree management
2. Generating experimental design plan
3. Managing the genetic data using RDBMS
4. Quick data analysis for multi-location experiments
5. Generating printable field layouts
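To illustrate what generating an experimental design plan involves, here is a minimal sketch of a randomized complete block design (RCBD), in which every entry appears once per block in an independently shuffled order. The slides do not say which designs AGROBASE generates or how; the function name, the entry labels and the output structure are assumptions made for this example.

```python
import random

def rcbd_layout(entries, n_blocks, seed=None):
    """Build an RCBD plan: each block contains every entry once, in random order."""
    rng = random.Random(seed)  # a seed makes the plan reproducible
    layout = []
    plot = 1
    for block in range(1, n_blocks + 1):
        order = list(entries)
        rng.shuffle(order)  # independent randomization within each block
        for entry in order:
            layout.append({"plot": plot, "block": block, "entry": entry})
            plot += 1
    return layout

# Hypothetical entry names, for illustration only.
plan = rcbd_layout(["G1", "G2", "G3", "G4"], n_blocks=3, seed=2013)
for row in plan:
    print(row["plot"], row["block"], row["entry"])
```

The resulting plot list can be printed as a field book or sorted by block to draw a field layout.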
Integrated Breeding Platform (Generation Challenge Program)
Breeding Data management
• Web based – one stop shop for breeding information
• Integrated system to support the day-to-day activities of modern plant breeding
• Centralized platform for the partners, funders, researchers
• Goal to boost crop productivity and resilience
Tablet-based data collection tools
Benefits:
• Significantly lower expenses on a long-term basis
• Time savings in data integration
• Richer, more complete and more accurate data
• Remote deployment to data generators