chukka srinivasa rao data management unit -...

27
Chukka Srinivasa Rao Data Management Unit - ICRISAT

Upload: others

Post on 25-May-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Chukka Srinivasa Rao Data Management Unit - ICRISAT

Contents

1. Open Data and Data Sharing 2. Barriers to mainstreaming data sharing

3. Global Initiatives

4. CGIAR success stories

5. ICRISAT Data Management Strategy

6. Open Data promotional activities-ICRISAT

7. ICRISAT Data repositories

What is Open Data?

Open data is the idea that certain data should be freely available to everyone to use and republish as they wish, without restrictions from copyrights, patents or other mechanisms of control.

interoperability

Characteristics:

Non-Proprietary

East to access

Easy to use Machine readable

Reusable without license

No cost Interoperability

Re distribute

Non- personal

data

Easy Access as well as Open Access is required to ensure the most effective use of research results

Open Access to Publication

Open Data

Open Access and Open Data

Building Blocks to Open Data

Leadership and bureaucratic

support Datasets Licences

Data standards Data portals Interpretations, interfaces and applications

Capacity building

Feedback loops Policy and

legislative lock-in

Data Sharing

Research using public fund

Increases the impact and visibility of research

Avoiding replication

Leads to new collaborations and partnerships

• Issues of intellectual property rights • commercial use

Barriers to mainstreaming data sharing

• Data confidentiality ( Ex: personal information)

• Data standards and relationship between

interdisciplinary data ( metadata , RDBMS, curation, legacy data,

more resource and time needed)

• Recognition and data authorship ( ownership and right to

reproduce belongs to institute , authorship to sec. data)

• Data preservation beyond project life cycle( Long term

preservation for future use, project continuity. This has to be done during the project with project plan)

Open Agricultural Research Data Global Initiatives

GFAR The Global Forum on Agricultural Research

CGIAR

CIARD Coherence in Information for Agricultural Research for

Development

World Bank

USAID The United States Agency for International Development

GODAN (G8- collaboration of US and UK Open Data) Global Open Data for Agriculture and Nutrition

Open Agricultural Research Data Global Initiatives

FAO : http://data.fao.org

USA : https://www.data.gov

UK : http://data.gov.uk/data

CGIAR open data initiatives

November 2013, the CGIAR Consortium hosted a Data Standards Summit

Generation Challenge Program ( GCP) http://www.generationcp.org AgTrials : http://www.agtrials.org

ASTI : www.asti.cgiar.org

Ethiopia Rural Household Surveys

Chronic Poverty and Long Term Impact Study in Bangladesh

Land Degradation Surveillance Framework : http://gsl.worldagroforestry.org/?q=node/239

Poverty Environmental Network : http://www.cifor.org/pen

VDSA : http://vdsa.icrisat.ac.in/vdsa-vls.htm

Cassavabase : www.cassavabase.org

CIAT Geonetwork Intergenebank Pototo Database

AGROVOC Open AGRIS

SINGER : Systems-Wide Information Network for Genetic Resources

CGIAR open data initiatives

ICRISAT Data Management Strategy

ICRISAT Data Management Strategy

• Establishing a process

• System availability

• Cultural change

• Supporting mechanism

• Working with the CGIAR Consortium Office Decentralised data management platform with central data repository

Need for Data Management

• Centralized Data Repository • Data Backup/Archiving • Secured data • Data Sharing • Store the data in different formats for the

future needs • Data quality assurance and control • Decision Making by the Leadership

ICRISAT Data management -workflow

Centre

Research data

Management

Policy

Data Management

Unit Geoinformatics Unit

Biometrics

Unit

Centralized Data

Archiving & sharing

Africa Rice YES YES YES YES Since July 2012

Bioversity In process YES Since Sept. 2013

CIAT YES YES YES In process

CIFOR YES In process

CIMMYT YES Recruiting YES YES In process

CIP YES YES YES YES YES

ICARDA In process YES YES YES In process

ICRAF YES YES YES YES Since 2011

ICRISAT YES YES YES YES YES

IFPRI YES Recruiting No Since 2005

IITA In process In process YES YES In process (Partial shared)

ILRI YES YES YES YES servers, data partial in

development

IRRI (Currently being

updated) YES YES YES In process

IWMI YES YES YES YES

World Fish YES YES YES YES

Research data infrastructure across the CGIAR centers

Open Data promotional activities@

• Open Access Week

• Open access and Data Management policy at ICRISTAT

• Capacity building activities

• Technology Infrastructure establishment

• Data loss prevention initiative at institute and

individual researcher level

Data Repositories @

• OAR and Dataverse

• Village Dynamics in South Asia (VDSA) data warehouse

• ICRISAT- aWhere Platform: Cloud based M&E and data sharing platform

• AGROBASE

• Genetic resources

• Integrated Breeding Platform (IBP)

• EXPLOREit @ ICRISAT

• ResourceSpace

ICRISAT- Dataverse

http://dataverse.icrisat.org/dvn

Online Data storage and sharing capabilities; Integrated system for Baseline, Adoption survey and Trail data management; Research analysis with spatial integration; Cloud computing

TL2 & HOPE- Spatial Data management

1. Integrate socio–economic data into warehouse system 2. Farm Field level information to the users 3. Online Analytical Reports 4. Village Dynamics Database

VDSA-Socio Economics Data management

Data entry

Operators

VDSA Data Management Workflow (Village Level Studies)

Data Digitization

Data

Collection

Primary/

Raw Data

Data

cleansing

and Quality

checks

Data Organization

(Identifier, Schedule

name etc)Data entry

using CSPRO

CSPRO

Database

Field Investigator

Data Manager

Is the data

correct?

[Yes]

[No]

Data

Investigation

Schedule

[Export]

AGROBASE- Breeding Data management

1. Good pedigree management 2. Generating experimental design plan 3. Managing the genetic data using RDBMS 4. Quick Data Analysis for multi location

experiments 5. Generating print field layouts

Integrated Breeding Platform (Generation Challenge Program)

Breeding Data management

Integrated Breeding Platform (Generation Challenge Program)

• Web based – one stop shop for breeding information

• Integrated system to help day to day activities of the modern plant breeding

• Centralized platform for the partners, funders, researchers

• Goal to boost crop productivity and resilience

Tablet based data collection tools Benefits: • Significantly lower expenses on long term basis

• Time savings in data integration

• Richer, more complete and more accurate data

• Remote deployment to data generators

Open Data Kit (ODK)

ICRISAT is a member of the CGIAR Consortium