data management, analysis, and sharing

31
Abhishek Rathore Senior Scientist (Biometrics) Email: [email protected] Data Management, Analysis & Sharing (ICRISAT) CGIAR Research Program on Dryland Systems 29 th September, 2014, Dubai, UAE

Upload: cgiar-research-program-on-dryland-systems

Post on 14-Dec-2014

106 views

Category:

Environment


4 download

DESCRIPTION

Data management, Analysis, and Sharing

TRANSCRIPT

Page 1: Data management, Analysis, and Sharing

Abhishek RathoreSenior Scientist (Biometrics)

Email: [email protected]

Data Management, Analysis & Sharing (ICRISAT)

CGIAR Research Program on Dryland Systems

29th September, 2014, Dubai, UAE

Page 2: Data management, Analysis, and Sharing

DATAExperiments/

Research Station/ FPVS / FLD / LAB /

Baseline Survey /Adoption

Surveys and etc… AND Kept in

IBP/FieldBooks, Agrobase and etc.

Data Processing:

• Curation

• Analysis

• Interpretation

• PublicationAfter that?

• Case 1: Archival / Storage of Data

• Case 2: Forgot, Lying in xls files & Die with time

• Case 3: Case 1 () but no sharing ()

Data Sharing?

Usual Project Dataflow

Page 3: Data management, Analysis, and Sharing

Most CommonRequests by email?

– May not find when requested (After 3-

4 years)!!!

– Very narrow user base for data

– Only active Googlers may find you

(Through Publications?)

– High probability of data loss!

We Share Data!

Page 4: Data management, Analysis, and Sharing

DATAExperiments/

Research Station/ FPVS / FLD / LAB /

Baseline Survey /Adoption

Surveys and etc… AND Kept in

IBP/FieldBooks, Agrobase and etc.

Data Processing:

• Curation

• Analysis

• Interpretation

• PublicationAfter that?

• Case 1: Archival / Storage of Data

• Case 2: Forgot, Lying in xls files & Die with time

• Case 3: Case 1 () but no sharing ()

Data Sharing?

Data Sharing?

Research Resources & Generated Data Not Used to

Full Potential !

Page 5: Data management, Analysis, and Sharing

ICRISAT Data Management Strategy

Data Curators

Data Manager

Page 6: Data management, Analysis, and Sharing

Desired Way

• Must Haves– Shared in online repositories– Accessible to everybody as IPG– Data Quality Ensured– High Standers of Data Curation– Compressed Files / Raw Data– Links to Publications

• Good to have– Summary Tables (various perspectives)– Data Querying Tool– Say Success Story– Point to Lesson Learned– Show impact

We Share Data!

60-70% Time

GIGO

Page 7: Data management, Analysis, and Sharing

Data Quality: Little Complicated

– Supervised Algorithms/ Scripts• SAS, R , GenStat

– Leverage– Cook’s D– Residual Analysis– Diagnostic Plots– Rep/Season - Rep/Season difference– Text Pattern Search– Other Data sepecific

Page 8: Data management, Analysis, and Sharing

Wish to have…

• A user friendly browse-able online system

– Gives information in a glance

– Graphical representation of Story

– Complete insight in to data

• With availability of Raw / Mean data

• Subset selection and download

• Maintenance free (PI submits & forget), Cloude?

• User control over what is being shared?

– Can share only few varieties for few location

• And in what form (*.xls, jpeg, pdf etc?

We Share Data!

Page 9: Data management, Analysis, and Sharing

Achieved !!!

Page 10: Data management, Analysis, and Sharing

Must HaveCRP- DS Dataverse

Page 11: Data management, Analysis, and Sharing

ICRISAT Dataverse Network - Open Source

Page 12: Data management, Analysis, and Sharing

ICRISAT Dataverse Network - Open Source

Page 13: Data management, Analysis, and Sharing

ICRISAT Dataverse Network - Open Source

Page 14: Data management, Analysis, and Sharing

Database @ ICRISAT Projects

Tropical Legumes-II

Page 15: Data management, Analysis, and Sharing

1 year more education

Double the months on the farm per year

The Simultaneous Triple-View of any data:

Table, Map, Chart

Page 16: Data management, Analysis, and Sharing

Malawi Baseline Survey:

Crop Utilization

Page 17: Data management, Analysis, and Sharing

Tanzania Baseline Survey:

Crop Utilization

Page 18: Data management, Analysis, and Sharing

TL-II Trials (Station/FPVS)

Page 19: Data management, Analysis, and Sharing

India & Bangladesh Trials

Comparing LOCAL variety performance summary.

Page 20: Data management, Analysis, and Sharing

Improved Varieties

Summary of Improved Variety Performance

Page 21: Data management, Analysis, and Sharing

Create LOCAL and IMPROVED

Summary Variables

Page 22: Data management, Analysis, and Sharing

Calculate Comparisons:

Improved to Local Performance Ratio

Page 23: Data management, Analysis, and Sharing

Database @ ICRISAT Projects

HOPE

Page 24: Data management, Analysis, and Sharing

Grain Yield

Village wiselocal vs Improved varieties

Page 25: Data management, Analysis, and Sharing

Follow-up (Pooled over Districts)local vs Improved varieties

Page 26: Data management, Analysis, and Sharing

Follow-up (Year Wise)local vs Improved varieties

Download selected Subset of data

Page 27: Data management, Analysis, and Sharing

Follow-up (Year Wise)local vs Improved varieties

Precipitation

Download selected Subset of data

Page 28: Data management, Analysis, and Sharing

Good & Wish to Have

CRP- DS aWhere

Page 29: Data management, Analysis, and Sharing
Page 30: Data management, Analysis, and Sharing

Quantity of Fertilizer Used:

http://apps.awhere.com/reader/Default.aspx?id=wSSIU7DkTE6L6O-enOxanQ

Income Comparison:

http://apps.awhere.com/reader/Default.aspx?id=KFCQG-O3e0i9lQsbbwerjA

Economy:

http://apps.awhere.com/reader/Default.aspx?id=VwkQAtCdAEmIRPOZiHN94w

Land Size:

http://apps.awhere.com/reader/Default.aspx?id=ThKD4aciLEGkYksBFDaszg

Crop Input/Output Distance:

http://apps.awhere.com/reader/Default.aspx?id=7pxWd3P3N0GKdjseBnq31A

Page 31: Data management, Analysis, and Sharing

Thanks…For more details please contact:

Dr. Abhishek [email protected]