idigbio: addressing a bio big data challenge. a. matsunaga, et al. 2013 ieee e-science. 2013: 78-87...

13
iDigBio: Addressing a BIO Big Data Challenge

Upload: cathleen-barton

Post on 19-Jan-2016

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: IDigBio: Addressing a BIO Big Data Challenge. A. Matsunaga, et al. 2013 IEEE e-Science. 2013: 78-87 How iDigBio is Different

iDigBio:Addressing a BIO Big Data

Challenge

Page 2: IDigBio: Addressing a BIO Big Data Challenge. A. Matsunaga, et al. 2013 IEEE e-Science. 2013: 78-87 How iDigBio is Different

A. Matsunaga, et al. 2013 IEEE e-Science. 2013: 78-87

How iDigBio is Different

Page 3: IDigBio: Addressing a BIO Big Data Challenge. A. Matsunaga, et al. 2013 IEEE e-Science. 2013: 78-87 How iDigBio is Different

The Data Landscape is Changing

https://www.idigbio.org/content/collaborating-institutions

Page 4: IDigBio: Addressing a BIO Big Data Challenge. A. Matsunaga, et al. 2013 IEEE e-Science. 2013: 78-87 How iDigBio is Different

Research Use & Training

Cloud-BasedData Store

CommunityBuilding

Tools, APIs & Workflows

Outreach/Education

Data Capture/Annotation

Best Practices

Page 5: IDigBio: Addressing a BIO Big Data Challenge. A. Matsunaga, et al. 2013 IEEE e-Science. 2013: 78-87 How iDigBio is Different

Community Building is Key

Page 6: IDigBio: Addressing a BIO Big Data Challenge. A. Matsunaga, et al. 2013 IEEE e-Science. 2013: 78-87 How iDigBio is Different

A. Matsunaga, et al. 2013 IEEE e-Science. 2013: 78-87

CI Design: Integrate, Leverage, Re-Use

Page 7: IDigBio: Addressing a BIO Big Data Challenge. A. Matsunaga, et al. 2013 IEEE e-Science. 2013: 78-87 How iDigBio is Different

now 2050

now 2050

Flatspike sedge (Abildgaardia ovata)

Scrub plum (Prunus geniculata)

C. Germain-Aubrey et al. - 1600 spp., 511,000 GPS

iDigBio Data Lead to Discovery

Page 8: IDigBio: Addressing a BIO Big Data Challenge. A. Matsunaga, et al. 2013 IEEE e-Science. 2013: 78-87 How iDigBio is Different

Accelerating Digitization of Biodiversity Research Specimens through Online Public Participation

Aiming Up: Natural History Collections as Emerging Resources for Innovative Undergraduate Education in Biology

A Computational- and Storage-Cloud for Integration of Biodiversity Collections

Five task clusters that enable efficient and effective digitization of biological collections

Augmenting optical character recognition (OCR) for improved digitization: Strategies to access scientific data in natural history collections

A workflow for text extraction and parsing for herbarium specimens

Integrating specimen databases and revisionary systematics

Reaching Consensus in Crowdsourced Transcription of Biocollections Information

Semantics in Support of Biodiversity Knowledge Discovery

A Specimen-based View of the World: Using the Biological Collections Ontology to Model Biodiversity Collections

And to Best Practices

Page 9: IDigBio: Addressing a BIO Big Data Challenge. A. Matsunaga, et al. 2013 IEEE e-Science. 2013: 78-87 How iDigBio is Different

NSF Plays an Active Role

Page 10: IDigBio: Addressing a BIO Big Data Challenge. A. Matsunaga, et al. 2013 IEEE e-Science. 2013: 78-87 How iDigBio is Different

Bridging Investments: Trees + Specimens + Tools

Page 11: IDigBio: Addressing a BIO Big Data Challenge. A. Matsunaga, et al. 2013 IEEE e-Science. 2013: 78-87 How iDigBio is Different

http://www.dataone.org

Community Input: Now and Future

Page 12: IDigBio: Addressing a BIO Big Data Challenge. A. Matsunaga, et al. 2013 IEEE e-Science. 2013: 78-87 How iDigBio is Different

A Model for Community-Driven CI

Page 13: IDigBio: Addressing a BIO Big Data Challenge. A. Matsunaga, et al. 2013 IEEE e-Science. 2013: 78-87 How iDigBio is Different

Anne Maglia [email protected] @ammaglia

NSF/BIO/DBI

iDigBio: http://www.idigbio.org