(i have no title) joe hourclé essi workshop 2010-08-02 joe hourclé essi workshop 2010-08-02

17
(I have no title) Joe Hourclé ESSI Workshop 2010-08-02

Upload: ashlie-harrison

Post on 13-Dec-2015

215 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02

(I have no title)(I have no title)Joe Hourclé

ESSI Workshop 2010-08-02Joe Hourclé

ESSI Workshop 2010-08-02

Page 2: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02

•Note : When reading this presentation at home, view with the ‘Notes’ window visible; I have talking points and other comments in there.

•Note : When reading this presentation at home, view with the ‘Notes’ window visible; I have talking points and other comments in there.

Page 3: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02

About MeAbout Me

•Programmer for the Virtual Solar Observatory (VSO)

•Sysadmin & DBA for the Solar Data Analysis Center (SDAC)

•Likes to complain about things

•Has been working for the 18+ months on integrating SDO data into the VSO.

•Programmer for the Virtual Solar Observatory (VSO)

•Sysadmin & DBA for the Solar Data Analysis Center (SDAC)

•Likes to complain about things

•Has been working for the 18+ months on integrating SDO data into the VSO.

Page 4: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02

So, the problem ...So, the problem ...•Scientists either don’t know, or don’t

care about informatics issues

•We need to work with the scientists to educate them on how to make their work (data, systems, catalogs, etc) useful to as wide an audience as possible

•We need to stop having every data system designed from the ground up

•Scientists either don’t know, or don’t care about informatics issues

•We need to work with the scientists to educate them on how to make their work (data, systems, catalogs, etc) useful to as wide an audience as possible

•We need to stop having every data system designed from the ground up

Page 5: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02

Ignored Issues in e-Science: Collaboration, Provenance and the

Ethics of Data

Ignored Issues in e-Science: Collaboration, Provenance and the

Ethics of Data

Page 6: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02

Ignored Issues in e-Science: Collaboration, Provenance and the

Ethics of Data

Ignored Issues in e-Science: Collaboration, Provenance and the

Ethics of Data

Page 7: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02

Ignored Issues in e-Science: Collaboration, Provenance and the

Ethics of Data

Ignored Issues in e-Science: Collaboration, Provenance and the

Ethics of Data

Page 8: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02

• Provenance can't be a bolt-on. It must be part of the data system from the beginning of the mission. Otherwise, people can cast doubt in the data to refute research they don't like.

• Uncertainties in some data are not straightforward to include in data files. Software should be seen as an alternative source of uncertainty information

• It is impossible to tell in detail exactly how the data was produced. What assumptions were made, what artifacts introduced, what the absolute accuracy is.

• In sensor networks – need annotation of when sensors are swapped out or other discontinuities.

• Provenance can't be a bolt-on. It must be part of the data system from the beginning of the mission. Otherwise, people can cast doubt in the data to refute research they don't like.

• Uncertainties in some data are not straightforward to include in data files. Software should be seen as an alternative source of uncertainty information

• It is impossible to tell in detail exactly how the data was produced. What assumptions were made, what artifacts introduced, what the absolute accuracy is.

• In sensor networks – need annotation of when sensors are swapped out or other discontinuities.

Page 9: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02

•How you describe / document time series data is fundamentally different from images & spectra – Collections are hard to define when there isn't a synoptic campaign.

•Software engineering point of view for data :

•How you describe / document time series data is fundamentally different from images & spectra – Collections are hard to define when there isn't a synoptic campaign.

•Software engineering point of view for data :

Page 10: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02

Software EngineeringPoint of View for DataSoftware EngineeringPoint of View for Data

Page 11: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02

•Need ways to measure how interoperable systems are; types of interop and levels of compliance.

•IRL : Interoperability Readiness Levels. Join the NASA Tech Infusion Working Group.

•IPY is working on a cookbook.

•Need ways to measure how interoperable systems are; types of interop and levels of compliance.

•IRL : Interoperability Readiness Levels. Join the NASA Tech Infusion Working Group.

•IPY is working on a cookbook.

Page 12: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02

• Create reward systems for scientists that reward re-usability. (see Townhall Thurs evening)

• Different users have different requirements – do you cater to the general user or all specific cases. Quick search vs. advanced search.

• How do we determine the value of data? Increase in data value if we can reduce uncertainty or increase interop with other data.

• Scale of software – when do you need to bring in a programmer, or a whole team to make it a full project?

• Create reward systems for scientists that reward re-usability. (see Townhall Thurs evening)

• Different users have different requirements – do you cater to the general user or all specific cases. Quick search vs. advanced search.

• How do we determine the value of data? Increase in data value if we can reduce uncertainty or increase interop with other data.

• Scale of software – when do you need to bring in a programmer, or a whole team to make it a full project?

Page 13: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02

• (suggestion) YourBadData.org – name and shame the problem data sets.

• Need automatization methos [sic] to process Nexrad data products by extracting only certain grids from a time data series of files, by geographic coordinate and/or location transformation files to readible formats. txt, shp, ...

• Author identities – using pseudonyms to publish fringe work (blogs) ... might later want to merge identities, or might try to disassociate them when trying to get a new job.

• (suggestion) YourBadData.org – name and shame the problem data sets.

• Need automatization methos [sic] to process Nexrad data products by extracting only certain grids from a time data series of files, by geographic coordinate and/or location transformation files to readible formats. txt, shp, ...

• Author identities – using pseudonyms to publish fringe work (blogs) ... might later want to merge identities, or might try to disassociate them when trying to get a new job.

Page 14: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02

ConclusionConclusion•We need to raise the informatics issues

in ways that the scientists care about

• They care about error bars; how can we improve their error tracking?

•We need simple guidelines / best practices for good data systems

•We need data & system specialists as stakeholders on new data system projects

•We need to raise the informatics issues in ways that the scientists care about

• They care about error bars; how can we improve their error tracking?

•We need simple guidelines / best practices for good data systems

•We need data & system specialists as stakeholders on new data system projects

Page 15: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02

Solar Dynamics Observatory (SDO)Atmospheric Imaging Assembly (AIA)

171Ångstrom ; 2010/07/08 17:45:48UT ; 2x2 binned

Solar Dynamics Observatory (SDO)Atmospheric Imaging Assembly (AIA)

171Ångstrom ; 2010/07/08 17:45:48UT ; 2x2 binned

Page 17: (I have no title) Joe Hourclé ESSI Workshop 2010-08-02 Joe Hourclé ESSI Workshop 2010-08-02