CCE LTER: Information Management (2004-2006)
KSBaker, SJJackson, and JRWanetick, 2005. Strategies Supporting Heterogeneous Data and Interdisciplinary Collaboration: Towards an Ocean Informatics Environment.Proceedings of the 38th Hawaii International Conference on System Sciences. HICSS38, IEEE Computer Society, 2-6 January 2005, Big Island, Hawaii.
HKarasti, KSBaker, and EHalkola, in press. Enriching the Notion of Data Curation in e-Science: Data Managing and Information Infrastructuring in the Long TermEcological Research (LTER) Network. In M. Jirotka, R. Procter, T. Rodden, and G. Bowker (eds). Computer Supported Cooperative Work: An International Journal.Special Issue: Collaboration in e-Research.
KSBaker and KStocks. Building Environmental Information Systems: Myths and Interdisciplinary Lessons. Proceedings of the 40th Hawaii International Conference onSystem Sciences (HICSS) 2007, 3-6 January, Big Island, Hawaii, pp. 1-10, IEEE Computer Society, New Brunswick, NJ, 2007
KSBaker and FMillerand. Articulation Work Supporting Information Infrastructure Design: Coordination, Categorization, and Assessment in Practice. Proceedings of the40th Hawaii International Conference on System Sciences (HICSS), 2007, 3-6 January, Big Island, Hawaii, pp. 1-10, IEEE Computer Society, New Brunswick, NJ, 2007
Eventlogger
Karen S. Baker, Lynn Yarmey, Mason Kortz, Shaun Haber, Jerry Wanetick, and Florence MillerandUniversity of California San Diego, Scripps Institution of Oceanography and University of Quebec, Montreal
2.0
http://cce.lternet.edu
Information Infrastructure Strategies
Collaborative Environment: Partnerships
CCE Site Environment Field Environment
metadata workincomporation & format
eventlogging
visualizing
sampling
web delivery
quality control
PRE-PROCESSING
local sharing
acquisition planning& data scoping
data transport
data exchange
repository storage
documenting
The California Current Ecosystem LTER is acoastal upwelling biome off the coast ofCalifornia. An interdisciplinary group is workingto understand and communicate the effects of longterm climate variability on the California Currentpelagic ecosystem. The CCE site became part ofthe LTER network in 2004 and is based at theScripps Institution of Oceanography - Universityof California, San Diego.
An automated Eventlog System is astep towards integrating diversemeasurements through a commontime and location stamp within ashared authoritative file created asdata is collected. It is in development,prototyped on a series of cruises withdesign modifications introduced aftereach deployment.
The CCE LTER is building a contemporary information environment - OceanInformatics - focusing on participant engagement, process-building, and localdesign. Ocean Informatics is a community of practice emerging to meet thechallenges of articulating requirements and collaborative design in support ofheterogeneous data collections and information management practices. Ourfocus is on developing processes that recognize intertwined technological,organizational, and social factors nherent to design work. Our goal is to createan adaptive information infrastructure that facilitates long-term science (Baker,Jackson, Wanetick, 2006).
Ocean Informatics Environment
LTER and Data Stewardship
Data ManagementElements
quality assurance
DATA COLLECTING
data ingestion
PROCESSING
LTER IM/All-Scientists Meeting, Sep 2006
DATA SHARING
archival storage
Design Environment
Collaborative Data Environment
Who are the data stewards?
Are there other key elements
In your design environment?
LTER represents a unique setting for data stewardship, characterizedand challenged by a long-term science perspective coupled with anopen data sharing policy for primary data and a highly distributedinterdisciplinary collaborative environment. Understandings of theextent and scope of data stewardship are beginning to unfold.
Data stewardship involves recognizing the value of maintainingestablished data collections. Local data stewardship guides datamanagement efforts, engagement, and planning for change along sideactive scientific programs, so that data is available for local researchyet research and network science informs data handling and datamanagement activities (Karasti, Baker and Halkola, in press)
Shared tools: apache, php,html, css
Shared tool use: forms,plotting, and visualization
Gather & share site contentand data for participants,partners, and public
Process cruise page, gridcalculators, dynamicelements such as mapper,bibliographic harvest file
Design and support of projectweb presence
Webhttp://cce.lternet.edu
Two-component achitecture model:highly structured (ie automatednavigation and dadta exchange) andless structured (table specific localneeds)
Focus on open source architectureand tools: mySQL, php, templates,xml and eml
Database schema design and ties toweb delivery
Establish a cross-project data systemthat is flexible yet queriable data andmetadata repository capable of onlineweb delivery
DataZoo
Shared tools:mySQL, perl
Shared architecture:design for community-sidedata and user needs usingshared tools
Create reuseable code andshared data structures;provide ease of data entryinto system as well asimmediate mapping to localand community standardsand formats
Bibliographic, mediagallery, personnel directory,unit and attributedictionaries and mappings
Create extensibleinformation systemelements
SharedModules
Cruise planning, resource planning, science-information management joint planning
Partner with those carrying outinfrastructure studies, socialinformatics, design studies, andethnographic work
Design schema working group,user groups, and reading groups
Information exchange events, disksharing,blogs, forums, shared storage arrays, webservices
Examples
Research and prototype informaticstechniques valuable for ongoing work withfield research programs while establishingand maintaining a bidirectional dialoguebetween domain scientists and data workers
Introduce interdisciplinarycollaborative projects with capacityto inform development ofinfrastructure
Design studio, design teams,and communities of practice
Identify shared resources and facilitatecommunity building by focusing oncommunication and mutual learningframed by codesigning joint tasks
Infrastructure
Elicit and coordinate community-wideneeds through dialogue, developing workingstandards, documentation, and self-assessment
Identify, design, develop, deploy, enact, andsupport improved data planning andprocesses
DataWork
Engage in participatory actionresearch with social scientists withexpertise in science studies,ethnography, communication, andhuman-computer interface
Identify and align technical, social,organizational and communityelements that are part of the dataprocesses
ArticulationWork
Use collaborative design,comparative analysis, andshared resources
Augment and prototype shared services(storage, computation, web services) andcollaborative technologies (contentmanagement systems, webdav remotemount disks)
Mechanisms
Build long-term (adaptive)information infrastructure
Enhance project data handling capabilitiesby creating a cross-project informaticsenvironment
Goals
Sociotechnical DesignStrategies
CollaborativeStrategies
Elements
References
Short-Term Perspective Long-Term Perspective
Technology solution driven Science inquiry driven
Digital maintainability Data sustainability
Data deluge concerns Data sharing concerns
Data grid structures Information infrastructure arrangements
Metadata enactment Data description development
Data curation procedures Data stewardship practices
Scientific Timeframes and their Features
(Karasti et al, 2006)