Keywords

HydroShare, Reproducibility, Data Sharing, Data Publication

Start Date

28-6-2018 9:00 AM

End Date

28-6-2018 10:20 AM

Abstract

HydroShare is a web-based hydrologic information system operated by the Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI). Within HydroShare, users can create and share data and models using a variety of file formats and flexible metadata. HydroShare enables users to formally publish these resources and to link published data and model resources with the peer-reviewed journal publications that describe them. The ability to link published data and models with the papers that describe them is an important step toward scientific reproducibility, but it is only a first step. HydroShare supports further transparency in the scientific process by enabling scripting of analytical steps via a RESTful application programming interface (API). Using this API, HydroShare users can develop scripts that read data from HydroShare, perform an analytical step (e.g., data processing or visualization), and then write the results back to HydroShare. The script itself can be shared as part of the published dataset in HydroShare, or it can be shared as a Jupyter Notebook that can be executed within the HydroShare environment. Others can then execute the scripts or Jupyter Notebooks to reproduce the analysis performed by the original authors. In this presentation, we discuss how HydroShare can enable best practices for linking publications with data and models and for promoting reproducibility in environmental analyses through sharing of data, models, and scripts that encode the scientific workflow. The HydroShare system is available at http://www.hydroshare.org. Source code for HydroShare is available at https://github.com/hydroshare.
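The read, analyze, and write-back pattern described in the abstract can be illustrated with a minimal Python sketch. It does not call the HydroShare API: `InMemoryStore`, `read_file`, `write_file`, the resource ID `res1`, the file names, and the `flow` column are all hypothetical stand-ins chosen for illustration, so that the shape of such a script is visible without assuming any particular endpoint or client library.

```python
"""Read-process-write sketch. InMemoryStore is a hypothetical stand-in,
not the HydroShare API or any real client library."""
import csv
import io
import statistics


class InMemoryStore:
    """Hypothetical resource store keyed by (resource_id, filename)."""

    def __init__(self):
        self._files = {}

    def read_file(self, resource_id, filename):
        # A real script would download the file over the REST API here.
        return self._files[(resource_id, filename)]

    def write_file(self, resource_id, filename, text):
        # A real script would upload the file over the REST API here.
        self._files[(resource_id, filename)] = text


def summarize_flow(csv_text):
    """Analytical step: count, mean, and max of the 'flow' column."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    flows = [float(row["flow"]) for row in rows]
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["count", "mean", "max"])
    writer.writerow([len(flows), statistics.mean(flows), max(flows)])
    return out.getvalue()


def run_workflow(store, resource_id, in_name, out_name):
    """Read input from the store, analyze it, and write the result back."""
    raw = store.read_file(resource_id, in_name)
    result = summarize_flow(raw)
    store.write_file(resource_id, out_name, result)
    return result


if __name__ == "__main__":
    store = InMemoryStore()
    store.write_file("res1", "flow.csv", "date,flow\n2018-06-01,1.0\n2018-06-02,3.0\n")
    print(run_workflow(store, "res1", "flow.csv", "summary.csv"))
```

Because the analytical step is a plain function of its input, anyone with access to the same input file and script can rerun it and compare the output, which is the reproducibility property the abstract is concerned with.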

Stream and Session

Stream F, Session F4: Replicability and Reproducibility in Research: From Vaporware to Software in Environmental Computing


Using HydroShare to Enhance Sharing and Reproducibility of Research Results
