Keywords

integrated modeling; framework; system; multi-model; multi-scale

Start Date

28-6-2018 9:00 AM

End Date

28-6-2018 10:20 AM

Abstract

Numerical frameworks built to conduct multi-sector, multi-scale, and multi-model integrated experiments have historically overlooked the importance of maintaining reusability and reproducibility in modeling, both of which are fundamental to creating robust software, persistent datasets, and processes that the scientific community can broadly use. The Integrated Multi-sector, Multi-scale Modeling (IM3) research team has constructed a modeling system that includes both open-source and proprietary tools. This system provides a collaboration framework through which contributors can plan projects, document and capture model and data provenance, share data across institutions, apply best practices and tools for code and data repositories, use computational resources ranging from desktops to remote HPC, mint DOIs for data, and disseminate the resulting products on the web. We demonstrate the IM3 capabilities by walking through the development of one of the integrated models in the IM3 ecosystem: Demeter. Demeter is an open-source Python package built to disaggregate projections of future land allocation generated by an integrated assessment model. Demeter is publicly available via GitHub and is being used by multiple research teams to explore the impacts and implications of land use and land cover change at various scales. This demonstration highlights a by-design process of planning, developing, executing, and sharing that yields reusable software and reproducible, persistent data and processes in a complex multi-sector, multi-scale modeling ecosystem.

Stream and Session

F4: Replicability and Reproducibility in Research: From Vaporware to Software in Environmental Computing

Examples of Best-Practices for Reusability and Reproducibility in a Multi-sector, Multi-scale Modeling Framework
