A provenance-based infrastructure to support the life cycle of executable papers

David Koop*, Emanuele Santos, Phillip Mates, Huy T. Vo, Philippe Bonnet, Bela Bauer, Brigitte Surer, Matthias Troyer, Dean N. Williams, Joel E. Tohline, Juliana Freire, Cláudio T. Silva

*Corresponding author for this work

Research output: Contribution to journalConference articleResearchpeer-review

41 Citations (Scopus)

Abstract

As publishers establish a greater online presence as well as infrastructure to support the distribution of more varied information, the idea of an executable paper that enables greater interaction has developed. An executable paper provides more information for computational experiments and results than the text, tables, and figures of standard papers. Executable papers can bundle computational content that allow readers and reviewers to interact, validate, and explore experiments. By including such content, authors facilitate future discoveries by lowering the barrier to reproducing and extending results. We present an infrastructure for creating, disseminating, and maintaining executable papers. Our approach is rooted in provenance, the documentation of exactly how data, experiments, and results were generated. We seek to improve the experience for everyone involved in the life cycle of an executable paper. The automated capture of provenance information allows authors to easily integrate and update results into papers as they write, and also helps reviewers better evaluate approaches by enabling them to explore experimental results by varying parameters or data. With a provenance-based system, readers are able to examine exactly how a result was developed to better understand and extend published findings.

Original languageEnglish
JournalProcedia Computer Science
Volume4
Pages (from-to)648-657
Number of pages10
ISSN1877-0509
DOIs
Publication statusPublished - 2011
Externally publishedYes
Event11th International Conference on Computational Science, ICCS 2011 - Singapore, Singapore
Duration: 1 Jun 20113 Jun 2011

Conference

Conference11th International Conference on Computational Science, ICCS 2011
Country/TerritorySingapore
CitySingapore
Period01/06/201103/06/2011
SponsorElsevier B.V., University Tsukuba, Center for Computational Sciences

Bibliographical note

Funding Information:
Acknowledgments. We would like to thank all the developers that have contributed to the VisTrails system: Erik Anderson, Louis Bavoil, Clifton Brooks, Jason Callahan, Steve Callahan, Lorena Carlo, Lauro Lins, Tommy Ellkvist, Daniel Rees, Carlos Scheidegger, and Nathan Smith. The research and development of the VisTrails system has been funded by the National Science Foundation under grants IIS-1050422, IIS-0905385, IIS-0844572, ATM-0835821, IIS-0844546, IIS-0746500, CNS-0751152, IIS-0713637, OCE-0424602, IIS-0534628, CNS-0514485, IIS-0513692, CNS-0524096, CCF-0401498, OISE-0405402, CCF-0528201, CNS-0551724, the Department of Energy (SciDAC VACET and SDM centers, and SBIR 85821S08-II), National Institutes of Health (NCRR ARRA), and IBM Faculty Awards (2005, 2006, 2007, and 2008). E. Santos was partially supported by a CAPES/Fullbright fellowship. We also acknowledge support through a grant from the Army Research Office with funding from the DARPA OLE program.

Keywords

  • Executable paper
  • Provenance
  • Reproducibility
  • Scientific workflows

Cite this