Workshop on Provenance for Transparent Research

The public and the press already expect to judge the trustworthiness of research on pressing social and public health issues by its transparency. Reliable provenance is widely recognized, in principle, as a critical component of research reproducibility, but its promise for making research transparent, and scientific claims easier to evaluate, has yet to be fully realized. In particular, it is still far from routine for researchers in the natural, social, and data sciences to assess the trustworthiness of reported results using automatically captured provenance information.

This workshop aims to engage Provenance Week 2021 attendees in a focused conversation about how methods for automated provenance capture, storage, query, inference, and visualization can make research more transparent and the trustworthiness of results easier to evaluate, both by other researchers and by the public. In brief presentations, speakers will propose actionable definitions of terms such as transparent, trustworthy, and traceable; identify needs of particular research communities and other stakeholders; prioritize desiderata for real-world system implementations; and highlight remaining research and engineering challenges. All workshop participants will be invited to comment and contribute their own definitions, priorities, and user requirements in real time via shared documents. These suggestions will be ranked by priority and degree of consensus during a final discussion, and the resulting recommendations and rankings will be included in a workshop report.

Seven T-Words: Principles of Transparent Research

A central aim of the workshop is to move beyond the debates around the R-words (reproducible, replicable, repeatable, etc.) to focus on the elements of excellent research that the R-words ultimately represent and that automated provenance management can help deliver:

  • Trustworthy publications, results, and recommendations
  • Transparent research processes that facilitate review and assessment
  • True records of the methods and processes yielding research artifacts
  • Traceable derivation lineages of individual data products
  • Trials demonstrated to rigorously enact well-defined study designs
  • Tests of hypotheses, protocols, and conclusions that are readily reviewed
  • Timely application of research outcomes to address pressing problems

Suggested Themes for Presentations

  • Significance of research transparency in addressing 21st-century existential threats
  • Actionable definitions of transparency, traceability, and related T-words
  • R-words meet T-words: how reproducibility enables transparency and vice versa
  • Transparent research objects: standards and interoperability
  • Provenance in support of FATE and FAIR principles
  • Needles in a haystack: querying and visualizing lineages of particular research products
  • What can I ask? Vocabularies and query languages for delivering traceability
  • Attachment issues. Associating domain-specific concepts with computational artifacts
  • Unsolved problems and other opportunities for collaboration and funding

Keynote Speaker

We are pleased to announce Lars Vilhuber as keynote speaker for the T7 workshop. Dr. Vilhuber is Executive Director of the Labor Dynamics Institute and Senior Research Associate in the ILR School at Cornell University. He serves as Data Editor for the American Economic Association, where he has been central to implementing policies and procedures for verifying the reproducibility of computational research published in AEA journals. Dr. Vilhuber is also Managing Editor of the Journal of Privacy and Confidentiality, Chair of the American Statistical Association’s Committee on Privacy and Confidentiality, and a member of various boards of restricted-access data centers, giving him great insight into the challenges and advantages of transparent and reproducible science.

In his talk, “Principles of Transparent Research: Implementation Challenges,” he will discuss the role of journal reproducibility checks in improving research transparency, as well as the many challenges faced in implementing them.

Workshop Format

T7 will be held on Thursday, July 22, 2021, entirely over Zoom to allow worldwide participation. Speakers will give 10-15 minute presentations; all participants will propose issues, definitions, desiderata, user stories, and success criteria in shared documents. A web-based audience response system will be used to score and rank these contributions anonymously.

Call for Abstracts

We invite submissions of 1-page abstracts on topics relevant to the workshop. Abstracts should highlight the key points the speaker plans to submit to the audience for discussion and ranking. To submit an abstract, please select the Workshop on Provenance for Transparent Research track in EasyChair.

Important Dates

  • T7 Abstract Submission: May 31, 2021
  • T7 Speaker Notification: June 14, 2021
  • Provenance Week: July 19-22, 2021
  • T7 Convenes: July 22, 2021 (Thursday)


  • Shawn Bowers (Gonzaga University)
  • Carole Goble (University of Manchester)
  • Bertram Ludaescher (UIUC)
  • Timothy McPhillips (UIUC)
  • Craig Willis (UIUC)

More Information

If you have any questions, please contact Tim McPhillips.

ProvenanceWeek 2021

Following successful past ProvenanceWeek events, ProvenanceWeek 2021 will again co-locate the IPAW and TaPP workshops as well as several satellite events that focus on novel directions for provenance.
