Infrastructure for reproducible data analysis

sj · July 11, 2019, 3:22pm

Whole Tale, Jupyterhub, Binderhub, and other tools are important for reproducible data analysis. Currently they have to be hosted and run in a cloud somewhere; there’s no canonical hosted service available. As a result, organizations that want these sorts of services are getting sucked into multi-year contracts with big proprietary players who bundle such things with existing large-ticket services.

How can we make it easy for every academic institution to run their own instance or support a free-knowledge network sercice? What options and services and commitments are needed, where to start, who is doing this now?

See Chris Holdgraf’s comment here:

Topic	Replies	Views
Tools for sharing + maintaining code and data: Dataverse+ Underlay	410	March 4, 2020
Introducing eLife’s first computationally reproducible article General	431	February 28, 2019
Qri: A global dataset version control system built on the distributed web Underlay	432	March 23, 2019
Sci-Hub continues to provide a backstop to more complete solutions General	428	January 16, 2019
Mind the Gap: A landscape survey + analysis of open publishing PubPub	332	August 8, 2019

Infrastructure for reproducible data analysis

Related topics