Building Data Collaboratives

metasj · July 16, 2020, 4:00am

In recent years, a number of organizations have started to fund data collaboratives and to set up new PhD programs to train data scientists focused on building them. Berkeley, and BIDS in particular, have spearheaded the concept of ‘data collaboratives’ as networks of practice around a topical area of data.
Here’s an example from 2018: https://data.berkeley.edu/news/data-collaboratives-moving-knowledge-action

What are examples of successful data collaboratives built from communities of practice?

Some efforts to train contributors to future collaboratives:

A summary of current practices.
The Schmidt Science Fellows program, which sometimes matches fellows with projects
NASA’s Datanauts

Some topical collaboratives:

A lightly curated overview courtesy of GovLab.
At MIT, the Climate CoLab, collaboratively building proposals (grounded in shared data and policy) to reach climate goals

As we start defining registries, these are groups to consider. Many are repositories of datasets + annotations from a wide range of sources, which may make good registries.

metasj · July 18, 2020, 4:03am

As Zach noted recently, Schmidt’s Technology + Society team is a fine example, as it focuses on data collaboratives – efforts to ‘harness the flood of data being generated by the private sector to create public value’, each supported by a coalition of agencies and organizations focused on a topic (so far including the global refugee crisis, improving medical care, and increasing agricultural productivity in developing countries).

These collaboratives often have data maintenance + coordination + cross-referencing challenges well suited to the Underlay (UL), and they like to fund the development of specific projects and tools.
