In recent years, a number of organizations have started to fund such collaboratives, and set up new PhD programs to train data scientists focused on building them. Berkeley and BIDS in particular has spearheaded the concept of ‘data collaboratives’ as networks of practice around a topical area of data.
Here’s an example from 2018: https://data.berkeley.edu/news/data-collaboratives-moving-knowledge-action
What are examples of successful data collaboratives built from communities of practice?
Some efforts to train contributors to future collaboratives:
- A summary of current practices.
- The Schmidt Science Fellows program, which sometimes matches fellows with projects
- NASA’s Datanauts
Some topical collaboratives:
- A lightly curated overview courtesy of GovLab.
- At MIT, the Climate CoLab, collaboratively building proposals (grounded in shared data and policy) to reach climate goals
As we start defining registries, these are groups to consider. Many repositories of datasets + annotations from a wide range of sources, which may make good registries.