WikiCite: metadata, inbound links, and contextual relevance for every cited work

WikiCite is a spec for a completion of citations:

  • Structured data, inbound links, and contextual relevance for every cited work
  • Entity resolution for {article-citation strings}, {article titles}, {authors}, {affiliations}, {funding sources}

Applied to a growing circle of cites:

  1. scholarly cites on en:wp (almost done),
  2. all scholarly cites + book cites on en:wp,
  3. all scholarly+book cites + all other cites from WP + other major public-knowledge projects,
  4. all cites from the top tier of existing works in wikicite

Current bottlenecks include:

  • Performance (Wikidata as a central repo slows down if you run lots of complex citation queries; all cites are being stored there atm),
  • Access (90% of this metadata exists in digital form, with some duplication, in a dozen private databases and scholarly graphs.
  • Disambiguation (entity resolution + deduplication are unsolved problems)

Groups working on this include:

  • Lens.org (articles and patents)
  • FatCat (articles @ IA)
  • Meta (not yet public)
  • MS Academic, Aminer, GScholar (semi-private data w/ semi-public interfaces, updated regularly)