Skip to content

Environmental identifier choice guidelines

The pages on biomedical identifiers and geospatial identifiers may also be worth consulting, though some sections of the former are also reproduced below.

Entity classes

Chemical entities (compounds, substances)

Prefer PubChem CIDs (http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID$1) (wdt:P662)

  • CAS registry numbers are imprecise (per Tom Luechtefeld)
    • BioBricks/SPOKE standardizing on PubChem CIDs already
  • 1.3m IDs (out of ~111m) mapped to Wikidata; we may need to federate with other external sources

Taxa

Prefer NCBI taxa IDs (http://purl.obolibrary.org/obo/NCBITaxon_$1) (wdt:P685)

  • 600k IDs (out of 2.7m) mapped to Wikidata; Mahir could try to map the remainder automatically (is already planning this with elurikkus.ee)