Environmental identifier choice guidelines
The pages on biomedical identifiers and geospatial identifiers may also be worth consulting, though some sections of the former are also reproduced below.
Entity classes
Chemical entities (compounds, substances)
Prefer PubChem CIDs (http://rdf.ncbi.nlm.nih.gov/pubchem/compound/CID$1) (wdt:P662)
- CAS registry numbers are imprecise (per Tom Luechtefeld)
- BioBricks/SPOKE standardizing on PubChem CIDs already
- 1.3m IDs (out of ~111m) mapped to Wikidata; we may need to federate with other external sources
Taxa
Prefer NCBI taxa IDs (http://purl.obolibrary.org/obo/NCBITaxon_$1) (wdt:P685)
- 600k IDs (out of 2.7m) mapped to Wikidata; Mahir could try to map the remainder automatically (is already planning this with elurikkus.ee)