Skip to content

Latest commit

 

History

History
29 lines (20 loc) · 1.81 KB

File metadata and controls

29 lines (20 loc) · 1.81 KB

Adding additional information

This chapter describes various efforts that have taken place in the past to add content to Wikidata from various (peer reviewed) sources. The serve as an example.

Adding chemical compounds

The first 200 thousand chemical compounds were added in the work by Waagmeester, Stupp, Burgstaller-Muehlbacher, and others [Q87830400]. Willighagen wrote a CDK and Bacting based script to add chemical structures , now available as Wikidata/createWDitemsFromSMILES.groovy. This has been used to add many chemical compounds. By default, it only adds compounds with full stereochemistry defined. It add the SMILES, InChI, InChIKey, and mass. If the InChIKey gives a match in PubChem, then the PubChem CID is added too.

Melting points

Adding properties follow a similar process. If a SMILES is given, an InChIKey can be calculated, which can be used to find the Wikidata items to which a property belongs. This has been used to add melting points from the Jean-Claude Bradley Open Melting Point Dataset [Q69644056] using another Groovy script, MeltingPoints/createQuickStatements.groovy.

Boiling points

Earlier this year, another set of bioling points have been added, sourced from a 2004 article [Q51983889]. Yet another Groovy script, BoilingPoints/createQuickStatements.groovy, uses this gist as input to create QuickStatements.

References