Jakob Voß edited this page Apr 30, 2019 · 2 revisions

Notes:

  • Downloading the 37 GB bz2-compressed JSON dump (over the DFG network) takes 5 hours.
  • Simple processing with jq-wikidata, timed with /usr/bin/time (timing goes to jq.log, the extracted ids are discarded):
/usr/bin/time --output=jq.log -p sh -c 'bzcat wikidata-20190422-all.json.bz2 | jq --stream -c "include \"wikidata\"; ndjson|.id" > /dev/null' &
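
For comparison, the same id extraction can be sketched without jq. This is not part of jq-wikidata; it is a minimal Python sketch that assumes the dump's usual layout (one JSON array, one entity per line, each line except the last ending in a comma). The function names `iter_entities` and `iter_ids` are made up for illustration.

```python
import bz2
import json
from typing import Iterator


def iter_entities(path: str) -> Iterator[dict]:
    """Yield one entity dict per line of a bz2-compressed Wikidata JSON dump.

    Assumption: the dump is a JSON array with exactly one entity per line.
    """
    # Text mode decompresses lazily, so the dump is never held in memory.
    with bz2.open(path, mode="rt", encoding="utf-8") as dump:
        for line in dump:
            # Drop the newline and the comma that separates array elements.
            line = line.strip().rstrip(",")
            # Skip the "[" and "]" lines that open and close the array.
            if line in ("", "[", "]"):
                continue
            yield json.loads(line)


def iter_ids(path: str) -> Iterator[str]:
    """Yield the "id" field (for example "Q42") of every entity in the dump."""
    for entity in iter_entities(path):
        yield entity["id"]
```

Calling `iter_ids("wikidata-20190422-all.json.bz2")` and printing each value would produce the same list of ids as the jq command above, one per line. No timing was measured for this variant.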