This importer covers only the "State Data Files" listed here. Many other "Data Sources" are listed as well, but they are online databases accessible only through custom web interfaces, which puts them outside the scope of the National Data Catalog.
# For development and production:
gem install datacatalog-importer
- Create config.yml using config.example.yml as a template.
First, rake pull. Then, rake push. It's easy.
Download and parse pages from the source catalog. You will need to specify the environment using the IMPORTER_ENV environment variable. Some examples:
IMPORTER_ENV=local rake pull
IMPORTER_ENV=sandbox rake pull
IMPORTER_ENV=production rake pull
Upload pulled data to the National Data Catalog API. You will need to specify the environment using the IMPORTER_ENV environment variable. Some examples:
IMPORTER_ENV=local rake push
IMPORTER_ENV=sandbox rake push
IMPORTER_ENV=production rake push
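The two steps above can be chained so that a push only happens after a successful pull against the same environment. The following is a minimal sketch, not part of the importer itself: the `run_importer` function and the `RAKE` override variable are hypothetical names introduced here, and the list of accepted environments is assumed to be just the three shown in the examples above.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with the importer): run `rake pull`
# and then `rake push` against a single environment.
run_importer() {
  env_name="$1"

  # Assumption: only the three environments used in the examples are valid.
  case "$env_name" in
    local|sandbox|production) ;;
    *)
      echo "unknown environment: $env_name" >&2
      return 2
      ;;
  esac

  # RAKE is a hypothetical override for the rake command
  # (for example RAKE="bundle exec rake"); it defaults to plain rake.
  # The && ensures push is skipped if pull fails.
  IMPORTER_ENV="$env_name" ${RAKE:-rake} pull &&
    IMPORTER_ENV="$env_name" ${RAKE:-rake} push
}
```

After sourcing the function, `run_importer sandbox` would pull from the source catalog and then push to the sandbox API. Rejecting unrecognized names up front is a design choice meant to catch a mistyped environment before anything is uploaded.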
- the National Data Catalog Web site
- the National Data Catalog mailing list
- the National Data Catalog project page
- the Sunlight Labs mailing list
- the transparency chat room