You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I believe @jcushman has been working on archiving the datasets from data.gov, and some of it will have been captured in the web crawling being done by Internet Archive, but I don't know how fully they have gotten it at this point.
Basically we are routinely capturing the metadata of the data.gov index itself, as well as a copy of each URL it points to, and we're figuring out an affordable way to make that searchable and clonable for data science. There are likely things being missed between the two efforts still -- anything that needs a deep crawl but either isn't on the EOT list or isn't generically crawlable.
This seems concerning: https://www.reddit.com/r/climate/comments/1idin45/the_us_governments_open_data_is_currently_being/
From the thread:
Are data.gov datasets being covered by the EOT archive? I don't see any specific info about these.
The text was updated successfully, but these errors were encountered: