Please consider whether this resource would be good for your list. It a large collection of data about entities such as people, businesses, and organizations. It also includes code to ETL this data for use in NLP projects.
https://github.com/az0/entity-metadata/