Skip to content

andripwn/crawler-python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

e0ef202 · May 11, 2020

History

3 Commits
May 11, 2020
May 11, 2020
May 11, 2020
May 11, 2020
May 11, 2020
May 11, 2020
May 11, 2020

Repository files navigation

Python Email Crawler

This python script search/google certain keywords, crawls the webpages from the results, and return all emails found.

Requirements

  • sqlalchemy
  • urllib2

If you don't have, simply sudo pip install sqlalchemy.

Usage

Start the search with a keyword. We use "iphone developers" as an example.

python email_crawler.py "iphone developers"

The search and crawling process will take quite a while, as it retrieve up to 500 search results (from Google), and crawl up to 2 level deep. It should crawl around 10,000 webpages :)

After the process finished, run this command to get the list of emails

python email_crawler.py --emails

The emails will be saved in ./data/emails.csv

About

email scraper/crawls using python (Google/Bing)

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages