Skip to content

WaveGenAI/Crawler

Repository files navigation

Crawler

The crawler for WaveGenAI

Setup

  1. Install the required packages
pip install -r requirements.txt
  1. Install docker

Usage

Run the proxy

docker run -d --rm -it -p 3128:3128 -p 4444:4444 -e "TOR_INSTANCES=40" zhaowde/rotating-tor-http-proxy

Run the crawler,

python main.py --csv --input src_data.txt --overwrite --file_name FILE.csv --num_processes 40

License

This project is licensed under the MIT License - see the LICENSE file for details

About

The crawler for WaveGenAI

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages