Scrapy is an application framework for crawling websites and extracting structured data (https://docs.scrapy.org/)
- Python 3.11 (via pyenv)
- Poetry
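# install the project dependencies, then enter the virtualenv
# (on Poetry 2.x, `poetry shell` is provided by the poetry-plugin-shell plugin)
poetry install
poetry shell
# download the browser binaries used by Playwright
playwright install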
# scrapy startproject <project_name> [project_dir]
scrapy startproject databius .
# scrapy genspider [-t template] <name> <domain or URL>
scrapy genspider apache.org www.apache.org/logos
scrapy genspider firstmark.com https://mad.firstmark.com/
# scrapy crawl <spider>
scrapy crawl apache.org