Skip to content

databius/scrapybius

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

scrapy

An application framework for crawling websites and extracting structured data (https://docs.scrapy.org/)

Getting Started

Prerequisites

  • python3.11 (pyenv)
  • poetry

Setup development env and running tests

poetry install
poetry shell

Playwright

playwright install

Creating a project

# scrapy startproject <project_name> [project_dir]
scrapy startproject databius .

Start the first spider

scrapy genspider apache.org www.apache.org/logos
scrapy genspider firstmark.com https://mad.firstmark.com/

Run a spider

scrapy crawl apache.org

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages