docs: Add guide for running crawler in web server #1174

Merged
merged 5 commits into master from add-guide-running-in-web-server on Apr 29, 2025

Conversation

Pijukatel (Collaborator)

Description

Add guide for running crawler in web server

Issues

@Pijukatel Pijukatel added the `documentation` (Improvements or additions to documentation.) and `t-tooling` (Issues with this label are in the ownership of the tooling team.) labels on Apr 25, 2025
@github-actions github-actions bot added this to the 113th sprint - Tooling team milestone on Apr 25, 2025
@Pijukatel Pijukatel requested a review from Copilot April 25, 2025 12:17
@Copilot Copilot AI (Contributor) left a comment

Pull Request Overview

This PR adds a guide for running the crawler in a web server by including new FastAPI server and crawler code examples along with configuration updates.

  • Updated pyproject.toml to include new file paths and disable specific error codes for the web server examples.
  • Added a FastAPI server example (server.py) to illustrate how to run the crawler from a web endpoint.
  • Introduced an asynchronous crawler implementation (crawler.py) with lifecycle management using an async context manager (see the sketch below).
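To illustrate the lifecycle idea from the last bullet, here is a minimal sketch of the pattern: a crawler wrapped in an async context manager so it starts before the server accepts traffic and shuts down with it. The helper name `crawler_context`, the `keep_alive=True` usage, and the handler body are assumptions for illustration, not the exact `crawler.py` added by this PR.

```python
import asyncio
from contextlib import asynccontextmanager
from typing import AsyncIterator

from crawlee.crawlers import ParselCrawler, ParselCrawlingContext


@asynccontextmanager
async def crawler_context() -> AsyncIterator[ParselCrawler]:
    """Hypothetical helper: start the crawler on entry, shut it down on exit."""
    # keep_alive=True keeps the crawler running even when its request queue is empty,
    # so web endpoints can keep feeding it URLs for as long as the server is up.
    crawler = ParselCrawler(keep_alive=True)

    @crawler.router.default_handler
    async def handler(context: ParselCrawlingContext) -> None:
        # Scrape the page title and store it in the default dataset.
        title = context.selector.css('title::text').get()
        await context.push_data({'url': context.request.url, 'title': title})

    # Run the crawler as a background task for the lifetime of the context manager.
    run_task = asyncio.create_task(crawler.run())
    try:
        yield crawler
    finally:
        crawler.stop()
        await run_task
```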

Reviewed Changes

Copilot reviewed 4 out of 5 changed files in this pull request and generated 1 comment.

| File | Description |
| --- | --- |
| pyproject.toml | Updated configuration to include new file mappings for docs examples and added mypy overrides. |
| docs/guides/code_examples/running_in_web_server/server.py | Introduces a FastAPI server with endpoints for running and interacting with a crawler. |
| docs/guides/code_examples/running_in_web_server/crawler.py | Adds an asynchronous crawler setup with a default request handler and lifecycle management. |
Files not reviewed (1)
  • docs/guides/running_in_web_server.mdx: Language not supported

@Pijukatel Pijukatel requested review from vdusek and Mantisus April 25, 2025 12:20
@Pijukatel Pijukatel marked this pull request as ready for review April 25, 2025 12:20
@Mantisus Mantisus (Collaborator) left a comment

LGTM

- `/` - The index just gives a short description of the server with an example link to the second endpoint.
- `/scrape` - This is the endpoint that receives a `url` parameter and returns the page title scraped from the URL

To run the example server, make sure that you have installed [fastapi[standard]](https://fastapi.tiangolo.com/#installation); then you can use the command `fastapi dev server.py` from the directory where the example code is located.
Collaborator

could we have a separate triple-backticks (```) command here for executing the server?

Collaborator Author

Ok. How is it different in this case? It seems to be rendered in the same way.

Collaborator

It's different - right now the command is "lost" within the paragraph. When I rendered the website locally, I couldn't find it for a while. Since it's an important command, having it separated in triple backticks would make it more visible - especially when you're rushing through the docs, copying the example, and looking for a command to try it out.
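For illustration only, here is a self-contained sketch of a server exposing the two endpoints described in the docs excerpt above. The futures-based hand-off between the `/scrape` endpoint and the crawler handler, and all names below, are assumptions for the example rather than the exact `server.py` from this PR; it would be started with the `fastapi dev server.py` command discussed here.

```python
import asyncio
from contextlib import asynccontextmanager
from typing import AsyncIterator
from uuid import uuid4

from fastapi import FastAPI

from crawlee import Request
from crawlee.crawlers import ParselCrawler, ParselCrawlingContext

# Futures keyed by crawl-request unique key, used to hand scraped results back to HTTP clients.
results: dict[str, asyncio.Future] = {}

crawler = ParselCrawler(keep_alive=True)


@crawler.router.default_handler
async def handler(context: ParselCrawlingContext) -> None:
    # Resolve the waiting HTTP request with the scraped page title.
    title = context.selector.css('title::text').get()
    results[context.request.unique_key].set_result(title)


@asynccontextmanager
async def lifespan(app: FastAPI) -> AsyncIterator[None]:
    # Start the crawler as a background task and stop it when the server shuts down.
    run_task = asyncio.create_task(crawler.run())
    try:
        yield
    finally:
        crawler.stop()
        await run_task


app = FastAPI(lifespan=lifespan)


@app.get('/')
def index() -> dict[str, str]:
    # Short description of the server with an example link to the scrape endpoint.
    return {'hello': 'Try /scrape?url=https://example.com'}


@app.get('/scrape')
async def scrape(url: str) -> dict[str, str | None]:
    # Enqueue the URL under a unique key so the handler can route the result back here.
    unique_key = uuid4().hex
    future: asyncio.Future = asyncio.get_running_loop().create_future()
    results[unique_key] = future
    await crawler.add_requests([Request.from_url(url, unique_key=unique_key)])
    title = await future
    del results[unique_key]
    return {'url': url, 'title': title}
```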

@Pijukatel Pijukatel requested a review from vdusek April 28, 2025 07:26
@Pijukatel Pijukatel requested a review from vdusek April 28, 2025 11:26
from typing import TypedDict

from crawlee.crawlers import ParselCrawler, ParselCrawlingContext


class State(TypedDict):
Collaborator

Not really important, but in FastAPI, you usually use dependencies for this kind of business.
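To make that suggestion concrete, here is a minimal sketch, with all names assumed for illustration, of the dependency-based alternative: the crawler is created in the app's lifespan, stored on `app.state`, and handed to endpoints through `Depends` instead of a shared `TypedDict` state object. Starting and stopping the crawler is omitted to keep the focus on the wiring.

```python
from contextlib import asynccontextmanager
from typing import Annotated, AsyncIterator

from fastapi import Depends, FastAPI, Request

from crawlee.crawlers import ParselCrawler


@asynccontextmanager
async def lifespan(app: FastAPI) -> AsyncIterator[None]:
    # Create the crawler once per application and share it via app.state
    # instead of a module-level TypedDict.
    app.state.crawler = ParselCrawler(keep_alive=True)
    yield


def get_crawler(request: Request) -> ParselCrawler:
    # Dependency that hands the shared crawler to any endpoint that declares it.
    return request.app.state.crawler


app = FastAPI(lifespan=lifespan)


@app.get('/crawler-type')
async def crawler_type(crawler: Annotated[ParselCrawler, Depends(get_crawler)]) -> dict[str, str]:
    # The endpoint receives the crawler through FastAPI's dependency injection.
    return {'crawler': type(crawler).__name__}
```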

@Pijukatel Pijukatel requested a review from vdusek April 29, 2025 06:28
@vdusek vdusek (Collaborator) left a comment

LGTM

@Pijukatel Pijukatel merged commit ec2dd15 into master Apr 29, 2025
23 checks passed
@Pijukatel Pijukatel deleted the add-guide-running-in-web-server branch April 29, 2025 13:33
Development

Successfully merging this pull request may close these issues.

Feature parity: Support for running Crawlee in a web server environment