Vision Researcher

This is a demonstration application that uses the ScreenshotOne API to render full-page screenshots of web pages, applies OCR to them, and searches the recognized text for given patterns.

It uses the following technologies:

Check out more examples in the ScreenshotOne examples repository.
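As a rough illustration of the screenshot step, the request for a full-page screenshot could be built as shown below. This is a minimal sketch, not the code from `vision_researcher.py`: the helper name `build_screenshot_url` is hypothetical, and the endpoint and parameter names (`access_key`, `url`, `full_page`, `format`) are assumed from ScreenshotOne's public HTTP API and should be checked against the official documentation.

```python
from urllib.parse import urlencode

# Assumption: endpoint and parameter names follow ScreenshotOne's public
# HTTP API; verify them against the official documentation before use.
SCREENSHOTONE_ENDPOINT = "https://api.screenshotone.com/take"


def build_screenshot_url(access_key: str, page_url: str) -> str:
    """Return a GET URL that requests a full-page PNG screenshot.

    Hypothetical helper, used here for illustration only.
    """
    query = urlencode(
        {
            "access_key": access_key,
            "url": page_url,
            "full_page": "true",  # capture the whole page, not just the viewport
            "format": "png",
        }
    )
    return f"{SCREENSHOTONE_ENDPOINT}?{query}"
```

Fetching the returned URL with any HTTP client should yield the image bytes, which can then be passed on to the splitting and OCR steps.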

How it works

You provide a URL as a CLI argument, and the application will:

1. Take a full-page screenshot of the given URL.
2. Split the screenshot into multiple parts.
3. Apply OCR to each part.
4. Search for the given patterns in the OCR output of each part by asking an AI model whether the pattern is present in that part.
5. Fetch the HTML content of the page.
6. Parse the links and navigate to the internal ones.
7. Repeat the process from step 1 for each new page until it finds a match for the given patterns in the OCR output of a page.
8. Print the results.
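The steps above can be sketched as a small crawl loop. This is a simplified illustration under stated assumptions, not the actual implementation: all function names are hypothetical, the breadth-first order and the way the page limit is enforced are guesses, and the screenshot, OCR, and AI steps are hidden behind a caller-supplied `page_matches` callable so that the example runs with the standard library only.

```python
from collections import deque
from html.parser import HTMLParser
from typing import Callable, Optional
from urllib.parse import urldefrag, urljoin, urlparse


def tile_bounds(height: int, tile_height: int) -> list[tuple[int, int]]:
    """Split a page of `height` pixels into (top, bottom) row ranges."""
    return [(top, min(top + tile_height, height))
            for top in range(0, height, tile_height)]


class _LinkCollector(HTMLParser):
    """Collect the href attribute of every <a> tag."""

    def __init__(self) -> None:
        super().__init__()
        self.hrefs: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.hrefs.extend(v for k, v in attrs if k == "href" and v)


def internal_links(page_url: str, html: str) -> list[str]:
    """Return absolute, de-duplicated links that stay on the page's host."""
    collector = _LinkCollector()
    collector.feed(html)
    host = urlparse(page_url).netloc
    links: list[str] = []
    for href in collector.hrefs:
        absolute, _fragment = urldefrag(urljoin(page_url, href))
        if urlparse(absolute).netloc == host and absolute not in links:
            links.append(absolute)
    return links


def crawl(
    start_url: str,
    max_pages: int,
    page_matches: Callable[[str], bool],  # stands in for screenshot + OCR + AI
    fetch_html: Callable[[str], str],     # stands in for the HTML download
) -> Optional[str]:
    """Visit pages breadth-first; return the first matching URL, if any."""
    queue = deque([start_url])
    seen = {start_url}
    visited = 0
    while queue and visited < max_pages:
        url = queue.popleft()
        visited += 1
        if page_matches(url):
            return url
        for link in internal_links(url, fetch_html(url)):
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return None
```

In a real run, `page_matches` would take the screenshot, crop it along the ranges from `tile_bounds`, run OCR on each crop, and ask the AI model about the recognized text. Passing the two callables in keeps the traversal logic easy to test against a fake site.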

The code was written with the help of Cursor as specified in the instructions.

How to build and run

Clone the repository:

git clone https://github.com/screenshotone/examples.git

Go to the examples/python/vision-researcher directory:

cd examples/python/vision-researcher

Install the dependencies:

pip install -r requirements.txt

Create a .env file and set the following environment variables:

SCREENSHOTONE_API_KEY=your_screenshotone_api_key
OPENAI_API_KEY=your_openai_api_key
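A quick way to confirm that both variables are visible to the process is a check like the one below. It is a minimal sketch: `missing_keys` is a hypothetical helper, and it assumes the values from the .env file have already been loaded into the environment (for example by a loader such as python-dotenv, which may or may not be what the application uses).

```python
import os
from typing import Mapping

# The two variable names come from the .env example above.
REQUIRED_KEYS = ("SCREENSHOTONE_API_KEY", "OPENAI_API_KEY")


def missing_keys(env: Mapping[str, str] = os.environ) -> list[str]:
    """Return the required variables that are unset or empty in `env`."""
    return [key for key in REQUIRED_KEYS if not env.get(key)]


if __name__ == "__main__":
    missing = missing_keys()
    if missing:
        raise SystemExit("Missing environment variables: " + ", ".join(missing))
    print("All required environment variables are set.")
```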

Run the application:

python vision_researcher.py <url> <prompt> <max pages>
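The three positional arguments in the command above could be parsed along these lines. This is an illustrative sketch using the standard `argparse` module; the attribute names (`url`, `prompt`, `max_pages`) and help texts are assumptions and may differ from what `vision_researcher.py` actually does.

```python
import argparse
from typing import Optional, Sequence


def parse_args(argv: Optional[Sequence[str]] = None) -> argparse.Namespace:
    """Parse `<url> <prompt> <max pages>` from the command line."""
    parser = argparse.ArgumentParser(
        prog="vision_researcher.py",
        description="Search website screenshots for content matching a prompt.",
    )
    parser.add_argument("url", help="URL of the first page to inspect")
    parser.add_argument("prompt", help="question to ask about each page part")
    parser.add_argument("max_pages", type=int,
                        help="maximum number of pages to visit")
    # argv=None makes argparse fall back to sys.argv[1:].
    return parser.parse_args(argv)
```

Declaring `max_pages` with `type=int` makes `argparse` reject a non-numeric value with a usage error before any screenshots are taken.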

For example, to search for the content containing "testimonials" on the ScreenshotOne website:

python vision_researcher.py https://screenshotone.com "Does the website page or the website page parts contain testimonials?" 5

The results are printed to the console.
