This job scraper is for the #jobs channel in our Slack Community.
Prerequisites:
- 🐍 Python 3.10 or higher
- 📦 Poetry as the package manager
- 👩🏽💻 VS Code is the recommended IDE, but you can use others like PyCharm or Sublime
- ⚙️
.envfile with SLACK and OPENAI keys (ask maintainer or use your own values)
-
Clone the repo and open it in your IDE
-
Open the Intergrated Terminal and install the packages and dependencies
poetry install
-
Create a
.envfile with the appropriate ENVIRONMENT_VARIABLES (ask a maintainer)# Run these tests to check if your system is setup poetry run pytest tests/test_setup.py
- main.ipynb is the main program that does the scraping and posting to Slack
/notebookscontains notebooks with runnable code!/jobscontains the core modules for this "product"/testscontains the tests!- Config files and such at the Workspace Root
- Linter and Formatter: Ruff
- Testing: pytest
- Project Config: pyproject.tom
Currently, this repo scrapes from Indeed and Google Jobs.
💡 I used this article about the "best job sites in 2024" to start
The main steps are:
- Use Pylenium to scrape jobs from Indeed and parse data to a standard format
- Use AI to filter out irrelevant jobs
- Post relevant jobs to the
#jobschannel in the QAP Slack Community