Job Posting Filtering Tool

Save time by using this to retrieve postings from job boards and filter out those you don't want to see. Built on top of JobSpy.

Features

The scrape_and_filter_jobs function allows for all the same arguments as jobspy.scrape_jobs, with three additional optional arguments:

filter_state - takes a state abbreviation and filters out any postings from other states
- For example, if you're looking for jobs in and around NYC but only want to see jobs in New York State (ignoring NJ, CT) you would set filter_state="NY"
exclude_companies - takes a list of strings for company names you want to ignore
exclude_title_terms - takes a list of strings for terms in job titles you want to ignore (senior, principal, new grad, etc.)

The console will display how many jobs the scraper found and how many were removed by each filter.

I've also created export_jobs_to_csv which takes a pandas DataFrame and outputs a timestamped CSV file for easier viewing of the job postings.

Getting Started

1. Install jobspy

pip install -U python-jobspy

See JobSpy for more details.

2. Clone the repository

git clone https://github.com/zkornbluth/job-searcher.git

3. Use the function in your own .py script

Example usage:

from scrape_and_filter_jobs import scrape_and_filter_jobs, export_jobs_to_csv

jobs = scrape_and_filter_jobs(
        site_name=["indeed", "linkedin"]
        search_term="software engineer",
        location="New York, NY",
        distance=25,
        hours_old=24,
        country_indeed="USA",
        results_wanted=50,
        linkedin_fetch_description=True,
        exclude_title_terms=["Senior", "Sr", "Lead", "Principal"],
        filter_state="NY",
        exclude_companies=["Jobright.ai", "Lensa"]
    )

export_jobs_to_csv(jobs)

Be sure to put your script in the same folder as scrape_and_filter_jobs.py.

See main.py for how I scrape and filter jobs in multiple locations and combine the results into one CSV file.

Sample output from main.py:

Disclaimer - I have only successfully tested this scraping from Indeed and LinkedIn in the US.

Setting up a Recurring Schedule on Mac

1. Express the schedule you want in cron syntax

There are five fields: minute, hour, day of the month, month, and weekday (in that order). An asterisk represents a wildcard - any possible value.

For example, to schedule something for 9am every weekday, you'd represent it like this: 0 9 * * 1-5

2. Find the path to your Python version and your script

For example, my Python version is at /Library/Frameworks/Python.framework/Versions/3.13/bin/python3 and my script is at /Users/zkornbluth/Documents/Code/Python/job_searcher/main.py

3. Open a new Terminal window and run this command

crontab -e

This opens the crontab file which lets you schedule bash commands

4. Edit the crontab file

Press the i key to begin editing the file.

Type in your command in this order: [schedule] [Python version] [script]. So to run my script at 9am every weekday, I'd type: 0 9 * * 1-5 /Library/Frameworks/Python.framework/Versions/3.13/bin/python3 /Users/zkornbluth/Documents/Code/Python/job_searcher/main.py

Hit the Escape key, then type :wq to save and quit. You should see a message in Terminal that says crontab: installing new crontab.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt
scrape_and_filter_jobs.py		scrape_and_filter_jobs.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Job Posting Filtering Tool

Features

Getting Started

1. Install jobspy

2. Clone the repository

3. Use the function in your own .py script

Setting up a Recurring Schedule on Mac

1. Express the schedule you want in cron syntax

2. Find the path to your Python version and your script

3. Open a new Terminal window and run this command

4. Edit the crontab file

About

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Job Posting Filtering Tool

Features

Getting Started

1. Install jobspy

2. Clone the repository

3. Use the function in your own .py script

Setting up a Recurring Schedule on Mac

1. Express the schedule you want in cron syntax

2. Find the path to your Python version and your script

3. Open a new Terminal window and run this command

4. Edit the crontab file

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors

Uh oh!

Languages