JobSpy is a simple, yet comprehensive, job scraping library.
Not technical? Try out the web scraping tool on our site at usejobspy.com.
Looking to build a data-focused software product? Book a call to work with us.
- Scrapes job postings from LinkedIn, Indeed, Glassdoor, & ZipRecruiter simultaneously
- Aggregates the job postings in a Pandas DataFrame
- Proxies support
pip install -U python-jobspy
Python version >= 3.10 required
import csv
from jobspy import scrape_jobs
jobs = scrape_jobs(
site_name=["indeed", "linkedin", "zip_recruiter", "glassdoor"],
search_term="software engineer",
location="Dallas, TX",
results_wanted=20,
hours_old=72, # (only Linkedin/Indeed is hour specific, others round up to days old)
country_indeed='USA', # only needed for indeed / glassdoor
# linkedin_fetch_description=True # get full description , direct job url , company industry and job level (seniority level) for linkedin (slower)
# proxies=["208.195.175.46:65095", "208.195.175.45:65095", "localhost"],
)
print(f"Found {len(jobs)} jobs")
print(jobs.head())
jobs.to_csv("jobs.csv", quoting=csv.QUOTE_NONNUMERIC, escapechar="\\", index=False) # to_excel
SITE TITLE COMPANY CITY STATE JOB_TYPE INTERVAL MIN_AMOUNT MAX_AMOUNT JOB_URL DESCRIPTION
indeed Software Engineer AMERICAN SYSTEMS Arlington VA None yearly 200000 150000 https://www.indeed.com/viewjob?jk=5e409e577046... THIS POSITION COMES WITH A 10K SIGNING BONUS!...
indeed Senior Software Engineer TherapyNotes.com Philadelphia PA fulltime yearly 135000 110000 https://www.indeed.com/viewjob?jk=da39574a40cb... About Us TherapyNotes is the national leader i...
linkedin Software Engineer - Early Career Lockheed Martin Sunnyvale CA fulltime yearly None None https://www.linkedin.com/jobs/view/3693012711 Description:By bringing together people that u...
linkedin Full-Stack Software Engineer Rain New York NY fulltime yearly None None https://www.linkedin.com/jobs/view/3696158877 Rainβs mission is to create the fastest and ea...
zip_recruiter Software Engineer - New Grad ZipRecruiter Santa Monica CA fulltime yearly 130000 150000 https://www.ziprecruiter.com/jobs/ziprecruiter... We offer a hybrid work environment. Most US-ba...
zip_recruiter Software Developer TEKsystems Phoenix AZ fulltime hourly 65 75 https://www.ziprecruiter.com/jobs/teksystems-0... Top Skills' Detailsβ’ 6 years of Java developme...
Optional
βββ site_name (list|str):
| linkedin, zip_recruiter, indeed, glassdoor
| (default is all four)
β
βββ search_term (str)
β
βββ location (str)
β
βββ distance (int):
| in miles, default 50
β
βββ job_type (str):
| fulltime, parttime, internship, contract
β
βββ proxies (list):
| in format ['user:pass@host:port', 'localhost']
| each job board scraper will round robin through the proxies
β
βββ is_remote (bool)
β
βββ results_wanted (int):
| number of job results to retrieve for each site specified in 'site_name'
β
βββ easy_apply (bool):
| filters for jobs that are hosted on the job board site
β
βββ description_format (str):
| markdown, html (Format type of the job descriptions. Default is markdown.)
β
βββ offset (int):
| starts the search from an offset (e.g. 25 will start the search from the 25th result)
β
βββ hours_old (int):
| filters jobs by the number of hours since the job was posted
| (ZipRecruiter and Glassdoor round up to next day.)
β
βββ verbose (int) {0, 1, 2}:
| Controls the verbosity of the runtime printouts
| (0 prints only errors, 1 is errors+warnings, 2 is all logs. Default is 2.)
βββ linkedin_fetch_description (bool):
| fetches full description and direct job url for LinkedIn (Increases requests by O(n))
β
βββ linkedin_company_ids (list[int]):
| searches for linkedin jobs with specific company ids
|
βββ country_indeed (str):
| filters the country on Indeed & Glassdoor (see below for correct spelling)
|
βββ enforce_annual_salary (bool):
| converts wages to annual salary
βββ Indeed limitations:
| Only one from this list can be used in a search:
| - hours_old
| - job_type & is_remote
| - easy_apply
β
βββ LinkedIn limitations:
| Only one from this list can be used in a search:
| - hours_old
| - easy_apply
JobPost
βββ title
βββ company
βββ company_url
βββ job_url
βββ location
β βββ country
β βββ city
β βββ state
βββ description
βββ job_type: fulltime, parttime, internship, contract
βββ job_function
β βββ interval: yearly, monthly, weekly, daily, hourly
β βββ min_amount
β βββ max_amount
β βββ currency
β βββ salary_source: direct_data, description (parsed from posting)
βββ date_posted
βββ emails
βββ is_remote
Linkedin specific
βββ job_level
Linkedin & Indeed specific
βββ company_industry
Indeed specific
βββ company_country
βββ company_addresses
βββ company_employees_label
βββ company_revenue_label
βββ company_description
βββ ceo_name
βββ ceo_photo_url
βββ logo_photo_url
βββ banner_photo_url
LinkedIn searches globally & uses only the location
parameter.
ZipRecruiter searches for jobs in US/Canada & uses only the location
parameter.
Indeed & Glassdoor supports most countries, but the country_indeed
parameter is required. Additionally, use the location
parameter to narrow down the location, e.g. city & state if necessary.
You can specify the following countries when searching on Indeed (use the exact name, * indicates support for Glassdoor):
Argentina | Australia* | Austria* | Bahrain |
Belgium* | Brazil* | Canada* | Chile |
China | Colombia | Costa Rica | Czech Republic |
Denmark | Ecuador | Egypt | Finland |
France* | Germany* | Greece | Hong Kong* |
Hungary | India* | Indonesia | Ireland* |
Israel | Italy* | Japan | Kuwait |
Luxembourg | Malaysia | Mexico* | Morocco |
Netherlands* | New Zealand* | Nigeria | Norway |
Oman | Pakistan | Panama | Peru |
Philippines | Poland | Portugal | Qatar |
Romania | Saudi Arabia | Singapore* | South Africa |
South Korea | Spain* | Sweden | Switzerland* |
Taiwan | Thailand | Turkey | Ukraine |
United Arab Emirates | UK* | USA* | Uruguay |
Venezuela | Vietnam* |
- Indeed is the best scraper currently with no rate limiting.
- All the job board endpoints are capped at around 1000 jobs on a given search.
- LinkedIn is the most restrictive and usually rate limits around the 10th page with one ip. Proxies are a must basically.
Q: Why is Indeed giving unrelated roles?
A: Indeed is searching each one of your terms e.g. software intern, it searches software OR intern. Try search_term='"software intern"' in quotes for stricter searching
Q: Received a response code 429?
A: This indicates that you have been blocked by the job board site for sending too many requests. All of the job board sites are aggressive with blocking. We recommend:
- Wait some time between scrapes (site-dependent).
- Try using the proxies param to change your IP address.
Q: Encountering issues with your queries?
A: Try reducing the number of results_wanted
and/or broadening the filters. If problems
persist, submit an issue.