Watch our latest webinar to understand the difference between data from Libraries.io and the Tidelift Subscription.

Scrapy
Release 2.11.2

A high-level Web Crawling and Web Scraping framework

Homepage Repository PyPI Python

Keywords: crawler, crawling, framework, hacktoberfest, python, scraping, web-scraping, web-scraping-python
License: BSD-3-Clause
Install: pip install Scrapy==2.11.2

Documentation

Scrapy

Overview

Scrapy is a BSD-licensed fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.

Scrapy is maintained by Zyte (formerly Scrapinghub) and many other contributors.

Check the Scrapy homepage at https://scrapy.org for more information, including a list of features.

Requirements

Python 3.8+
Works on Linux, Windows, macOS, BSD

Install

The quick way:

pip install scrapy

See the install section in the documentation at https://docs.scrapy.org/en/latest/intro/install.html for more details.

Documentation

Documentation is available online at https://docs.scrapy.org/ and in the docs directory.

Releases

You can check https://docs.scrapy.org/en/latest/news.html for the release notes.

Community (blog, twitter, mail list, IRC)

See https://scrapy.org/community/ for details.

Contributing

See https://docs.scrapy.org/en/master/contributing.html for details.

Code of Conduct

Please note that this project is released with a Contributor Code of Conduct.

By participating in this project you agree to abide by its terms. Please report unacceptable behavior to opensource@zyte.com.

Companies using Scrapy

See https://scrapy.org/companies/ for a list.

Commercial Support

See https://scrapy.org/support/ for details.

Dependencies: 17
Dependent packages: 509
Dependent repositories: 317
Total releases: 99
Latest release: May 14, 2024
First release: Dec 12, 2009
Stars: 52.7K
Forks: 10.5K
Watchers: 1,775
Contributors: 549
Repository size: 25.1 MB
SourceRank: 25

Source repo 2FA enabled: TEXT!
Package manager 2FA enabled: TEXT!
Is security responsive: TEXT!
Dependencies are managed: TEXT!
Issue-free release available: TEXT!
Succession plan available: TEXT!

Releases

2.11.2: May 14, 2024
2.11.1: Feb 14, 2024
1.8.4: Feb 14, 2024
2.11.0: Sep 18, 2023
2.10.1: Aug 30, 2023
2.10.0: Aug 4, 2023
2.9.0: May 8, 2023
2.8.0: Feb 2, 2023
2.7.1: Nov 2, 2022
2.7.0: Oct 17, 2022

See all 99 releases

Contributors

See all contributors

Login to resync this project