WebScraper

A straightforward web scraper written in PHP, with support for parallel processing and HTML5.

Installation

To start using this package, add it to your composer.json file and call composer install, then include the generated autoload.php in your project. Alternatively, download and include the package along with its dependencies directly into your project.

Dependencies

Usage

The scraper takes 2 inputs: an array of Request Options that define the resources to gather, and an array of Extracton Rules to specify what data we're looking for in those resources. For more information on Request Options or Extraction Rules, read the respective docs.

require 'autoload.php';

$rules = 'path/to/rules.json';
$options = [
	'foo' => ['URL' => 'https://...']
];

$scraper = new WebScraper($rules);
$result = $scraper->start($options);

Stats

Dependent repositories

Total releases

Latest release

Nov 27, 2020

First release

May 2, 2020

Stars

Forks

Watchers

Contributors

Repository size

18.6 KB

SourceRank

Development practices

Source repo 2FA enabled

TEXT!

Package manager 2FA enabled

TEXT!

Is security responsive

TEXT!

Dependencies are managed

TEXT!

Issue-free release available

TEXT!

Succession plan available

TEXT!

Package manager 2FA enabled

TEXT!

The Tidelift Subscription provides access to a continuously curated stream of human-researched and maintainer-verified data on open source packages and their licenses, releases, vulnerabilities, and development practices.

Learn more →

ppajer/webscraper
Release

Release

Documentation

WebScraper

Installation

Dependencies

Usage

Stats

Development practices

Contributors

ppajer/webscraper Release

Release Toggle Dropdown

Documentation

WebScraper

Installation

Dependencies

Usage

Stats

Development practices

Contributors

ppajer/webscraper
Release

Release