puppy-csslib

A CSS Selector library to parse html input and give json output with css selectors.


Keywords
puppy, csslib, pupeteer, selectorlib
License
MIT
Install
npm install puppy-csslib@1.1.0

Documentation

puppy-csslib

A CSS Selector library to read YAML files to parse static or SPA Pupeteer page and give json output with css selectors.

  • Free software: MIT license

Example

Example code is provided in test.js

Goal

  • To scrap SPA web applications with puppeteer in a generic way with configuration support.
  • also have javascript click function to expand some text so that can be scraped as well.
  • Text replace to replace scraped text with empty string

Features

  • replace text
  • css selectors
  • select and format attribute & text content
  • trim and normalize the text
  • uses document.querySelector and querySelectorAll