nhl-logo-scraper

Scrape logos for all NHL teams


Keywords
cli
License
MIT
Install
pip install nhl-logo-scraper==1.1.0

Documentation

NHL Logo Scraper

https://travis-ci.org/blindman/nhl-logo-scraper.svg?branch=master

Scrapes logos for all NHL teams, leagues, and networks

Contents

Usage

NOTE: You must have cairo installed before using trying to run this package. In the near future I will try to find a python library that does not require installing an external dependency.

First clone the project.

$ git clone https://github.com/blindman/nhl-logo-scraper.git

Once you've cloned the project, run the following commands to get the utility working in your terminal.

$ pip install .
$ nhlscraper --help

Scrape

$ nhlscraper scrape [--output=DIR] [--format EXTENSIONS] [--archive]
--output=DIR The directory to dump all of the files [default: ./output]
--format EXTENSIONS
  Image output formats (svg and/or png) [default: svg]
--archive Archive any previous logos that were downloaded [default: True]
--debug Allow debugging logs through

Clean

$ nhlscraper clean [--dir=DIR] [--category CATEGORIES] [--full]
--dir=DIR The directory that the files live in [default: ./output]
--category CATEGORIES
  Categories to remove all files (league, network, team)
--full Clean all files created by this tool (includes archives)
--debug Allow debugging logs through

Other

$ nhlscraper -h | --help
$ nhlscraper --version

Contributing

Assuming you've cloned the project and created a virtualenv, run the following command to work with the code.

$ virtualenv --python python3 venv $ source venv/bin/activate $ pip install -e .[test]

To test the code, run the following command.

$ python setup.py test

Notes

This is a fork of my old imageDownloader repository which did the same thing but as a bash script and for the previous version of nhl.com

You may have noticed that nhl.com is requested and parsed to find ::code`nhl-logos.css` instead of retrieving the css file directly. This is because (I believe) the path to that file could potentially change. The current path is ::code::.../builds/.../f0683b50089da6c16c1df297a9a74ce08e79b94b_1507226889/.../nhl-logos.css and since the big chunk in the middle looks like a unique identifier I am going to assume it will change. I am looking to implement some sort of caching so it doesn't make a request every time.

License

MIT © [Jon Heller](https://github.com/blindman)