This project aims to apply predictive analytics to horse racing via Python.
Prior to using predictive_punter, the package must be installed in your current Python environment. In most cases, an automated installation via PyPI and pip will suffice, as follows:
pip install predictive_punter
If you would prefer to gain access to new (unstable) features via a pre-release version of the package, specify the 'pre' option when calling pip, as follows:
pip install --pre predictive_punter
To gain access to bleeding edge developments, the package can be installed from a source distribution. To do so, you will need to clone the git repository and execute the setup.py script from the root directory of the source tree, as follows:
git clone https://github.com/justjasongreen/predictive_punter.git cd predictive_punter python setup.py install
If you would prefer to install the package as a symlink to the source distribution (for development purposes), execute the setup.py script with the 'develop' option instead, as follows:
python setup.py develop
By installing predictive_punter, a number of command line utilities are made available in your current Python environment, as described below...
The 'scrape' command line utility can be used to populate a database with racing data scraped from the web. The syntax of the scrape command is:
scrape [-b] [-d <database_uri>] [-q] [-r <redis_uri>] [-v] date_from [date_to]
The mandatory date_from and optional date_to arguments must be in the format YYYY-MM-DD, and define the (inclusive) range of dates to scrape data for.
If the -b (or --backup-database) option is specified, all collections in the database will be cloned after each date successfully scraped. If an error occurs while scraping a date and the -b option has been specified, the collections in the database will be restored from the cloned collections before the script terminates.
The -d (or --database-uri=) option can be used to specify a URI for the target database. The target database must be a MongoDB version 2.6 or higher database. The default database URI is mongodb://localhost:27017/predictive_punter.
The -r (or --redis-uri=) option can be used to specify a URI for a redis server to be used for HTTP request caching. The default redis URI is redis://localhost:6379/predictive_punter. If a connection cannot be established with the specified redis server, the script will attempt to use the built in redislite service, or will run without HTTP request caching if the redislite service cannot be used.
The -q and -v (or --quiet and --verbose) options can be used to control the logging output generated by the scrape command. When the -q option is used, the logging level will be set to logging.WARNING. When the -v option is used, the logging level will be set to logging.DEBUG. By default, the logging level will be set to logging.INFO.
The 'seed' command line utility can be used to pre-seed query data for runners in the database. The syntax of the seed command is:
seed [-b] [-d <database_uri>] [-q] [-r <redis_uri>] [-v] date_from [date_to]
The application of the various command line options and arguments is the same as for the 'scrape' command described above.
Development and Testing
The source distribution includes a test suite based on pytest. To ensure compatibility with all supported versions of Python, it is recommended that the test suite be run via tox.
To install all development and test requirements into your current Python environment, execute the following command from the root directory of the source tree:
pip install -e .[dev,test]
To run the test suite included in the source distribution, execute the tox command from the root directory of the source tree as follows: