Metadata-Version: 2.1
Name: vigorish
Version: 0.4.16
Summary: Hybrid Python/Node.js web scraper for Major League Baseball (MLB) data.
Home-page: https://github.com/a-luna/vigorish
Author: Aaron Luna
Author-email: contact@aaronluna.dev
Maintainer: Aaron Luna
Maintainer-email: contact@aaronluna.dev
License: MIT
Project-URL: Bug Tracker, https://github.com/a-luna/vigorish/issues
Project-URL: Source Code, https://github.com/a-luna/vigorish
Project-URL: Documentation, https://aaronluna.dev/projects/vigorish
Platform: UNKNOWN
Classifier: Development Status :: 4 - Beta
Classifier: Environment :: Console
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Natural Language :: English
Classifier: Operating System :: MacOS :: MacOS X
Classifier: Operating System :: POSIX :: Linux
Classifier: Operating System :: Unix
Classifier: Programming Language :: JavaScript
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3 :: Only
Requires-Python: >=3.6
Description-Content-Type: text/markdown
Requires-Dist: aenum
Requires-Dist: boto3
Requires-Dist: bullet
Requires-Dist: click (==7.1.2)
Requires-Dist: dacite
Requires-Dist: dataclass-csv
Requires-Dist: enlighten
Requires-Dist: events
Requires-Dist: halo
Requires-Dist: lxml
Requires-Dist: naked
Requires-Dist: py-getch
Requires-Dist: pyfiglet
Requires-Dist: python-dateutil
Requires-Dist: python-dotenv
Requires-Dist: rapidfuzz
Requires-Dist: requests
Requires-Dist: scipy
Requires-Dist: sqlalchemy (==1.3.23)
Requires-Dist: sqlalchemy-utils
Requires-Dist: tabulate
Requires-Dist: tqdm
Requires-Dist: urllib3
Requires-Dist: w3lib

[![PyPI version](https://badge.fury.io/py/vigorish.svg)](https://badge.fury.io/py/vigorish) ![PyPI - Downloads](https://img.shields.io/pypi/dm/vigorish?color=%234DC71F) ![PyPI - License](https://img.shields.io/pypi/l/vigorish?color=%25234DC71F) ![PyPI - Python Version](https://img.shields.io/pypi/pyversions/vigorish) [![Maintainability](https://api.codeclimate.com/v1/badges/4a1753c77add039c3850/maintainability)](https://codeclimate.com/github/a-luna/vigorish/maintainability) [![codecov](https://codecov.io/gh/a-luna/vigorish/branch/master/graph/badge.svg)](https://codecov.io/gh/a-luna/vigorish)

## vigorish

`vigorish` is a hybrid Python/Node.js application that scrapes MLB data from mlb.com, brooksbaseball.net and baseball-reference.com.

My goal is to capture as much data as possible &mdash; ranging from PitchFX measurements at the most granular level to play-by-play data (play descriptions, substitutions, manager challenges, etc) and individual player pitch/bat stats at the highest level.

### Requirements

-   Python 3.6+
-   Node.js 10+ (Tested with Node.js 11-13)
-   Xvfb
-   AWS account (optional but recommended, used to store scraped data in S3)

### Project Documentation

For a step-by-step install guide and instructions for configuring/using `vigorish`, please visit the link below:

[Vigorish: Hybrid Python/Node.Js Web Scraper](https://aaronluna.dev/projects/vigorish/)

### Credits

`vigorish` either relies on the following projects listed below directly or as a dev dependency. It would not have been possible for me to create `vigorish` without these projects, thanks to all of the creators/maintainers for making these available (projects are listed alphabetically):

-   [aenum](https://github.com/ethanfurman/aenum)
-   [bullet](https://github.com/bchao1/bullet)
-   [click](https://github.com/pallets/click)
-   [dacite](https://github.com/konradhalas/dacite)
-   [dataclass-csv](https://github.com/dfurtado/dataclass-csv)
-   [enlighten](https://github.com/Rockhopper-Technologies/enlighten)
-   [events](https://github.com/pyeve/events)
-   [halo](https://github.com/manrajgrover/halo)
-   [lxml](https://github.com/lxml/lxml)
-   [naked](https://github.com/chrissimpkins/naked)
-   [rapidfuzz](https://github.com/maxbachmann/rapidfuzz)
-   [tabulate](https://github.com/astanin/python-tabulate)

