Metadata-Version: 2.3
Name: rbc_pdf_statement_parser
Version: 0.1.1
Summary: A Python-based parser for RBC Business Account statements
Project-URL: homepage, https://github.com/DeflateAwning/rbc-pdf-statement-parser
Project-URL: repository, https://github.com/DeflateAwning/rbc-pdf-statement-parser
Project-URL: issues, https://github.com/DeflateAwning/rbc-pdf-statement-parser/issues
Project-URL: license, https://github.com/DeflateAwning/rbc-pdf-statement-parser/blob/main/LICENSE
Author: DeflateAwning
License-Expression: Unlicense
License-File: LICENSE
Classifier: License :: OSI Approved :: The Unlicense (Unlicense)
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Requires-Dist: jpype1
Requires-Dist: orjson
Requires-Dist: pandas
Requires-Dist: polars
Requires-Dist: pyarrow
Requires-Dist: pymupdf
Requires-Dist: tabula-py
Description-Content-Type: text/markdown

# rbc-pdf-statement-parser
A Python-based parser for RBC Business Account statements.

## Usage

1. Collect all PDF account statements in a single folder, with no subfolders.
2. Install this library with `python3 -m pip install rbc_pdf_statement_parser`.
3. Run with `python3 -m rbc_pdf_statement_parser <input_directory> <output_directory>`.
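
The steps above can also be scripted. The following is a minimal sketch, not part of this library: the helper names `list_statement_pdfs`, `build_command`, and `run_parser` are hypothetical, and it only wraps the documented `python3 -m rbc_pdf_statement_parser <input_directory> <output_directory>` command. It assumes the package is installed in the current interpreter and that creating the output directory ahead of time is harmless.

```python
import subprocess
import sys
from pathlib import Path


def list_statement_pdfs(input_dir: Path) -> list[Path]:
    """Return the PDFs directly inside input_dir (subfolders are ignored)."""
    return sorted(
        p for p in input_dir.iterdir()
        if p.is_file() and p.suffix.lower() == ".pdf"
    )


def build_command(input_dir: Path, output_dir: Path) -> list[str]:
    """Build the documented `python -m rbc_pdf_statement_parser` invocation."""
    return [
        sys.executable, "-m", "rbc_pdf_statement_parser",
        str(input_dir), str(output_dir),
    ]


def run_parser(input_dir: Path, output_dir: Path) -> None:
    """Run the parser as a subprocess; raises CalledProcessError on failure."""
    if not list_statement_pdfs(input_dir):
        raise FileNotFoundError(f"No PDF files found directly in {input_dir}")
    # Assumption: pre-creating the output directory does not upset the parser.
    output_dir.mkdir(parents=True, exist_ok=True)
    subprocess.run(build_command(input_dir, output_dir), check=True)
```

For example, `run_parser(Path("statements"), Path("parsed"))` would process every PDF sitting directly in `statements/`; both folder names are placeholders.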

## Features

* Converts each PDF to a CSV of transactions and a JSON file of metadata.
* Creates one CSV and one Parquet file that inventory the metadata of every PDF.
* Creates one CSV and one Parquet file containing every transaction from every PDF.
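
After a run, the output directory should hold a mix of CSV, JSON, and Parquet files. The exact file names and layout are not documented here, so the sketch below makes no assumption about them: the hypothetical helper `group_outputs_by_type` simply groups whatever it finds by file extension, which is a quick way to check that each kind of output was produced.

```python
from collections import defaultdict
from pathlib import Path


def group_outputs_by_type(output_dir: Path) -> dict[str, list[Path]]:
    """Group every file under output_dir by its lowercase extension."""
    groups: dict[str, list[Path]] = defaultdict(list)
    for path in sorted(output_dir.rglob("*")):
        if path.is_file():
            groups[path.suffix.lower()].append(path)
    return dict(groups)


def print_output_summary(output_dir: Path) -> None:
    """Print how many files of each type the parser left behind."""
    for suffix, paths in sorted(group_outputs_by_type(output_dir).items()):
        print(f"{suffix or '(no extension)'}: {len(paths)} file(s)")
```

The combined CSV and Parquet files can then be opened with `pandas` or `polars`, both of which are already dependencies of this package.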
