Metadata-Version: 2.1
Name: sibi-dst
Version: 0.3.57
Summary: Data Science Toolkit
Author: Luis Valverde
Author-email: lvalverdeb@gmail.com
Requires-Python: >=3.11,<4.0
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Provides-Extra: complete
Provides-Extra: df-helper
Provides-Extra: geospatial
Requires-Dist: apache-airflow-client (>=2.10.0,<3.0.0)
Requires-Dist: chardet (>=5.2.0,<6.0.0)
Requires-Dist: charset-normalizer (>=3.4.0,<4.0.0)
Requires-Dist: clickhouse-connect (>=0.8.7,<0.9.0)
Requires-Dist: clickhouse-driver (>=0.2.9,<0.3.0)
Requires-Dist: dask[complete] (>=2025.3.0,<2026.0.0)
Requires-Dist: django (>=5.1.4,<6.0.0) ; extra == "df-helper" or extra == "complete"
Requires-Dist: djangorestframework (>=3.15.2,<4.0.0) ; extra == "df-helper" or extra == "complete"
Requires-Dist: folium (>=0.19.4,<0.20.0) ; extra == "geospatial" or extra == "complete"
Requires-Dist: geopandas (>=1.0.1,<2.0.0) ; extra == "geospatial" or extra == "complete"
Requires-Dist: geopy (>=2.4.1,<3.0.0) ; extra == "geospatial" or extra == "complete"
Requires-Dist: gunicorn (>=23.0.0,<24.0.0)
Requires-Dist: httpx (>=0.27.2,<0.28.0)
Requires-Dist: ipython (>=8.29.0,<9.0.0)
Requires-Dist: jinja2 (>=3.1.4,<4.0.0)
Requires-Dist: mysqlclient (>=2.2.6,<3.0.0) ; extra == "df-helper" or extra == "complete"
Requires-Dist: nltk (>=3.9.1,<4.0.0)
Requires-Dist: openpyxl (>=3.1.5,<4.0.0)
Requires-Dist: osmnx (>=2.0.1,<3.0.0) ; extra == "geospatial" or extra == "complete"
Requires-Dist: pandas (>=2.2.3,<3.0.0)
Requires-Dist: paramiko (>=3.5.0,<4.0.0)
Requires-Dist: psutil (>=6.1.0,<7.0.0)
Requires-Dist: psycopg2 (>=2.9.10,<3.0.0) ; extra == "df-helper" or extra == "complete"
Requires-Dist: pyarrow (>=18.0.0,<19.0.0)
Requires-Dist: pydantic (>=2.9.2,<3.0.0)
Requires-Dist: pymysql (>=1.1.1,<2.0.0) ; extra == "df-helper" or extra == "complete"
Requires-Dist: pytest (>=8.3.3,<9.0.0)
Requires-Dist: pytest-mock (>=3.14.0,<4.0.0)
Requires-Dist: python-dotenv (>=1.0.1,<2.0.0)
Requires-Dist: s3fs (>=2024.12.0,<2025.0.0)
Requires-Dist: sqlalchemy (>=2.0.36,<3.0.0) ; extra == "df-helper" or extra == "complete"
Requires-Dist: sqlmodel (>=0.0.22,<0.0.23) ; extra == "df-helper" or extra == "complete"
Requires-Dist: tornado (>=6.4.1,<7.0.0)
Requires-Dist: tqdm (>=4.67.0,<5.0.0)
Requires-Dist: uvicorn (>=0.34.0,<0.35.0)
Requires-Dist: uvicorn-worker (>=0.3.0,<0.4.0)
Requires-Dist: webdav4[fsspec] (>=0.10.0,<0.11.0)
Description-Content-Type: text/markdown

# SIBI-DST

Data Science Toolkit built with Python, Pandas, Dask, OpenStreetMap, NetworkX, SQLAlchemy, GeoPandas, and Folium.

## Example Use Cases

1. **Build DataCubes, DataSets, and DataObjects** from diverse data sources, including **relational databases, Parquet files, Excel (`.xlsx`), delimited tables (`.csv`, `.tsv`), JSON, and RESTful APIs**.
2. **Comprehensive DataFrame Management** utilities for efficient data handling, transformation, and optimization using **Pandas** and **Dask**.
3. **Flexible Data Sharing** with client applications by writing to **ClickHouse data warehouses, local filesystems, and cloud storage platforms** such as **S3**.
4. **Microservices for Data Access** – Build scalable **API-driven services** with **Django REST Framework** or **FastAPI** for data exchange.
5. **Geospatial Analysis** – Use **OpenStreetMap** and **GeoPandas** for geospatial data processing and visualization.

## Supported Technologies

- **Data Processing**: Pandas, Dask
- **Databases & Storage**: SQLAlchemy, Parquet, S3, ClickHouse
- **Mapping & Geospatial Analysis**: OpenStreetMap, OSMnx, GeoPandas, Folium, Geopy
- **API Development**: Django REST Framework, FastAPI

## Installation

```bash
# with pip (quote the requirement so shells such as zsh do not expand the brackets)

pip install "sibi-dst[complete]"    # Core package plus all optional dependencies
pip install "sibi-dst[df-helper]"   # Core package plus the df-helper dependencies
pip install "sibi-dst[geospatial]"  # Core package plus the geospatial dependencies

# with poetry

poetry add "sibi-dst[complete]"    # Core package plus all optional dependencies
poetry add "sibi-dst[df-helper]"   # Core package plus the df-helper dependencies
poetry add "sibi-dst[geospatial]"  # Core package plus the geospatial dependencies
```
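
To confirm that the install succeeded, one option is to query the installed distribution metadata with the standard library. The sketch below is not part of `sibi-dst`; the helper name `installed_version` is hypothetical, and it assumes only that the distribution is published under the name `sibi-dst`, as in the commands above.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent.

    Hypothetical helper for illustration; it only wraps importlib.metadata.
    """
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("sibi-dst")
    if found is None:
        print("sibi-dst is not installed in this environment")
    else:
        print(f"sibi-dst {found} is installed")
```

The lookup uses the distribution name rather than an import name, so it works without knowing the package's module layout.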

