Metadata-Version: 2.3
Name: datacleanx
Version: 0.1.0
Summary: Fast data cleaning engine for ML projects
License: MIT
Keywords: data-cleaning,ML,CLI,preprocessing
Author: Esther
Author-email: esther@example.com
Requires-Python: >=3.12,<4.0
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Requires-Dist: numpy (>=1.26,<2.0)
Requires-Dist: pandas (>=2.2,<3.0)
Requires-Dist: scikit-learn (>=1.5,<2.0)
Project-URL: Homepage, https://github.com/essiebx/datacleanx
Project-URL: Repository, https://github.com/essiebx/datacleanx
Description-Content-Type: text/markdown

# 🧼 datacleanx

**datacleanx** is a fast, CLI-first data cleaning engine for tabular datasets. It is designed for machine learning practitioners and data engineers who want to automate cleaning workflows from a single command-line interface.

---

## 🚀 Why datacleanx?

- 🔁 Automates repetitive cleaning steps
- 📦 Works out of the box with CSV files
- 📁 Outputs timestamped cleaned files and reports
- 🐳 Docker-ready for CI/CD and containerized workflows
- 🧪 Includes tests and reports for reproducibility

---

## 🔧 Features

- ✅ Imputation: `mean`, `median`, `mode`
- ✅ Encoding: `label`, `onehot`
- ✅ Outlier removal using IQR
- ✅ Feature scaling: `standard`, `minmax`, `robust`
- ✅ Auto-saves cleaned data to `outputs/`
- ✅ Saves reports as structured JSON
- ✅ CLI-first design, easily scriptable
- ✅ Docker and Poetry integration
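
To make the cleaning steps listed above concrete, here is a minimal, dependency-free sketch of median imputation, IQR-based outlier removal, and min-max scaling on a single numeric column. It is not datacleanx's implementation or API: the function names, the conventional `k=1.5` fence multiplier, and the inclusive quartile method are all assumptions chosen for illustration, and the package itself depends on pandas, NumPy, and scikit-learn rather than the standard library.

```python
import statistics


def impute_median(values):
    """Replace None entries with the median of the observed values."""
    observed = [v for v in values if v is not None]
    fill = statistics.median(observed)
    return [fill if v is None else v for v in values]


def iqr_bounds(values, k=1.5):
    """Return the (lower, upper) fences Q1 - k*IQR and Q3 + k*IQR.

    k=1.5 is the conventional default, assumed here for illustration.
    """
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def remove_outliers(values, k=1.5):
    """Keep only the values that fall inside the IQR fences."""
    lower, upper = iqr_bounds(values, k)
    return [v for v in values if lower <= v <= upper]


def minmax_scale(values):
    """Rescale values linearly to [0, 1]; a constant column maps to 0.0."""
    lo, hi = min(values), max(values)
    span = hi - lo
    return [0.0 if span == 0 else (v - lo) / span for v in values]


if __name__ == "__main__":
    raw = [10, 12, None, 11, 13, 12, 95]  # hypothetical column with a gap and an outlier
    cleaned = minmax_scale(remove_outliers(impute_median(raw)))
    print(cleaned)
```

The order matters in this sketch: imputing first means the quartiles are computed on a complete column, and removing outliers before scaling keeps one extreme value from compressing the rest of the range.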

---

## 📦 Installation

### ✅ Option 1: From PyPI

```bash
pip install datacleanx

