Metadata-Version: 2.3
Name: dataidea
Version: 0.1.22
Summary: A package to simplify common data analysis tasks
License: MIT
Author: jumashafara
Author-email: jumashafara0@gmail.com
Requires-Python: >=3.10,<4.0
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Requires-Dist: joblib (>=1.3.2,<2.0.0)
Requires-Dist: numpy (>=2.2.5,<3.0.0)
Requires-Dist: pandas (>=2.2.3,<3.0.0)
Requires-Dist: plotly (>=6.0.1,<7.0.0)
Requires-Dist: python-dotenv (>=1.1.0,<2.0.0)
Requires-Dist: requests (>=2.32.3,<3.0.0)
Requires-Dist: rich (>=14.0.0,<15.0.0)
Requires-Dist: scikit-learn (>=1.6.1,<2.0.0)
Requires-Dist: scipy (>=1.12.0,<2.0.0)
Requires-Dist: yt-dlp (>=2025.3.31,<2026.0.0)
Project-URL: Documentation, https://docs.dataidea.org
Project-URL: Repository, https://github.com/jumashafara/dataidea
Description-Content-Type: text/markdown

# DataIdea

A Python package to simplify common data analysis tasks and workflows.

## Overview

DataIdea provides utilities for data analysts to streamline repetitive tasks in the data analysis process. It builds upon popular libraries like pandas, numpy, and scikit-learn to offer a more user-friendly interface for common operations.

## Features

- **Dataset Management**: Easily load built-in datasets or your own custom data
- **Model Persistence**: Simple functions to save and load machine learning models
- **Performance Monitoring**: Time your function execution with the `timer` decorator
- **Logging Utilities**: Log events and LLM interactions via our API
- **YouTube Integration**: Download video data for analysis

## Installation

```bash
pip install dataidea
```

For development:
```bash
git clone https://github.com/dataideaorg/dataidea.git
cd dataidea
poetry install
```

## Quick Start

```python
import dataidea as di

# Load a built-in dataset
df = di.load_dataset('titanic')

# Save a machine learning model
from sklearn.ensemble import RandomForestClassifier
model = RandomForestClassifier().fit(X, y)
di.save_model(model, 'model.di')

# Load the model
loaded_model = di.load_model('model.di')

# Time a function
from dataidea.utils import timer
@timer
def process_data(data):
    return processed_data

# Download YouTube video
from dataidea.io import download_youtube
download_youtube(url="https://www.youtube.com/watch?v=example", output_folder="videos")
```

## Available Datasets

DataIdea includes datasets for practice: `titanic`, `boston`, `fpl`, `mall`, `air_passengers`, `melbourne`, and more.

## Documentation

For detailed documentation, visit [https://docs.dataidea.org](https://docs.dataidea.org).

## License

MIT License
