This folder contains examples and user specified data files.

The diabetes data set is provided in different formats.
It was taken from https://www.openml.org/search?type=data&sort=runs&id=41519&status=active

Note: The column "class" was renamed to "target".

Number of Instances: 442
Number of Attributes: First 10 columns are numeric predictive values
Target: Column 11 is a quantitative measure of disease progression one year after baseline

It is available in the following formats:

- csv: diabetes.csv
- pkl: diabetes.pkl

Furthermore, it is available in spotPython as a torch DataSet, see:
https://sequential-parameter-optimization.github.io/spotPython/reference/spotPython/data/diabetes/
