Input Data Format ========================= ## Automated Validation {{ rime_data_format_check_redirect }} --- ## Supported File Formats RIME Tabular currently supports both CSV (`.csv`) and [Parquet](https://parquet.apache.org/docs/file-format/) (`.parquet`) file formats, with task-specific nuances defined below. Input files should have header columns in string format --- these will be used as feature names. RIME is most effective when both label and prediction column are provided; however, neither are required for most tasks\*. ## Requirements By Task ### Binary Classification - Labels should be integer values 0 or 1 - Predictions should be float values between 0 and 1 that represent the positive class (label = 1) probability ### Multi-Class Classification - Labels should be integers referring to class index - Predictions should be uploaded as a separate `.csv` or `.parquet` file. Columns should be ordered, with the **i**th column representing the probability of the **i**th class. Predictions should sum to 1. ### Ranking - \* **Labels are required** - Labels should be any real number - Predictions should be any real number - `ranking_info` must be provided in the [data configuration](data_source.md) ### Regression - Labels should be any real number - Predictions should be any real number