axolotl/docs/dataset-formats/stepwise_supervised.qmd

---
title: Stepwise Supervised Format
description: Format for datasets with stepwise completions and labels
order: 3
---

## Stepwise Supervised

The stepwise supervised format is designed for chain-of-thought (COT) reasoning
datasets where each example contains multiple completion steps and a preference label
for each step.

### Example

Here's a simple example of a stepwise supervised dataset entry:

```json
{
  "prompt": "Which number is larger, 9.8 or 9.11?",
  "completions": [
    "The fractional part of 9.8 is 0.8, while the fractional part of 9.11 is 0.11.",
    "Since 0.11 is greater than 0.8, the number 9.11 is larger than 9.8."
  ],
  "labels": [true, false]
}
```