feat(doc): add split_thinking docs (#2613) [skip ci]
* feat(doc): add split_thinking docs * fix: link config.qmd to conversation.qmd for split_thinking example * update thinking => reasoning_content in messages format --------- Co-authored-by: Wing Lian <wing@axolotl.ai>
This commit is contained in:
@@ -184,8 +184,8 @@ datasets:
|
|||||||
# adding a system turn with empty content.
|
# adding a system turn with empty content.
|
||||||
drop_system_message:
|
drop_system_message:
|
||||||
|
|
||||||
# Optional[bool]. Whether to split the assistant turn based on a reasoning trace inside delimited tags
|
# Optional[bool]. (for Qwen3 template only) Whether to split the assistant content based on a reasoning trace inside delimited tags
|
||||||
# defaults to False
|
# See example at `docs/dataset-formats/conversation.qmd`
|
||||||
split_thinking:
|
split_thinking:
|
||||||
|
|
||||||
# IMPORTANT: The following fields determine which parts of the conversation to train on.
|
# IMPORTANT: The following fields determine which parts of the conversation to train on.
|
||||||
|
|||||||
@@ -196,6 +196,34 @@ datasets:
|
|||||||
It is not necessary to set both `message_field_training` and `message_field_training_detail` at once.
|
It is not necessary to set both `message_field_training` and `message_field_training_detail` at once.
|
||||||
:::
|
:::
|
||||||
|
|
||||||
|
8. (For Qwen3 template only) Enable reasoning split, where the reasoning is split from the content and passed as a separate field into the template.
|
||||||
|
|
||||||
|
```yaml
|
||||||
|
datasets:
|
||||||
|
- path: ...
|
||||||
|
type: chat_template
|
||||||
|
chat_template: qwen3
|
||||||
|
split_thinking: true
|
||||||
|
```
|
||||||
|
|
||||||
|
For example, a content can look like:
|
||||||
|
|
||||||
|
```json
|
||||||
|
{
|
||||||
|
"content": "<think>Some thinking outputs</think>Output after thinking."
|
||||||
|
}
|
||||||
|
```
|
||||||
|
|
||||||
|
After split, it will look like:
|
||||||
|
|
||||||
|
```json
|
||||||
|
{
|
||||||
|
"reasoning_content": "Some thinking outputs",
|
||||||
|
"content": "Output after thinking..."
|
||||||
|
}
|
||||||
|
```
|
||||||
|
|
||||||
|
|
||||||
## sharegpt
|
## sharegpt
|
||||||
|
|
||||||
::: {.callout-important}
|
::: {.callout-important}
|
||||||
|
|||||||
Reference in New Issue
Block a user