axolotl/docs at 87565ecc05f1b8fd1f8b907dd750d3a5d09adf9a - axolotl - Gitea

tocmo0nlord/axolotl

Leonard 87565ecc05 Add chat_template.argilla_chat support for DPO datasets (#3202)

* Add chat_template.argilla_chat support for DPO datasets

  Creates a new chat_template.argilla_chat prompt strategy for handling
  DPO datasets where chosen/rejected fields contain full conversations
  (messages + final response), following the pattern of chatml.argilla_chat
  and llama3.argilla_chat.

  - Add argilla_chat() function to chat_template.py
  - Add chat_template.argilla_chat to RLHF documentation
  - Add test coverage for argilla_chat with multiple tokenizers

  Dataset format:
  {
    "chosen": [
      {"role": "user", "content": "..."},
      {"role": "assistant", "content": "..."}
    ],
    "rejected": [
      {"role": "user", "content": "..."},
      {"role": "assistant", "content": "..."}
    ]
  }

* Fix chat_template.argilla_chat return value contract and add docstring

- Return (transform_fn, dataset_kwargs) tuple instead of bare transform_fn
- Add remove_columns specification for field_chosen and field_rejected
- Add comprehensive docstring with Args/Returns sections
- Update tests to unpack tuple return value

Addresses PR feedback to maintain consistency with chat_template.default()
and properly specify columns to remove after dataset transformation.

* Update tests/prompt_strategies/test_dpo_chat_templates.py

Co-authored-by: Wing Lian <wing.lian@gmail.com>

2025-10-17 17:00:26 +07:00
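
The dataset format and return contract described in the commit message above can be illustrated with a minimal sketch. This is not the code added to chat_template.py: the helper name make_argilla_chat_transform and the output keys prompt_messages, chosen_response and rejected_response are hypothetical, and no tokenizer or chat template is applied. It only shows a record whose chosen/rejected fields hold full conversations being split into shared prompt turns plus two final responses, and a (transform_fn, dataset_kwargs) tuple whose remove_columns entry names the original fields.

```python
from typing import Any, Callable

Message = dict[str, str]
Record = dict[str, Any]


def make_argilla_chat_transform(
    field_chosen: str = "chosen",
    field_rejected: str = "rejected",
) -> tuple[Callable[[Record], Record], dict[str, Any]]:
    """Hypothetical helper: build a (transform_fn, dataset_kwargs) pair."""

    def transform_fn(sample: Record) -> Record:
        chosen: list[Message] = sample[field_chosen]
        rejected: list[Message] = sample[field_rejected]
        # Assumption: every turn except the last is the prompt, and the
        # last turn of each conversation is the response being compared.
        return {
            "prompt_messages": chosen[:-1],
            "chosen_response": chosen[-1]["content"],
            "rejected_response": rejected[-1]["content"],
        }

    # Columns the caller should drop once the transform has been applied.
    dataset_kwargs = {"remove_columns": [field_chosen, field_rejected]}
    return transform_fn, dataset_kwargs


# Illustrative record in the format shown in the commit message.
sample = {
    "chosen": [
        {"role": "user", "content": "What is 2 + 2?"},
        {"role": "assistant", "content": "4"},
    ],
    "rejected": [
        {"role": "user", "content": "What is 2 + 2?"},
        {"role": "assistant", "content": "5"},
    ],
}

transform_fn, dataset_kwargs = make_argilla_chat_transform()
out = transform_fn(sample)
```

Returning the keyword arguments alongside the function lets the caller decide how to apply them, for example by passing both to a dataset mapping call; the sketch itself does not depend on any dataset library.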

dataset-formats

feat: support training with JSON string tool arguments (#3136)

2025-09-25 12:06:21 +07:00

Ray Train Axolotl Integration (#2251)

2025-01-29 00:10:19 -05:00

Add ruff, remove black, isort, flake8, pylint (#3092)

2025-08-23 23:37:33 -04:00

.gitignore

Config doc autogen (#2718)

2025-06-18 15:36:53 -04:00

amd_hpc.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348)

2025-02-25 16:09:37 +07:00

batch_vs_grad.qmd

Feat: update doc (#1475) [skip ci]

2024-04-04 13:43:40 +09:00

cli.qmd

CLI: add --launcher option, support launcher args, cleanup, refactor (#2924)

2025-07-30 15:46:56 -04:00

custom_integrations.qmd

densemixer plugin integration (#2868)

2025-07-07 17:05:19 -04:00

dataset_loading.qmd

Config doc autogen (#2718)

2025-06-18 15:36:53 -04:00

dataset_preprocessing.qmd

Autodoc generation with quartodoc (#2419)

2025-03-21 12:26:47 -04:00

debugging.qmd

feat:add support dataset_num_processes (#3129) [skip ci]

2025-10-13 17:18:12 +07:00

docker.qmd

feat(doc): re-add docker 2.7.0 tag back (#2902) [skip ci]

2025-07-12 11:40:01 -04:00

faq.qmd

feat: support training with JSON string tool arguments (#3136)

2025-09-25 12:06:21 +07:00

fsdp_qlora.qmd

Add FSDP v2 swap memory support + QLoRA compatibility fixes (#3167)

2025-09-26 10:23:59 +01:00

getting-started.qmd

Config doc autogen (#2718)

2025-06-18 15:36:53 -04:00

gradient_checkpointing.qmd

Activation Offloading w CUDA Streams (#2900) [skip ci]

2025-07-14 20:10:20 -04:00

inference.qmd

Feat: minor docs improvements for RLHF and faq on embeddings (#2401) [skip ci]

2025-03-17 08:39:04 -04:00

input_output.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348)

2025-02-25 16:09:37 +07:00

installation.qmd

feat(doc): improve visibility for colab notebooks (#3110) [skip ci]

2025-09-03 01:40:53 -04:00

lora_optims.qmd

doc fix (#3187)

2025-09-26 09:55:15 -04:00

lr_groups.qmd

support for custom lr groups for non-embedding modules (#2213)

2025-01-24 12:56:28 -05:00

mac.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348)

2025-02-25 16:09:37 +07:00

mixed_precision.qmd

basic torchao fp8 mixed precision training (#2926)

2025-07-22 16:27:47 -04:00

multi-gpu.qmd

fix(doc): add act checkpointing migration to fsdp2 docs (#3193) [skip ci]

2025-10-10 10:57:50 +07:00

multi-node.qmd

CLI: add --launcher option, support launcher args, cleanup, refactor (#2924)

2025-07-30 15:46:56 -04:00

multimodal.qmd

fix: unify default for conversations_field [skip-e2e] (#3070)

2025-09-23 21:22:15 +07:00

multipack.qmd

Bootstrap Hosted Axolotl Docs w/Quarto (#1429)

2024-03-21 22:28:36 -07:00

nccl.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348)

2025-02-25 16:09:37 +07:00

nd_parallelism.qmd

feat: update nd parallelism readme (#3039)

2025-08-08 12:45:36 +01:00

optimizations.qmd

feat(doc): add optimizations table of content to our improvements (#3175) [skip ci]

2025-09-24 16:13:49 -04:00

optimizers.qmd

feat: add complete optimizer docs (#3017) [skip ci]

2025-08-06 08:01:51 -04:00

qat.qmd

feat(doc): add optimizations table of content to our improvements (#3175) [skip ci]

2025-09-24 16:13:49 -04:00

quantize.qmd

qat doc updates (#3162) [skip-ci]

2025-09-17 10:38:15 +01:00

ray-integration.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348)

2025-02-25 16:09:37 +07:00

reward_modelling.qmd

Center rewards coefficient (#3124)

2025-09-03 16:22:37 -04:00

rlhf.qmd

Add chat_template.argilla_chat support for DPO datasets (#3202)

2025-10-17 17:00:26 +07:00

sequence_parallelism.qmd

Distributed/ND-Parallel (#2977)

2025-07-31 15:25:02 -04:00

streaming.qmd

Streaming SFT support (#3101)

2025-09-02 12:08:44 -04:00

torchao.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348)

2025-02-25 16:09:37 +07:00

unsloth.qmd

Feat(doc): Reorganize documentation, fix broken syntax, update notes (#2348)

2025-02-25 16:09:37 +07:00