axolotl/tests at 383f220cfd658804f4c508a0686c988861ecffbe - axolotl - Gitea

tocmo0nlord/axolotl

Leonard 87565ecc05 Add chat_template.argilla_chat support for DPO datasets (#3202)

* Add chat_template.argilla_chat support for DPO datasets

  Creates a new chat_template.argilla_chat prompt strategy for handling
  DPO datasets where chosen/rejected fields contain full conversations
  (messages + final response), following the pattern of chatml.argilla_chat
  and llama3.argilla_chat.

  - Add argilla_chat() function to chat_template.py
  - Add chat_template.argilla_chat to RLHF documentation
  - Add test coverage for argilla_chat with multiple tokenizers

  Dataset format:
  {
    "chosen": [
      {"role": "user", "content": "..."},
      {"role": "assistant", "content": "..."}
    ],
    "rejected": [
      {"role": "user", "content": "..."},
      {"role": "assistant", "content": "..."}
    ]
  }

* Fix chat_template.argilla_chat return value contract and add docstring

- Return (transform_fn, dataset_kwargs) tuple instead of bare transform_fn
- Add remove_columns specification for field_chosen and field_rejected
- Add comprehensive docstring with Args/Returns sections
- Update tests to unpack tuple return value

Addresses PR feedback to maintain consistency with chat_template.default()
and properly specify columns to remove after dataset transformation.
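
The dataset format and the (transform_fn, dataset_kwargs) return contract described above can be illustrated with a minimal sketch. This is not the code added in this commit: `argilla_chat_sketch`, the `render` helper, the `<|role|>` markup, and the output keys `prompt`/`chosen`/`rejected` are hypothetical names chosen for illustration, and a real strategy would presumably render the prompt through the tokenizer's chat template rather than by string concatenation.

```python
from typing import Any, Callable

Message = dict[str, str]
Sample = dict[str, Any]


def argilla_chat_sketch(
    field_chosen: str = "chosen",
    field_rejected: str = "rejected",
) -> tuple[Callable[[Sample], Sample], dict[str, Any]]:
    """Hypothetical stand-in for an argilla_chat-style DPO strategy.

    Returns a (transform_fn, dataset_kwargs) tuple, mirroring the
    contract described in the commit message.
    """

    def render(messages: list[Message]) -> str:
        # Assumption: placeholder markup, not a real chat template.
        return "".join(f"<|{m['role']}|>{m['content']}\n" for m in messages)

    def transform_fn(sample: Sample) -> Sample:
        chosen = sample[field_chosen]
        rejected = sample[field_rejected]
        # Each field holds a full conversation: every message before the
        # last one forms the prompt, the last one is the final response.
        return {
            "prompt": render(chosen[:-1]),
            "chosen": chosen[-1]["content"],
            "rejected": rejected[-1]["content"],
        }

    # Source columns the caller should drop after the transformation.
    dataset_kwargs = {"remove_columns": [field_chosen, field_rejected]}
    return transform_fn, dataset_kwargs


sample = {
    "chosen": [
        {"role": "user", "content": "2+2?"},
        {"role": "assistant", "content": "4"},
    ],
    "rejected": [
        {"role": "user", "content": "2+2?"},
        {"role": "assistant", "content": "5"},
    ],
}
transform_fn, dataset_kwargs = argilla_chat_sketch()
result = transform_fn(sample)
```

Callers unpack the tuple, as the updated tests are said to do, and pass `dataset_kwargs` on to whatever applies the transform to the dataset.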

* Update tests/prompt_strategies/test_dpo_chat_templates.py

Co-authored-by: Wing Lian <wing.lian@gmail.com>

2025-10-17 17:00:26 +07:00

Add ruff, remove black, isort, flake8, pylint (#3092)

2025-08-23 23:37:33 -04:00

feat:add support dataset_num_processes (#3129) [skip ci]

2025-10-13 17:18:12 +07:00

upgrade transformers==4.57.1 and peft==0.23.1 (#3214)

2025-10-14 15:54:05 -04:00

Respect sequence_len in config for type: llama2_chat (#926)

2023-12-12 09:39:22 -08:00

text diffusion training plugin (#3067)

2025-09-10 20:27:00 -04:00

Cp fix (#3182)

2025-09-25 12:03:50 -04:00

Add ruff, remove black, isort, flake8, pylint (#3092)

2025-08-23 23:37:33 -04:00

prompt_strategies

Add chat_template.argilla_chat support for DPO datasets (#3202)

2025-10-17 17:00:26 +07:00

Add FSDP v2 swap memory support + QLoRA compatibility fixes (#3167)

2025-09-26 10:23:59 +01:00

__init__.py

fix: minor patches for multimodal (#2441)

2025-03-31 13:40:12 +07:00

conftest.py

Add ruff, remove black, isort, flake8, pylint (#3092)

2025-08-23 23:37:33 -04:00

constants.py

Add ruff, remove black, isort, flake8, pylint (#3092)

2025-08-23 23:37:33 -04:00

hf_offline_utils.py

Add ruff, remove black, isort, flake8, pylint (#3092)

2025-08-23 23:37:33 -04:00

test_chunked_xentropy.py

Add ruff, remove black, isort, flake8, pylint (#3092)

2025-08-23 23:37:33 -04:00

test_data.py

Streaming SFT support (#3101)

2025-09-02 12:08:44 -04:00

test_datasets.py

feat:add support dataset_num_processes (#3129) [skip ci]

2025-10-13 17:18:12 +07:00

test_dict.py

Add ruff, remove black, isort, flake8, pylint (#3092)

2025-08-23 23:37:33 -04:00

test_exact_deduplication.py

feat:add support dataset_num_processes (#3129) [skip ci]

2025-10-13 17:18:12 +07:00

test_expand_mask.py

adding pre-commit auto-update GH action and bumping plugin versions (#2428)

2025-03-21 11:02:43 -04:00

test_freeze.py

Train parameters exclusively in specific ranges (#1390)

2024-03-14 11:05:42 -04:00

test_loaders.py

fix: transformers deprecate load_in_Xbit in model_kwargs (#3205)

2025-10-16 16:07:27 +07:00

test_logging_config_file_capture.py

Debug log, logging improvements (#3159)

2025-09-17 13:27:03 -04:00

test_lora.py

Add ruff, remove black, isort, flake8, pylint (#3092)

2025-08-23 23:37:33 -04:00

test_normalize_config.py

FSDPConfig (#3170)

2025-10-10 14:44:25 +01:00

test_packed_batch_sampler.py

Add ruff, remove black, isort, flake8, pylint (#3092)

2025-08-23 23:37:33 -04:00

test_packed_dataset.py

feat:add support dataset_num_processes (#3129) [skip ci]

2025-10-13 17:18:12 +07:00

test_packed_pretraining.py

Streaming SFT support (#3101)

2025-09-02 12:08:44 -04:00

test_perplexity.py

Add ruff, remove black, isort, flake8, pylint (#3092)

2025-08-23 23:37:33 -04:00

test_prompt_tokenizers.py

Add ruff, remove black, isort, flake8, pylint (#3092)

2025-08-23 23:37:33 -04:00

test_prompters.py

fix: prompt phi (#1845) [skip ci]

2024-08-22 11:46:57 -04:00

test_schedulers.py

Add ruff, remove black, isort, flake8, pylint (#3092)

2025-08-23 23:37:33 -04:00

test_streaming.py

text diffusion training plugin (#3067)

2025-09-10 20:27:00 -04:00

test_tokenizers.py

models.py -> loaders/ module refactor (#2680)

2025-05-23 15:51:11 -04:00

test_train.py

refactor dupes from merge/rebase (#2919) [skip ci]

2025-07-14 10:05:26 -04:00

test_utils_tee.py

Debug log, logging improvements (#3159)

2025-09-17 13:27:03 -04:00

test_validation_dataset.py

Add ruff, remove black, isort, flake8, pylint (#3092)

2025-08-23 23:37:33 -04:00