axolotl/tests at 985819d89bec921e919e7e83042a869f04a25974 - axolotl - Gitea

tocmo0nlord/axolotl

Files

History

Keith Stevens 985819d89b Add a chat_template prompt strategy for DPO (#1725 )

* Implementing a basic chat_template strategy for DPO datasets

This mimics the sft chat_template strategy such that users can:
* Specify the messages field
* Specify the per message role and content fields
* speicfy the chosen and rejected fields
* Let the tokenizer construct the raw prompt
* Ensure the chosen and rejected fields don't have any prefix tokens

* Adding additional dpo chat template unittests

* Rename test class

2024-07-21 09:10:42 -04:00

..

ORPO Trainer replacement (#1551 )

2024-04-19 17:25:36 -04:00

fixes to accelerator so that iterable pretraining datasets work (#1759 )

2024-07-17 10:58:38 -04:00

Respect sequence_len in config for type: llama2_chat (#926 )

2023-12-12 09:39:22 -08:00

support for true batches with multipack (#1230 )

2024-02-01 10:18:42 -05:00

prompt_strategies

Add a chat_template prompt strategy for DPO (#1725 )

2024-07-21 09:10:42 -04:00

Add shifted sparse attention (#973 ) [skip-ci]

2024-01-18 10:16:07 -05:00

test_data.py

Fix pretraining with iterable/streaming Dataset (#556 )

2023-09-13 00:16:40 -04:00

test_datasets.py

wrap prepared_ds_path in str() to avoid TypeError in fsspec package (#1548 )

2024-04-21 19:55:20 -04:00

test_dict.py

Pydantic 2.x cfg (#1239 )

2024-02-26 12:24:14 -05:00

test_expand_mask.py

Attention mask and position id fixes for packing (#285 )

2023-08-12 15:14:56 -04:00

test_freeze.py

Train parameters exclusively in specific ranges (#1390 )

2024-03-14 11:05:42 -04:00

test_normalize_config.py

more fixes 20240228 (#1342 ) [skip ci]

2024-02-28 12:57:45 -05:00

test_packed_batch_sampler.py

Switch to parallel FFD bin packing algorithm. (#1619 )

2024-05-23 17:32:14 -04:00

test_packed_dataset.py

Attention mask and position id fixes for packing (#285 )

2023-08-12 15:14:56 -04:00

test_packed_pretraining.py

fixes to accelerator so that iterable pretraining datasets work (#1759 )

2024-07-17 10:58:38 -04:00

test_perplexity.py

Phi-3 conversation format, example training script and perplexity metric (#1582 )

2024-06-04 16:11:56 -04:00

test_prompt_tokenizers.py

fix broken linting (#1541 )

2024-04-19 01:03:04 -04:00

test_prompters.py

Attention mask and position id fixes for packing (#285 )

2023-08-12 15:14:56 -04:00

test_schedulers.py

Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273 )

2024-02-12 21:23:28 -08:00

test_tokenizers.py

Support for additional_special_tokens (#1221 ) [skip ci]

2024-01-31 18:13:13 -05:00

test_validation.py

Add KTO support (#1640 )

2024-05-20 16:05:16 -04:00