axolotl/tests at 9135b9e2aa30614d4bba1db9b06307cfa5691deb - axolotl - Gitea

tocmo0nlord/axolotl

Files

History

Wing Lian 7523d1f557 DPO cleanup (#1126 )

* cleanup dpo to be a little more extensible, add zephyr/nectar strategy

* fix eos slash

* support for eval split

* fix kwargs

* handle empty evals

* don't load peft model for dpo

* ensure dpo traning args gets bf16 for peft if applicable

* fix duplicate kwargs for bf16

* make sure to respect the configured lr scheduler

* supprt trainer callback to push config to wandb

* set dataloader preload args

* ensure that we are loading the lora when merging

* Update src/axolotl/utils/data.py

Co-authored-by: Agus <agustin.piqueres@gmail.com>

* support local datasets for dpo

Co-authored-by: Agus <agustin.piqueres@gmail.com>

* chore: lint

* dpo/kto/ipo smoke tests w lora, simplify dpo dataset type names

* add split to dpo tests

* fix rebase/merging error

* handle edge case w logging

* use accelerator for dpo datasets so it doesn't break the logger

* missing args

* validate checkpoint is an adapter for now

* log warning when dataset strategy is not loadable

---------

Co-authored-by: Agus <agustin.piqueres@gmail.com>

2024-01-23 00:40:37 -05:00

..

add gptneox embeddings, fix phi2 inputs, also fix the casting (#1083 )

2024-01-10 22:32:43 -05:00

DPO cleanup (#1126 )

2024-01-23 00:40:37 -05:00

Respect sequence_len in config for type: llama2_chat (#926 )

2023-12-12 09:39:22 -08:00

Multipack simplify for Mixtral (#1142 )

2024-01-18 16:23:49 -05:00

prompt_strategies

Feat(test): Add tests for alpaca chatml prompt tokenizer (#1088 )

2024-01-23 13:30:26 +09:00

Add shifted sparse attention (#973 ) [skip-ci]

2024-01-18 10:16:07 -05:00

test_data.py

Fix pretraining with iterable/streaming Dataset (#556 )

2023-09-13 00:16:40 -04:00

test_dict.py

fix DefaultDict.__or__

2023-08-13 01:15:50 +00:00

test_expand_mask.py

Attention mask and position id fixes for packing (#285 )

2023-08-12 15:14:56 -04:00

test_normalize_config.py

set fp16 to false if bf16, update bf16: auto in example YAMLs (#1122 ) [skip ci]

2024-01-22 18:44:01 -05:00

test_packed_dataset.py

Attention mask and position id fixes for packing (#285 )

2023-08-12 15:14:56 -04:00

test_packed_pretraining.py

streaming multipack for pretraining dataset (#959 )

2024-01-05 22:13:21 -05:00

test_prompt_tokenizers.py

fix mistral prompt assembly (#982 )

2023-12-21 08:00:55 -08:00

test_prompters.py

Attention mask and position id fixes for packing (#285 )

2023-08-12 15:14:56 -04:00

test_tokenizers.py

Feat: Warns to add to modules_to_save when adding tokens or switching special_tokens (#787 )

2023-12-22 21:49:07 +09:00

test_validation.py

Deprecate max packed sequence len (#1141 )

2024-01-20 05:11:50 -05:00