axolotl/tests at 203816f7b4de020c40708e4e61847b0716189380 - axolotl - Gitea

tocmo0nlord/axolotl

Files

History

Adam Brusselback 55cc214c76 Add flexible configuration options for chat_template dataset training (#1756 )

* Add flexible configuration options for chat dataset training

- Introduce roles_to_train parameter to set training labels by role
- Add train_on_eos option to configure training on end-of-sequence tokens
- Implement per-message training configuration in dataset
- Allow fine-grained control over training specific portions of messages
- Add message_field_training and message_field_training_detail settings
- Implement mapping between dataset character offsets and tokenized prompt
- Enhance test suite to cover new functionality

* Fix missing field inits, things weren't working from yaml.

* Add flexible configuration options for chat dataset training

- Introduce roles_to_train parameter to set training labels by role
- Add train_on_eos option to configure training on end-of-sequence tokens
- Implement per-message training configuration in dataset
- Allow fine-grained control over training specific portions of messages
- Add message_field_training and message_field_training_detail settings
- Implement mapping between dataset character offsets and tokenized prompt
- Enhance test suite to cover new functionality

* Fix missing field inits, things weren't working from yaml.

* chore: lint

* Revert test repo back to NousResearch after opening PR to fix the tokenizer_config.json.

---------

Co-authored-by: Wing Lian <wing.lian@gmail.com>

2024-07-28 21:48:57 -04:00

..

ORPO Trainer replacement (#1551 )

2024-04-19 17:25:36 -04:00

Bump deepspeed 20240727 (#1790 )

2024-07-27 10:24:11 -04:00

Respect sequence_len in config for type: llama2_chat (#926 )

2023-12-12 09:39:22 -08:00

support for true batches with multipack (#1230 )

2024-02-01 10:18:42 -05:00

prompt_strategies

Add flexible configuration options for chat_template dataset training (#1756 )

2024-07-28 21:48:57 -04:00

Add shifted sparse attention (#973 ) [skip-ci]

2024-01-18 10:16:07 -05:00

test_data.py

Fix pretraining with iterable/streaming Dataset (#556 )

2023-09-13 00:16:40 -04:00

test_datasets.py

wrap prepared_ds_path in str() to avoid TypeError in fsspec package (#1548 )

2024-04-21 19:55:20 -04:00

test_dict.py

Pydantic 2.x cfg (#1239 )

2024-02-26 12:24:14 -05:00

test_expand_mask.py

Attention mask and position id fixes for packing (#285 )

2023-08-12 15:14:56 -04:00

test_freeze.py

Train parameters exclusively in specific ranges (#1390 )

2024-03-14 11:05:42 -04:00

test_normalize_config.py

more fixes 20240228 (#1342 ) [skip ci]

2024-02-28 12:57:45 -05:00

test_packed_batch_sampler.py

Switch to parallel FFD bin packing algorithm. (#1619 )

2024-05-23 17:32:14 -04:00

test_packed_dataset.py

Attention mask and position id fixes for packing (#285 )

2023-08-12 15:14:56 -04:00

test_packed_pretraining.py

fixes to accelerator so that iterable pretraining datasets work (#1759 )

2024-07-17 10:58:38 -04:00

test_perplexity.py

Phi-3 conversation format, example training script and perplexity metric (#1582 )

2024-06-04 16:11:56 -04:00

test_prompt_tokenizers.py

fix broken linting (#1541 )

2024-04-19 01:03:04 -04:00

test_prompters.py

Attention mask and position id fixes for packing (#285 )

2023-08-12 15:14:56 -04:00

test_schedulers.py

Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273 )

2024-02-12 21:23:28 -08:00

test_tokenizers.py

Support for additional_special_tokens (#1221 ) [skip ci]

2024-01-31 18:13:13 -05:00

test_validation.py

Add KTO support (#1640 )

2024-05-20 16:05:16 -04:00