axolotl

Wing Lian 4d09b42ee3 plain input/output prompt strategy w/o chat templates (#1346)

* plain input/output prompt strategy w/o chat templates

* disable duplicate code check

* make sure to add an eos/eot token to the end of the output so it will stop

* multi turn segment support and test

2024-03-04 16:25:16 -05:00

core

add gptneox embeddings, fix phi2 inputs, also fix the casting (#1083)

2024-01-10 22:32:43 -05:00

e2e

run tests again on Modal (#1289) [skip ci]

2024-02-29 14:26:26 -05:00

fixtures

Respect sequence_len in config for type: llama2_chat (#926)

2023-12-12 09:39:22 -08:00

monkeypatch

support for true batches with multipack (#1230)

2024-02-01 10:18:42 -05:00

prompt_strategies

plain input/output prompt strategy w/o chat templates (#1346)

2024-03-04 16:25:16 -05:00

utils

Add shifted sparse attention (#973) [skip-ci]

2024-01-18 10:16:07 -05:00

test_data.py

Fix pretraining with iterable/streaming Dataset (#556)

2023-09-13 00:16:40 -04:00

test_dict.py

Pydantic 2.x cfg (#1239)

2024-02-26 12:24:14 -05:00

test_expand_mask.py

Attention mask and position id fixes for packing (#285)

2023-08-12 15:14:56 -04:00

test_normalize_config.py

more fixes 20240228 (#1342) [skip ci]

2024-02-28 12:57:45 -05:00

test_packed_batch_sampler.py

support for true batches with multipack (#1230)

2024-02-01 10:18:42 -05:00

test_packed_dataset.py

Attention mask and position id fixes for packing (#285)

2023-08-12 15:14:56 -04:00

test_packed_pretraining.py

Pretrain transforms (#1261)

2024-02-06 00:37:03 -05:00

test_prompt_tokenizers.py

Pydantic 2.x cfg (#1239)

2024-02-26 12:24:14 -05:00

test_prompters.py

Attention mask and position id fixes for packing (#285)

2023-08-12 15:14:56 -04:00

test_schedulers.py

Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273)

2024-02-12 21:23:28 -08:00

test_tokenizers.py

Support for additional_special_tokens (#1221) [skip ci]

2024-01-31 18:13:13 -05:00

test_validation.py

fix for protected model_ namespace w pydantic (#1345)

2024-02-28 15:07:49 -05:00