axolotl/tests/e2e at 54dd7abfc11748802404d0945ed3aa47929302b7 - axolotl - Gitea

tocmo0nlord/axolotl

Files

History

salman 54dd7abfc1 Process reward models (#2241 )

* adding model_cfg to set num_labels

* using a num_labels field instead

* linting

* WIP stepwise prompt tokenizer

* this should work?

* trainer working?

* pushing to runpod

* fixing saving

* updating conf

* updating config, adding docs

* adding stepwise supervision docpage

* updating tests

* adding test for dataset

* fixing tests

* linting

* addressing some comments

* adding additional cfg fields support

* updating tests, fixing cfg

* fixing tests

* updating loss

* Update test_process_reward_model_smollm2.py

* updating loss values and seed

* dumb pre-commit

2025-01-29 00:08:33 -05:00

..

CLI cleanup and documentation (#2244 )

2025-01-13 17:55:29 +00:00

support for latest transformers release 4.48.1 (#2256 )

2025-01-23 21:17:57 -05:00

removing 2.3.1 (#2294 )

2025-01-28 23:23:44 -05:00

support for latest transformers release 4.48.1 (#2256 )

2025-01-23 21:17:57 -05:00

__init__.py

missing dunder-init

2023-11-06 18:33:01 -05:00

.gitignore

Support Sample packing for phi arch (#586 )

2023-09-15 15:46:54 -04:00

test_dpo.py

CLI cleanup and documentation (#2244 )

2025-01-13 17:55:29 +00:00

test_embeddings_lr.py

CLI cleanup and documentation (#2244 )

2025-01-13 17:55:29 +00:00

test_falcon.py

CLI cleanup and documentation (#2244 )

2025-01-13 17:55:29 +00:00

test_imports.py

Bump deepspeed 20240727 (#1790 )

2024-07-27 10:24:11 -04:00

test_llama_pretrain.py

Pretrain multipack (#2278 )

2025-01-24 12:55:20 -05:00

test_llama_vision.py

CLI cleanup and documentation (#2244 )

2025-01-13 17:55:29 +00:00

test_llama.py

CLI cleanup and documentation (#2244 )

2025-01-13 17:55:29 +00:00

test_load_model.py

Refactor func load_model to class ModelLoader (#1909 )

2024-10-25 09:06:56 -04:00

test_lora_llama.py

CLI cleanup and documentation (#2244 )

2025-01-13 17:55:29 +00:00

test_mamba.py

CLI cleanup and documentation (#2244 )

2025-01-13 17:55:29 +00:00

test_mistral.py

CLI cleanup and documentation (#2244 )

2025-01-13 17:55:29 +00:00

test_mixtral.py

CLI cleanup and documentation (#2244 )

2025-01-13 17:55:29 +00:00

test_optimizers.py

CLI cleanup and documentation (#2244 )

2025-01-13 17:55:29 +00:00

test_packing_loss.py

CLI cleanup and documentation (#2244 )

2025-01-13 17:55:29 +00:00

test_phi.py

CLI cleanup and documentation (#2244 )

2025-01-13 17:55:29 +00:00

test_process_reward_model_smollm2.py

Process reward models (#2241 )

2025-01-29 00:08:33 -05:00

test_qwen.py

remove the bos token from dpo outputs (#1733 ) [skip ci]

2024-11-15 19:09:20 -05:00

test_reward_model_smollm2.py

Process reward models (#2241 )

2025-01-29 00:08:33 -05:00

utils.py

removing 2.3.1 (#2294 )

2025-01-28 23:23:44 -05:00