axolotl/docs at 54dd7abfc11748802404d0945ed3aa47929302b7 - axolotl - Gitea

tocmo0nlord/axolotl

Files

History

salman 54dd7abfc1 Process reward models (#2241 )

* adding model_cfg to set num_labels

* using a num_labels field instead

* linting

* WIP stepwise prompt tokenizer

* this should work?

* trainer working?

* pushing to runpod

* fixing saving

* updating conf

* updating config, adding docs

* adding stepwise supervision docpage

* updating tests

* adding test for dataset

* fixing tests

* linting

* addressing some comments

* adding additional cfg fields support

* updating tests, fixing cfg

* fixing tests

* updating loss

* Update test_process_reward_model_smollm2.py

* updating loss values and seed

* dumb pre-commit

2025-01-29 00:08:33 -05:00

..

dataset-formats

Process reward models (#2241 )

2025-01-29 00:08:33 -05:00

support for true batches with multipack (#1230 )

2024-02-01 10:18:42 -05:00

.gitignore

Bootstrap Hosted Axolotl Docs w/Quarto (#1429 )

2024-03-21 22:28:36 -07:00

amd_hpc.qmd

fix build w pyproject to respect insalled torch version (#2168 )

2024-12-10 16:25:25 -05:00

batch_vs_grad.qmd

Feat: update doc (#1475 ) [skip ci]

2024-04-04 13:43:40 +09:00

config.qmd

Process reward models (#2241 )

2025-01-29 00:08:33 -05:00

dataset_preprocessing.qmd

add docs around pre-processing (#1529 )

2024-04-16 19:45:46 -04:00

debugging.qmd

fix build w pyproject to respect insalled torch version (#2168 )

2024-12-10 16:25:25 -05:00

faq.qmd

Bootstrap Hosted Axolotl Docs w/Quarto (#1429 )

2024-03-21 22:28:36 -07:00

fsdp_qlora.qmd

github urls (#1734 )

2024-07-11 09:19:29 -04:00

input_output.qmd

Multimodal Vision Llama - rudimentary support (#1940 )

2024-10-02 21:02:48 -04:00

lr_groups.qmd

support for custom lr groups for non-embedding modules (#2213 )

2025-01-24 12:56:28 -05:00

mac.qmd

Bootstrap Hosted Axolotl Docs w/Quarto (#1429 )

2024-03-21 22:28:36 -07:00

multi-node.qmd

Bootstrap Hosted Axolotl Docs w/Quarto (#1429 )

2024-03-21 22:28:36 -07:00

multimodal.qmd

Multimodal Vision Llama - rudimentary support (#1940 )

2024-10-02 21:02:48 -04:00

multipack.qmd

Bootstrap Hosted Axolotl Docs w/Quarto (#1429 )

2024-03-21 22:28:36 -07:00

nccl.qmd

Bootstrap Hosted Axolotl Docs w/Quarto (#1429 )

2024-03-21 22:28:36 -07:00

reward_modelling.qmd

Process reward models (#2241 )

2025-01-29 00:08:33 -05:00

rlhf.qmd

feat: add kto example (#2158 ) [skip ci]

2024-12-09 08:17:27 -05:00

torchao.qmd

bump transformers and set roundup_power2_divisions for more VRAM improvements, low bit ao optimizers (#1769 )

2024-07-19 00:47:07 -04:00

unsloth.qmd

fix inference when no chat_template is set, fix unsloth dora check (#2092 )

2024-11-20 14:07:54 -05:00