Hamel Husain
304ea1b814
Update debugging.md ( #1111 )
2024-01-12 21:07:31 -08:00
Hamel Husain
2dc431078c
Add link on README to Docker Debugging ( #1107 )
...
* add docker debug
* Update docs/debugging.md
Co-authored-by: Wing Lian <wing.lian@gmail.com >
* explain editable install
* explain editable install
* upload new video
* add link to README
* Update README.md
* Update README.md
* chore: lint
* make sure to lint markdown too
---------
Co-authored-by: Wing Lian <wing.lian@gmail.com >
2024-01-12 08:51:35 -05:00
Hamel Husain
6d342b52a4
Add section for debugging with Docker ( #1104 )
...
* add docker debug
* Update docs/debugging.md
Co-authored-by: Wing Lian <wing.lian@gmail.com >
* explain editable install
* explain editable install
* upload new video
---------
Co-authored-by: Wing Lian <wing.lian@gmail.com >
2024-01-11 18:43:33 -08:00
NanoCode012
b432889256
feat: enable trl's autounwrap ( #1060 )
...
* feat: test trl's autounwrap
* fix: add check for adapter
* feat: add config to disable autounwrap
* chore: fix lint
2024-01-11 08:43:41 -05:00
Hamel Husain
54fe07a905
Fix debugging.md ( #1091 )
2024-01-10 21:44:40 -08:00
Hamel Husain
7512c3ad20
Add Debugging Guide ( #1089 )
...
* add debug guide
* add background
* add .gitignore
* Update devtools/dev_sharegpt.yml
Co-authored-by: Wing Lian <wing.lian@gmail.com >
* Update docs/debugging.md
Co-authored-by: Wing Lian <wing.lian@gmail.com >
* simplify example axolotl config
* add additional comments
* add video and TOC
* try jsonc for better md rendering
* style video thumbnail better
* fix footnote
---------
Co-authored-by: Wing Lian <wing.lian@gmail.com >
2024-01-10 20:49:24 -08:00
Wing Lian
f243c2186d
RL/DPO ( #935 )
...
* ipo-dpo trainer
* fix missing abstract method
* chatml template, grad checkpointing kwargs support
* fix steps calc for RL and add dataloader kwargs
* wip to fix dpo and start ppo
* more fixes
* refactor to generalize map fn
* fix dataset loop and handle argilla pref dataset
* set training args
* load reference model on seperate gpu if more than one device
* no auto upload to hub for dpo, don't add lora adapters to ref model for dpo
* fixes for rl training
* support for ipo from yaml
* set dpo training args from the config, add tests
* chore: lint
* set sequence_len for model in test
* add RLHF docs
2024-01-04 18:22:55 -05:00
NanoCode012
9f7e8a971d
feat(doc): add dummyoptim faq fix ( #802 )
2023-10-29 23:06:06 +09:00
Wing Lian
a21935f07a
add to docs ( #703 )
2023-10-19 21:32:30 -04:00
Wing Lian
2aa1f71464
fix pytorch 2.1.0 build, add multipack docs ( #722 )
2023-10-13 08:57:28 -04:00
Maxime
c1382e79b6
Create multi-node.md ( #613 )
...
* Create multi-node.md
* Update multi-node.md
* Update multi-node.md
2023-09-20 22:02:16 -04:00
The Objective Dad
5e2d8a42d9
Adding NCCL Timeout Guide ( #536 )
...
* fixes NCCL_P2P_LEVEL=NVL #429
* adding more insights into verious values of NCCL_P2P_LEVEL
2023-09-08 11:57:47 -04:00