| Author | Commit | Message | Date |
|---|---|---|---|
| Sunny Liu | 1c14c4a15c | Add hub model id config options to all example yml files (#2196) [skip ci]<br>* added hub model_id in example yml<br>* add hub model id to example yml | 2024-12-17 11:24:30 -05:00 |
| NanoCode012 | 8c3a727f9d | feat: update yml chat_template to specify dataset field (#2001) [skip ci]<br>* feat: update yml chat_template to specify dataset field<br>* feat: replace sharegpt references with chat_template | 2024-10-29 10:26:03 -04:00 |
| Gal Cohen (galco) | 957c956f89 | rename jamba example (#1846) [skip ci]<br>* rename jamba example<br>* feat: change readme<br>Co-authored-by: Gal Cohen <galc@ai21.com> | 2024-08-22 09:22:55 -04:00 |
| Gal Cohen (galco) | 9f917245f6 | feat: add jamba chat_template (#1843)<br>* feat: add jamba chat_template<br>* fix: black<br>* feat: jamba fsdp+qlora<br>Co-authored-by: Gal Cohen <galc@ai21.com> | 2024-08-21 13:37:17 -04:00 |
| Wing Lian | 4fde300e5f | update outputs path so that we can mount workspace to /workspace/data (#1623)<br>* update outputs path so that we can mount workspace to /workspace/data<br>* fix ln order | 2024-05-15 12:44:13 -04:00 |
| Wing Lian | 05b398a072 | fix some of the edge cases for Jamba (#1452)<br>* fix some of the edge cases for Jamba<br>* update requirements for jamba | 2024-03-29 02:38:02 -04:00 |
| Wing Lian | 02af0820f7 | Jamba (#1451)<br>* fixes for larger models<br>* add qlora example for deepspeed<br>* add readme for jamba | 2024-03-28 21:03:22 -04:00 |