| Author | Commit | Message | Date |
|---|---|---|---|
| Satpal Singh Rathore | c19d060a74 | turn sample_packing on for training (#1438) [skip ci] | 2024-03-26 15:19:04 -04:00 |
| Sebastian Raschka | 6366b0c212 | Fix Gemma 7b qlora.yml (#1405) | 2024-03-14 15:44:38 -04:00 |
| NanoCode012 | 170d4d7092 | chore: enable sample_packing for Gemma (#1351) | 2024-03-01 21:56:22 -05:00 |
| Wing Lian | 2752d5f958 | multipack for gemma (#1313)<br>* multipack for gemma<br>* chore: lint<br>* handle cache_position kwarg in updated llama modeling<br>* add position_ids to rotary embed call for updated llama modeling | 2024-02-21 19:24:21 -05:00 |
| Monk | 9e300aca0c | Adding Google's gemma Model (#1312) | 2024-02-21 12:56:47 -05:00 |