Wing Lian
|
7232cbdeab
|
chore: lint
|
2025-01-14 22:47:48 -05:00 |
|
Wing Lian
|
a5e0671738
|
make sure to use tensorboard to capture loss for checks
|
2025-01-14 22:47:48 -05:00 |
|
Wing Lian
|
b9847553af
|
fix adapter model check
|
2025-01-14 22:47:48 -05:00 |
|
Wing Lian
|
513ec9e03b
|
make sure to use the correct tokenizer
|
2025-01-14 22:47:48 -05:00 |
|
Wing Lian
|
530347856d
|
make sure to set tokenizer from l3 70b and save safetensors
|
2025-01-14 22:47:47 -05:00 |
|
Wing Lian
|
261e4fb619
|
lower lr
|
2025-01-14 22:47:47 -05:00 |
|
Wing Lian
|
158071e95f
|
set lora_dropout explicitly
|
2025-01-14 22:47:47 -05:00 |
|
Wing Lian
|
432f65f5e6
|
make the kd e2e fit in vram for ci and add lora version
|
2025-01-14 22:47:47 -05:00 |
|
Wing Lian
|
1d039f5486
|
rename test files so it gets picked up
|
2025-01-14 22:47:47 -05:00 |
|