| Author | Commit | Message | Date |
|---|---|---|---|
| Wing Lian | 4a0ab11fcf | chore: lint | 2025-01-13 14:05:56 -05:00 |
| Wing Lian | 9db5072407 | make sure to use tensorboard to capture loss for checks | 2025-01-13 13:56:16 -05:00 |
| Wing Lian | 42d3e36a6f | fix adapter model check | 2025-01-13 13:56:15 -05:00 |
| Wing Lian | b12d93bedf | make sure to use the correct tokenizer | 2025-01-13 13:56:15 -05:00 |
| Wing Lian | 08ec9c0e5b | make sure to set tokenizer from l3 70b and save safetensors | 2025-01-13 13:56:15 -05:00 |
| Wing Lian | 9abac55f92 | lower lr | 2025-01-13 13:56:15 -05:00 |
| Wing Lian | 800e7fa41e | set lora_dropout explicitly | 2025-01-13 13:56:15 -05:00 |
| Wing Lian | 5a1c1b82d4 | make the kd e2e fit in vram for ci and add lora version | 2025-01-13 13:56:15 -05:00 |
| Wing Lian | efb3f70d38 | rename test files so it gets picked up | 2025-01-13 13:56:15 -05:00 |