NanoCode012
1115c501b8
Feat: Add Qwen (#894)
* Feat: Add Qwen
* feat: add qwen lora example
* feat: update matrix
* fix: add trust_remote_code
* fix: disable gradient checkpointing
* chore: add warning about gradient checkpointing
* fix: config
* fix: turn off sample packing for this example and reduce seq len
* chore: add comment on seq len
2023-11-26 00:05:01 +09:00