Wing Lian
|
ecc719f5c7
|
add support for base image with uv (#2691)
|
2025-06-02 12:48:55 -07:00 |
|
NanoCode012
|
1c83a1a020
|
feat(doc): clarify minimum pytorch and cuda to use blackwell (#2704) [skip ci]
|
2025-05-22 19:18:27 +07:00 |
|
NanoCode012
|
9eba0ad118
|
chore(doc): update docker tags on doc (#2559) [skip ci]
|
2025-04-25 17:14:48 -04:00 |
|
NanoCode012
|
2c34a4634e
|
feat: add CCE for gemma3, cohere, and cohere2 (#2443)
* feat: add CCE for gemma3 and cohere1/2
* fix: change from relative import to absolute
* feat: add multipack for cohere&cohere2
* chore: improve comments
* fix: add gemma3_text
* feat: add cohere2 example
* fix: cohere forward
* fix: patch for cohere2
* feat: add command r v01 qlora sample
* chore: lint
* feat: upgrade gemma3 and gemma2 patch to use logits_to_keep
* chore: lint
* fix: add deprecate_kwarg decorator
* fix: add cce for gemma3 conditionalgeneration
* fix: gemma3 patch to defer logits calculation
* fix: patch gemma3 if given as model
* fix: remove not working config
* fix: update comments to clarify changes
* feat(doc): add supported models to readme
* fix: address difference in our cohere patch
* feat: add mistral3
* feat: add gemma
* feat(doc): update README to include gemma and mistral3 in supported models
* fix: gemma patch
* fix: import
* fix: gemma patch to be standalone
* fix: gemma3 warn about not support final_logit_softcapping
* feat: add mllama CCE
* chore: add abbireviation to doc
* fix: remove unneeded gemma3 eager warning
* fix: save processor if available
* fix: enable save processor on merge
* fix: wrong env meaning
|
2025-03-26 18:13:51 -04:00 |
|
NanoCode012
|
05dddfc41d
|
feat(doc): add docker images explanation (#2379) [skip ci]
* feat(doc): add docker images explanation
* chore: add link to dockerhub
|
2025-03-05 10:01:00 -05:00 |
|