Dan Saunders
|
ad4cd39bcd
|
remove contig
|
2025-09-18 11:55:15 -04:00 |
|
Dan Saunders
|
5c197275ad
|
inplace
|
2025-09-18 11:51:17 -04:00 |
|
Dan Saunders
|
19c91e3675
|
refactor
|
2025-09-18 11:44:21 -04:00 |
|
Dan Saunders
|
2a176e4923
|
fix
|
2025-09-18 11:29:33 -04:00 |
|
Dan Saunders
|
7d867de9b2
|
refactor
|
2025-09-18 11:23:15 -04:00 |
|
Dan Saunders
|
01b6792c2e
|
refactor
|
2025-09-18 11:20:08 -04:00 |
|
Dan Saunders
|
bbf1f14ca4
|
dtype issues
|
2025-09-17 23:52:18 +00:00 |
|
Dan Saunders
|
c6878beb7d
|
simplify
|
2025-09-17 19:15:34 -04:00 |
|
Dan Saunders
|
e62979d11d
|
fix
|
2025-09-17 18:53:07 -04:00 |
|
Dan Saunders
|
d57b9c67c2
|
log
|
2025-09-17 18:52:27 -04:00 |
|
Dan Saunders
|
eaaf16aa00
|
cumulative offsets
|
2025-09-17 18:45:15 -04:00 |
|
Dan Saunders
|
f3b953e222
|
fix?
|
2025-09-17 18:42:10 -04:00 |
|
Dan Saunders
|
7935dc0911
|
dtype fix
|
2025-09-17 18:36:22 -04:00 |
|
Dan Saunders
|
d2b49b2670
|
error msg
|
2025-09-17 18:29:30 -04:00 |
|
Dan Saunders
|
b5cb345ca4
|
fix test
|
2025-09-17 18:24:00 -04:00 |
|
Dan Saunders
|
03d4c2683e
|
fix perf degradation
|
2025-09-17 18:20:37 -04:00 |
|
Dan Saunders
|
fd87eed501
|
minify
|
2025-09-17 16:42:35 -04:00 |
|
Dan Saunders
|
129db67705
|
fix
|
2025-09-17 16:24:29 -04:00 |
|
Dan Saunders
|
38b890a36b
|
fix
|
2025-09-17 16:16:41 -04:00 |
|
Dan Saunders
|
180920c7bf
|
simplify
|
2025-09-17 19:49:18 +00:00 |
|
Dan Saunders
|
d024048d74
|
logs + fix
|
2025-09-17 14:50:49 -04:00 |
|
Dan Saunders
|
98dc945838
|
fix
|
2025-09-17 14:42:53 -04:00 |
|
Dan Saunders
|
108600cd69
|
update config
|
2025-09-17 14:36:24 -04:00 |
|
Dan Saunders
|
0e9387c395
|
fix
|
2025-09-17 14:35:36 -04:00 |
|
Dan Saunders
|
db61e0d4ff
|
fix
|
2025-09-17 14:26:25 -04:00 |
|
Dan Saunders
|
51e565f60a
|
logs
|
2025-09-17 14:15:51 -04:00 |
|
Dan Saunders
|
c774dd0409
|
refactor + fix
|
2025-09-17 14:01:39 -04:00 |
|
Dan Saunders
|
7289e0cb55
|
more logs
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
8d483c11f7
|
more logs
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
9c1829cf57
|
more logs
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
135b09d1de
|
logs, qwen2 support
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
de4344a56e
|
patch
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
7d572b58d1
|
just grouped_mm for now
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
773d7e4291
|
update
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
fef47a5b7c
|
hardening
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
f6ed8ddc01
|
fix
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
556d6448fe
|
fix
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
5c2229721d
|
diag
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
d7de6b0e96
|
grouped_mm
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
3c6648678f
|
numerics
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
5b19a1ea9c
|
improve
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
cfefad1eea
|
fix
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
125e7b5fe6
|
fast path
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
479b6144df
|
tflops
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
68da65cba2
|
update
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
0d689bb421
|
cache, example
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
43ada1278a
|
moe kernels init scaffold
|
2025-09-17 13:44:26 -04:00 |
|
Dan Saunders
|
4065bc14c6
|
Debug log, logging improvements (#3159)
* simplify logging
* remove comment
* progress on debug.log
* add debug-level logger for file log
* simplify
* case insensitivity; 3rd party logging improvements
* simplify
* fix
* tests
* lint
* nits
* nit
* Update tests/test_utils_tee.py
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
* cleanup / comments
* fix
* oops
---------
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
|
2025-09-17 13:27:03 -04:00 |
|
salman
|
e5c427f6de
|
qat doc updates (#3162) [skip-ci]
|
2025-09-17 10:38:15 +01:00 |
|
Wing Lian
|
86d6ee7c05
|
upgrade trl and accelerate (#3161)
* upgrade trl==0.23.0
* upgrade accelerate patch fix
* add hints when using gradient_checkpointing with DPO
* set gradient-checpointing properly
|
2025-09-16 14:53:01 -04:00 |
|