Wing Lian
00dee05fc6
support flattening/packing for GRPO ( #3552 )
...
* support flattening/packing for GRPO
* more flattening
* fix tests
* improve dead vllm handling
* refactor out process handling for vllm serve and move bench flattening tests to gpu tests
* add validation for flattening with liger
* isolate batch flattening test
* flaky test
2026-03-28 13:15:54 -04:00
..
2026-03-21 22:46:10 -04:00
2026-03-23 02:26:10 -04:00
2026-03-22 13:53:19 -04:00
2026-03-22 13:53:19 -04:00
2026-03-28 13:15:54 -04:00
2023-11-06 18:33:01 -05:00
2023-09-15 15:46:54 -04:00
2026-01-27 17:08:24 -05:00
2026-03-21 22:46:10 -04:00
2026-03-24 15:40:05 -04:00
2026-03-21 22:46:10 -04:00
2026-01-27 17:08:24 -05:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2026-01-27 17:08:24 -05:00
2026-01-27 17:08:24 -05:00
2025-08-23 23:37:33 -04:00
2026-01-27 17:08:24 -05:00
2026-01-27 17:08:24 -05:00
2026-03-16 23:47:00 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2026-01-27 17:08:24 -05:00
2025-08-23 23:37:33 -04:00
2026-03-21 22:46:10 -04:00
2026-03-21 22:46:10 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2025-11-07 08:21:20 -05:00
2025-08-23 23:37:33 -04:00
2025-07-14 20:11:11 -04:00
2026-03-05 13:40:45 -05:00
2026-03-21 22:46:10 -04:00
2025-08-23 23:37:33 -04:00
2025-08-23 23:37:33 -04:00
2026-01-27 17:08:24 -05:00
2025-08-23 23:37:33 -04:00
2026-02-10 17:44:17 +07:00
2025-08-26 09:30:04 -04:00
2026-03-22 13:54:03 -04:00