fix flaky tests; should be using train loss from final step rather than final avg train loss
This commit is contained in:
@@ -94,7 +94,7 @@ class TestSequenceParallelism:
|
||||
|
||||
check_tensorboard(
|
||||
temp_dir + "/runs",
|
||||
"train/train_loss",
|
||||
"train/loss",
|
||||
threshold,
|
||||
"Train Loss (%s) is too high",
|
||||
)
|
||||
|
||||
Reference in New Issue
Block a user