Sequence Parallelism
Configuration
To enable sequence parallelism, add the following to your configuration file:
# Set to a divisor (> 1) of the number of GPUs available
sequence_parallel_degree: 4  # Split sequences across 4 GPUs

The sequence_parallel_degree should be a divisor of the total number of GPUs. For example:
- With 8 GPUs, valid values would be 2, 4, or 8
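
The divisor rule can be checked before launching a run. The following is a minimal sketch, not part of any library: the helper name valid_sequence_parallel_degrees is hypothetical, and it assumes only the rule stated here, that a valid degree is a divisor of the GPU count greater than 1.

```python
def valid_sequence_parallel_degrees(num_gpus: int) -> list[int]:
    """Return every divisor of num_gpus that is greater than 1.

    Hypothetical helper for illustration; it encodes only the rule that
    sequence_parallel_degree must divide the number of available GPUs.
    """
    if num_gpus < 1:
        raise ValueError("num_gpus must be a positive integer")
    # Check every candidate from 2 up to and including num_gpus itself.
    return [d for d in range(2, num_gpus + 1) if num_gpus % d == 0]


if __name__ == "__main__":
    print(valid_sequence_parallel_degrees(8))  # [2, 4, 8]
```

With a single GPU the list is empty, which matches the requirement that the degree be greater than 1.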