thunder.distributed
Thunder's Distributed Data Parallel.
Convert
Specifies the sharding strategy to be used for FSDP in Thunder.
Specifies how parameters are grouped into buckets for collective communication in FSDP.
Set whether to skip data parallel grad sync.
Reset whether to skip data parallel grad sync.
Get whether to skip data parallel grad sync.
A context manager to skip data parallel grad sync.
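
The set, reset, get, and context-manager entries above describe a flag that is typically used for gradient accumulation: synchronization is skipped on every micro-batch except the last. The following is a minimal sketch of that pattern using only the standard library. It is not Thunder's source, and every name in it (`set_skip_grad_sync`, `reset_skip_grad_sync`, `get_skip_grad_sync`, `skip_grad_sync`, `run_accumulation`) is a hypothetical stand-in chosen for illustration.

```python
from contextlib import contextmanager
from contextvars import ContextVar

# Hypothetical stand-in for the "skip grad sync" flag; not Thunder's actual code.
_skip_grad_sync = ContextVar("skip_grad_sync", default=False)


def set_skip_grad_sync(skip):
    """Set the flag and return a token that can restore the previous value."""
    return _skip_grad_sync.set(skip)


def reset_skip_grad_sync(token):
    """Restore the value the flag had before the matching set call."""
    _skip_grad_sync.reset(token)


def get_skip_grad_sync():
    """Return True if gradient synchronization should currently be skipped."""
    return _skip_grad_sync.get()


@contextmanager
def skip_grad_sync():
    """Skip synchronization inside the block, then restore the old value."""
    token = set_skip_grad_sync(True)
    try:
        yield
    finally:
        reset_skip_grad_sync(token)


def run_accumulation(num_micro_batches):
    """Record, per micro-batch, whether gradients would be synchronized."""
    synced = []
    for step in range(num_micro_batches):
        is_last = step == num_micro_batches - 1
        if is_last:
            # Last micro-batch: flag is off, so the sync would run.
            synced.append(not get_skip_grad_sync())
        else:
            with skip_grad_sync():
                synced.append(not get_skip_grad_sync())
    return synced
```

In this sketch the context manager is just the set and reset calls paired in a `try`/`finally`, which is why the three lower-level functions and the context manager can be listed side by side.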
Convert specified modules into column-wise parallel ones.
Convert specified modules into row-wise parallel ones.
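
To illustrate the difference between the two layouts, here is a pure-Python sketch of a matrix-vector product `y = x @ w` split across simulated ranks. It assumes the common convention in which column-wise parallelism shards the output dimension (results are concatenated) and row-wise parallelism shards the input dimension (partial results are summed). The helper names are hypothetical, the dimensions are assumed to divide evenly by the number of ranks, and nothing here reflects how Thunder implements the conversion.

```python
def matvec(x, w):
    """Compute x @ w for a vector x (length n) and a matrix w (n rows, m columns)."""
    return [sum(x[i] * w[i][j] for i in range(len(x))) for j in range(len(w[0]))]


def column_parallel_matvec(x, w, world_size):
    """Each simulated rank holds a slice of w's columns; outputs are concatenated."""
    chunk = len(w[0]) // world_size  # assumes even divisibility
    out = []
    for rank in range(world_size):
        cols = range(rank * chunk, (rank + 1) * chunk)
        w_shard = [[row[j] for j in cols] for row in w]
        out.extend(matvec(x, w_shard))  # stands in for an all-gather
    return out


def row_parallel_matvec(x, w, world_size):
    """Each simulated rank holds a slice of w's rows and of x; partials are summed."""
    chunk = len(w) // world_size  # assumes even divisibility
    out = [0] * len(w[0])
    for rank in range(world_size):
        lo, hi = rank * chunk, (rank + 1) * chunk
        partial = matvec(x[lo:hi], w[lo:hi])
        out = [a + b for a, b in zip(out, partial)]  # stands in for an all-reduce
    return out


x = [1, 2, 3, 4]
w = [[1, 0, 2, 1], [0, 1, 1, 3], [2, 1, 0, 0], [1, 1, 1, 1]]
reference = matvec(x, w)
```

Both sharded variants reproduce `reference`; they differ only in which dimension is split and in the collective needed to reassemble the result.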