yandex-research / swarm

Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"
128Updated 11 months ago

Related projects

Alternatives and complementary repositories for swarm