fattorib / transformer_shmap

Tensor Parallelism with JAX + Shard Map
11Updated last year

Related projects

Alternatives and complementary repositories for transformer_shmap