crowsonkb / torch-dist-utils
Utilities for PyTorch distributed
☆23Updated last year
Related projects ⓘ
Alternatives and complementary repositories for torch-dist-utils
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆17Updated 2 weeks ago
- ☆31Updated 2 months ago
- ☆18Updated last month
- Implementation of a holodeck, written in Pytorch☆17Updated last year
- A JAX implementation of the continuous time formulation of Consistency Models☆83Updated last year
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆30Updated last year
- Latent Diffusion Language Models☆67Updated last year
- Describe the format of image/text datasets☆11Updated 2 years ago
- FID computation in Jax/Flax.☆24Updated 4 months ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆16Updated this week
- ☆19Updated last week
- PyTorch interface for TrueGrad Optimizers☆39Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs☆44Updated 2 weeks ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆86Updated 5 months ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆43Updated last year
- A dashboard for exploring timm learning rate schedulers☆18Updated last year
- Automatically take good care of your preemptible TPUs☆32Updated last year
- A JAX nn library☆21Updated 8 months ago
- ☆27Updated 2 weeks ago
- ☆73Updated 4 months ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆46Updated 10 months ago
- ☆21Updated 5 months ago
- An implementation of the Llama architecture, to instruct and delight☆21Updated 3 months ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆43Updated last month
- Train vision models using JAX and 🤗 transformers☆95Updated 3 weeks ago
- ☆33Updated 6 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated 5 months ago