AakashKumarNain / mistral_jax
This is a port of Mistral-7B model in JAX
☆30Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for mistral_jax
- Experiment of using Tangent to autodiff triton☆72Updated 9 months ago
- Lightning-like training API for JAX with Flax☆34Updated 6 months ago
- JAX implementation of the Mistral 7b v0.2 model☆33Updated 4 months ago
- LoRA for arbitrary JAX models and functions☆132Updated 8 months ago
- A functional training loops library for JAX☆85Updated 9 months ago
- Neural Networks for JAX☆83Updated last month
- ☆40Updated 4 months ago
- Pytorch-like dataloaders in JAX.☆59Updated last month
- Multidimensional indexing for tensors☆113Updated last year
- Implementation of Flash Attention in Jax☆196Updated 8 months ago
- Einsum-like high-level array sharding API for JAX☆32Updated 4 months ago
- Named Tensors for Legible Deep Learning in JAX☆153Updated this week
- ☆57Updated 2 years ago
- Scalable neural net training via automatic normalization in the modular norm.☆121Updated 3 months ago
- JMP is a Mixed Precision library for JAX.☆187Updated 6 months ago
- Graph neural networks in JAX.☆67Updated 5 months ago
- A simple library for scaling up JAX programs☆127Updated 2 weeks ago
- Flow-matching algorithms in JAX☆77Updated 3 months ago
- If it quacks like a tensor...☆52Updated last week
- Run PyTorch in JAX. 🤝☆200Updated last year
- Tensor Parallelism with JAX + Shard Map☆11Updated last year
- ☆71Updated this week
- An implementation of the Llama architecture, to instruct and delight☆21Updated 3 months ago
- ☆128Updated this week
- ☆105Updated 2 weeks ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆35Updated 4 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆79Updated 9 months ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆43Updated last year
- Multiple dispatch over abstract array types in JAX.☆105Updated last week
- HomebrewNLP in JAX flavour for maintable TPU-Training☆46Updated 10 months ago