yixiaoer / mistral-jaxLinks
JAX implementation of the Mistral 7b v0.1 model
☆13Updated last year
Alternatives and similar repositories for mistral-jax
Users that are interested in mistral-jax are comparing it to the libraries listed below
Sorting:
- flexible meta-learning in jax☆14Updated last year
- GPT implementation in Flax☆18Updated 3 years ago
- Minimal but scalable implementation of large language models in JAX☆35Updated 7 months ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- General Modules for JAX☆65Updated 2 months ago
- ☆31Updated 7 months ago
- Building blocks for productive research☆59Updated 4 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- ☆20Updated 2 years ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- ☆28Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE)☆29Updated last year
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆28Updated 2 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆20Updated last year
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆30Updated last year
- ☆22Updated 2 months ago
- PyTorch Package For Quasimetric Learning☆42Updated 7 months ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents.☆14Updated 2 years ago
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆11Updated last month
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆56Updated 2 years ago
- Sandbox environment for generalizable agent research☆25Updated 2 years ago
- ☆36Updated 2 years ago
- ☆54Updated 7 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆18Updated 7 months ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆17Updated 2 years ago
- Offline RL experiments☆15Updated 2 years ago
- Einsum-like high-level array sharding API for JAX☆35Updated 11 months ago
- Scaling scaling laws with board games.☆49Updated last year