yixiaoer / mistral-jax
JAX implementation of the Mistral 7b v0.1 model
☆13 Updated 11 months ago
Alternatives and similar repositories for mistral-jax:
Users who are interested in mistral-jax are comparing it to the libraries listed below.
- flexible meta-learning in jax ☆12 Updated last year
- Accelerated replay buffers in JAX ☆41 Updated 2 years ago
- GPT implementation in Flax ☆18 Updated 3 years ago
- General Modules for JAX ☆64 Updated 2 weeks ago
- Docker containers of baseline agents for the Crafter environment ☆28 Updated 3 years ago
- ☆18 Updated 2 years ago
- An implementation of MuZero in JAX. ☆56 Updated 2 years ago
- ☆20 Updated 9 months ago
- PyTorch Package For Quasimetric Learning ☆41 Updated 4 months ago
- Minimal but scalable implementation of large language models in JAX ☆34 Updated 4 months ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning ☆26 Updated last year
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems" ☆54 Updated 8 months ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU. ☆15 Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 Updated 2 years ago
- POPGym Library in JAX ☆11 Updated 11 months ago
- ☆28 Updated 2 years ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ) ☆25 Updated 9 months ago
- ☆20 Updated 2 years ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ☆48 Updated last month
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024 ☆28 Updated 7 months ago
- Implements the Messenger environment and EMMA model. ☆23 Updated last year
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery ☆14 Updated last year
- ☆47 Updated 2 years ago
- Building blocks for productive research ☆51 Updated last month
- Learning Robust Dynamics Through Variational Sparse Gating ☆21 Updated 2 years ago
- Conservative Q learning in Jax ☆53 Updated 2 years ago
- ☆74 Updated 6 months ago
- ☆18 Updated last month
- A collection of meta-learning algorithms in Jax ☆22 Updated 2 years ago
- ☆73 Updated 4 months ago