yixiaoer / mistral-jaxLinks
JAX implementation of the Mistral 7b v0.1 model
☆13Updated last year
Alternatives and similar repositories for mistral-jax
Users that are interested in mistral-jax are comparing it to the libraries listed below
Sorting:
- flexible meta-learning in jax☆14Updated last year
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Minimal but scalable implementation of large language models in JAX☆34Updated 7 months ago
- GPT implementation in Flax☆18Updated 3 years ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Einsum-like high-level array sharding API for JAX☆34Updated 10 months ago
- Drop-in environment replacements that make your RL algorithm train faster.☆20Updated 11 months ago
- General Modules for JAX☆66Updated last month
- ☆18Updated 2 years ago
- Building blocks for productive research☆55Updated 4 months ago
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆11Updated last week
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆56Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆15Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated last year
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆55Updated 11 months ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆28Updated 2 years ago
- Flax Implementation of DreamerV3 on Crafter☆14Updated 3 months ago
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery☆15Updated last year
- ☆53Updated 7 months ago
- Pytorch implementation of DreamerV2: Mastering Atari with Discrete World Models, based on the original implementation☆20Updated 2 years ago
- POPGym Library in JAX☆11Updated last year
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆16Updated 2 years ago
- ☆20Updated 2 years ago
- Sandbox environment for generalizable agent research☆25Updated 2 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆20Updated 11 months ago
- Various reinforcement learning algorithms written in Jax + Flax☆24Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learning☆73Updated 9 months ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024☆67Updated last year