yixiaoer / mistral-jax
JAX implementation of the Mistral 7b v0.1 model
☆13Updated last year
Alternatives and similar repositories for mistral-jax:
Users that are interested in mistral-jax are comparing it to the libraries listed below
- flexible meta-learning in jax☆13Updated last year
- Drop-in environment replacements that make your RL algorithm train faster.☆20Updated 10 months ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- High quality implementations of imitation and inverse reinforcement learning algorithms☆15Updated last month
- ☆18Updated 2 years ago
- GPT implementation in Flax☆18Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆15Updated 3 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- ☆30Updated 5 months ago
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery☆14Updated last year
- Minimal but scalable implementation of large language models in JAX☆34Updated 6 months ago
- General Modules for JAX☆64Updated last month
- Reinforcement Learning inside a 3D soccer simulation☆26Updated 7 months ago
- ☆13Updated 9 months ago
- POPGym Library in JAX☆11Updated last year
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆27Updated 6 months ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Einsum-like high-level array sharding API for JAX☆34Updated 9 months ago
- ☆77Updated last month
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆26Updated last year
- ☆20Updated 2 years ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆22Updated 2 weeks ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆14Updated 2 years ago
- An Open-Ended Agentic Simulator☆48Updated 9 months ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆50Updated last week
- ☆22Updated last month
- LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations☆16Updated 5 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆10Updated last year
- ☆78Updated 6 months ago