General multi-task deep RL Agent
☆186Jun 6, 2024Updated last year
Alternatives and similar repositories for jat
Users that are interested in jat are comparing it to the libraries listed below
Sorting:
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆66Feb 25, 2025Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆23Jul 30, 2024Updated last year
- Drop-in environment replacements that make your RL algorithm train faster.☆21Jun 19, 2024Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents.☆15Jan 3, 2023Updated 3 years ago
- Interface for GenAI-Arena [NeurIPS24]☆17Feb 27, 2024Updated 2 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆61Oct 6, 2024Updated last year
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆20Jun 16, 2024Updated last year
- Code for the paper "Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation"☆33Dec 1, 2024Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Feb 20, 2026Updated last week
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆133Dec 3, 2024Updated last year
- ☆13Oct 5, 2025Updated 4 months ago
- Plan✕ is a platform for creating and publishing digital planning services☆17Updated this week
- A web based platform for collecting human actions in reinforcement learning environments☆31Sep 10, 2025Updated 5 months ago
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆23May 6, 2025Updated 9 months ago
- ☆56Nov 6, 2024Updated last year
- Generative cellular automaton-like learning environments for RL.☆20Jan 30, 2025Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆78Aug 17, 2024Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆122Aug 22, 2024Updated last year
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!☆261Oct 31, 2025Updated 4 months ago
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated last year
- Linear Attention Sequence Parallelism (LASP)☆88Jun 4, 2024Updated last year
- Reaching LLaMA2 Performance with 0.1M Dollars☆988Jul 23, 2024Updated last year
- ☆27Jan 22, 2025Updated last year
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆100Jul 5, 2023Updated 2 years ago
- ☆12Jul 8, 2024Updated last year
- RL Environments in JAX 🌍☆864May 30, 2025Updated 9 months ago
- Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation☆76Aug 2, 2023Updated 2 years ago
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,399Nov 29, 2024Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆282Jul 11, 2024Updated last year
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆675Aug 22, 2025Updated 6 months ago
- ☆13Apr 7, 2024Updated last year
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- ☆15Aug 19, 2025Updated 6 months ago
- ☆12Nov 5, 2024Updated last year
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆14May 28, 2025Updated 9 months ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 4 months ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year