theOGognf / rl8
A high throughput, end-to-end RL library for infinite horizon tasks.
☆16Updated 3 months ago
Related projects: ⓘ
- Generate Glue Code in seconds to simplify your Nvidia Triton Inference Server Deployments☆13Updated 2 months ago
- A Survey Analyzing Generalization in Deep Reinforcement Learning☆25Updated 8 months ago
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Updated 7 months ago
- A fully configurable Gymnasium compatible Tetris environment☆17Updated last week
- A pure and fast NumPy implementation of Mamba with cache support.☆17Updated 3 months ago
- Explainable Reinforcement Learning (XRL) Resources☆33Updated last week
- LLM Optimize is a proof-of-concept library for doing LLM (large language model) guided blackbox optimization.☆48Updated last year
- A visual tool to interpret and understand PyTorch machine learning models☆14Updated 7 months ago
- A PyTorch implementation of constrained optimization and modeling techniques☆26Updated 4 months ago
- Machine learning library, Distributed training, Deep learning, Reinforcement learning, Models, TensorFlow, PyTorch☆53Updated 2 weeks ago
- A simple library for working with Hugging Face models.☆15Updated last week
- Nomadic is an enterprise-grade toolkit for teams to continuously optimize compound AI systems☆58Updated this week
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆20Updated this week
- Efficient baselines for autocurricula in JAX.☆165Updated 3 weeks ago
- ☆27Updated 2 months ago
- An extensible framework for creating RL environments based on Minetest and Gymnasium☆26Updated this week
- Interpreting how transformers simulate agents performing RL tasks☆62Updated 10 months ago
- Produce intelligence by means of natural selection without objective/reward optimization☆13Updated 2 years ago
- Repo to reproduce the First-Explore paper results☆36Updated last year
- LLama implementations benchmarking framework☆10Updated 10 months ago
- Schedule free optimiser implemented in JAX using Optimistix☆14Updated 3 months ago
- fast + parallel AlphaZero in JAX☆80Updated 5 months ago
- Mini RL Lab☆14Updated 3 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 3 months ago
- This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".☆46Updated last year
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆14Updated 4 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆10Updated 2 months ago
- Standard interface for entity based reinforcement learning environments.☆35Updated 6 months ago
- Example Agents for DIAMBRA Arena Environments☆13Updated 2 weeks ago
- NLP with Rust for Python 🦀🐍☆57Updated 3 months ago