theOGognf / rl8
A high throughput, end-to-end RL library for infinite horizon tasks.
☆18 · Updated 5 months ago
Related projects
Alternatives and complementary repositories for rl8
- Alpha-Zero Connect Four NN trained via self play ☆13 · Updated last month
- A Survey Analyzing Generalization in Deep Reinforcement Learning ☆29 · Updated 3 weeks ago
- Generate Glue Code in seconds to simplify your Nvidia Triton Inference Server Deployments ☆15 · Updated 4 months ago
- Zephyr is a declarative neural network library on top of JAX allowing for easy and fast neural network designing, creation, and manipulat… ☆25 · Updated this week
- Evolutionary Search for expert-level performance on any task with environmental feedback ☆14 · Updated 9 months ago
- Code repository for Liquid Time-stochasticity networks (LTSs) ☆20 · Updated last year
- ☆15 · Updated this week
- RAG Agent for the ARC AGI Challenge ☆21 · Updated 4 months ago
- Explainable Reinforcement Learning (XRL) Resources ☆33 · Updated last month
- Schedule free optimiser implemented in JAX using Optimistix ☆14 · Updated 5 months ago
- ☆53 · Updated last week
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3 ☆25 · Updated this week
- Reinforcement Learning for Stock Market Prediction ☆60 · Updated 6 months ago
- A pure and fast NumPy implementation of Mamba with cache support. ☆17 · Updated 5 months ago
- Portfolio Management for Everyone ☆12 · Updated 10 months ago
- ☆15 · Updated this week
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆58 · Updated last year
- A simple library for working with Hugging Face models. ☆15 · Updated 2 months ago
- A framework for creating rich, 3D, Minecraft-like environments for AI research based on Minetest ☆34 · Updated this week
- PyTorch code for DeepTime: Deep Time-Index Meta-Learning for Non-Stationary Time-Series Forecasting ☆11 · Updated last year
- LLama implementations benchmarking framework ☆12 · Updated last year
- Modified version of the LeagueSandbox project which relies on a Redis server to accept actions and send observations. Intended for reinfo… ☆10 · Updated 3 years ago
- Repo to reproduce the First-Explore paper results ☆36 · Updated 2 weeks ago
- A fully configurable Gymnasium compatible Tetris environment ☆20 · Updated last week
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆23 · Updated last week
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆39 · Updated 2 years ago
- Genetics for Language Models ☆12 · Updated 4 months ago
- fast + parallel AlphaZero in JAX ☆84 · Updated 7 months ago
- Automatic Differentiation for Gradient Boosted Decision Trees. ☆11 · Updated 2 years ago
- Repository for portfolio management using Pytorch, SQLAlchemy and XArray. The management is done using the reinforcement learning algorit… ☆26 · Updated 3 years ago