AashrayAnand / rubiks-cube-reinforcement-learningLinks
Implementation of an RL based agent, which utilizes Q-Learning to develop a policy for effectively solving a 3x3x3 rubiks cube
☆18Updated 6 years ago
Alternatives and similar repositories for rubiks-cube-reinforcement-learning
Users that are interested in rubiks-cube-reinforcement-learning are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of the Robotics Institute at Carnegie Mellon University☆102Updated 2 months ago
- ☆39Updated 11 months ago
- A RL benchmark framework based on real world problem☆13Updated 2 years ago
- Various reinforcement learning algorithms written in Jax + Flax☆26Updated 2 years ago
- Gym env for Slay the Spire☆14Updated 11 months ago
- A high throughput, end-to-end RL library for infinite-horizon tasks.☆21Updated last month
- From scratch implementation of a vision language model in pure PyTorch☆252Updated last year
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆224Updated last year
- PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research …☆206Updated 4 months ago
- Mini RL Lab☆17Updated last year
- Notes on quantization in neural networks☆113Updated 2 years ago
- Reading list for adversarial perspective and robustness in deep reinforcement learning.☆126Updated 4 months ago
- 11-785 Introduction to Deep Learning (IDeeL) website with logistics and select course materials☆71Updated last week
- Documented and Unit Tested educational Deep Learning framework with Autograd from scratch.☆122Updated last year
- A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.☆16Updated last year
- Exploration into the Firefly algorithm in Pytorch☆41Updated 10 months ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆147Updated 7 months ago
- Reproduction of DeepSeek-R1☆244Updated 8 months ago
- Gradient Boosting Reinforcement Learning (GBRL)☆130Updated last month
- A pure and fast NumPy implementation of Mamba with cache support.☆17Updated last year
- Repository of notes, code and notebooks in Python for the book "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew …☆35Updated 3 months ago
- making the official triton tutorials actually comprehensible☆82Updated 3 months ago
- Titans - Learning to Memorize at Test Time☆45Updated 11 months ago
- Implementation of ReWiND, "Language-Guided Rewards Teach Robot Policies without New Demonstrations", from USC / Amazon Robotics☆35Updated 4 months ago
- Contrastive Reinforcement Learning☆51Updated 2 weeks ago
- Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)☆161Updated 3 weeks ago
- Distributed training (multi-node) of a Transformer model☆89Updated last year
- ☆45Updated 7 months ago
- ☆11Updated 5 years ago
- General multi-task deep RL Agent☆185Updated last year