AashrayAnand / rubiks-cube-reinforcement-learningLinks
Implementation of an RL based agent, which utilizes Q-Learning to develop a policy for effectively solving a 3x3x3 rubiks cube
☆18Updated 6 years ago
Alternatives and similar repositories for rubiks-cube-reinforcement-learning
Users that are interested in rubiks-cube-reinforcement-learning are comparing it to the libraries listed below
Sorting:
- ☆36Updated 8 months ago
- A RL benchmark framework based on real world problem☆11Updated 2 years ago
- Exploration into the Firefly algorithm in Pytorch☆40Updated 6 months ago
- Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of the Robotics Institute at Carnegie Mellon University☆97Updated this week
- Notes on quantization in neural networks☆97Updated last year
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆115Updated 2 years ago
- ☆44Updated 4 months ago
- A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.☆16Updated 10 months ago
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆178Updated last month
- Implementation of ReWiND, "Language-Guided Rewards Teach Robot Policies without New Demonstrations", from USC / Amazon Robotics☆32Updated 3 weeks ago
- 11-785 Introduction to Deep Learning (IDeeL) website with logistics and select course materials☆63Updated this week
- Reading list for adversarial perspective and robustness in deep reinforcement learning.☆120Updated last month
- Gradient Boosting Reinforcement Learning (GBRL)☆118Updated 3 weeks ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆131Updated 4 months ago
- A flexible and extensible reinforcement learning library for Python, designed for both beginners and researchers.☆18Updated 8 months ago
- Explorations into improving ViTArc with Slot Attention☆42Updated 10 months ago
- making the official triton tutorials actually comprehensible☆53Updated last week
- Fast reinforcement learning 💨☆26Updated last month
- LLaMA 2 implemented from scratch in PyTorch☆347Updated last year
- VIT inference in triton because, why not?☆31Updated last year
- ☆77Updated 3 weeks ago
- Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)☆156Updated last year
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆201Updated last year
- Various reinforcement learning algorithms written in Jax + Flax☆26Updated 2 years ago
- Experiments in Joint Embedding Predictive Architectures (JEPAs).☆40Updated last year
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated 11 months ago
- ☆30Updated last year
- Pytorch (Lightning) implementation of the Mamba model☆29Updated 4 months ago
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)☆117Updated 3 months ago
- A pure and fast NumPy implementation of Mamba with cache support.☆17Updated last year