AashrayAnand / rubiks-cube-reinforcement-learningLinks
Implementation of an RL based agent, which utilizes Q-Learning to develop a policy for effectively solving a 3x3x3 rubiks cube
☆18Updated 6 years ago
Alternatives and similar repositories for rubiks-cube-reinforcement-learning
Users that are interested in rubiks-cube-reinforcement-learning are comparing it to the libraries listed below
Sorting:
- A RL benchmark framework based on real world problem☆12Updated 2 years ago
- ☆45Updated 5 months ago
- Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of the Robotics Institute at Carnegie Mellon University☆100Updated last week
- A flexible and extensible reinforcement learning library for Python, designed for both beginners and researchers.☆18Updated 9 months ago
- A Survey Analyzing Generalization in Deep Reinforcement Learning☆35Updated 11 months ago
- A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.☆16Updated 11 months ago
- Notes on quantization in neural networks☆104Updated last year
- ☆38Updated 9 months ago
- Exploration into the Firefly algorithm in Pytorch☆41Updated 7 months ago
- ☆30Updated last year
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆211Updated last year
- Mini RL Lab☆17Updated last year
- Gradient Boosting Reinforcement Learning (GBRL)☆120Updated 2 months ago
- Implementation of ReWiND, "Language-Guided Rewards Teach Robot Policies without New Demonstrations", from USC / Amazon Robotics☆33Updated last month
- From scratch implementation of a vision language model in pure PyTorch☆243Updated last year
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆177Updated 2 months ago
- Repository of notes, code and notebooks in Python for the book "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew …☆35Updated last month
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆96Updated 9 months ago
- ☆78Updated 3 weeks ago
- ☆199Updated 10 months ago
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)☆121Updated 5 months ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆136Updated 5 months ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆116Updated 2 years ago
- ☆68Updated 5 months ago
- Documentation, notes, links, etc for streams.☆82Updated last year
- A simple web demo with minimal framework using PyTorch and Streamlit to showcase an image classification model.☆12Updated 2 years ago
- Distributed training (multi-node) of a Transformer model☆84Updated last year
- ☆206Updated 9 months ago
- Contrastive Reinforcement Learning☆45Updated last month
- VIT inference in triton because, why not?☆31Updated last year