Implementation of an RL based agent, which utilizes Q-Learning to develop a policy for effectively solving a 3x3x3 rubiks cube
☆18Mar 12, 2019Updated 7 years ago
Alternatives and similar repositories for rubiks-cube-reinforcement-learning
Users that are interested in rubiks-cube-reinforcement-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Aug 3, 2023Updated 2 years ago
- OpenAi gym environment for the Rubik's Cube (3x3x3).☆14Sep 1, 2022Updated 3 years ago
- simple grpo☆12May 28, 2025Updated 10 months ago
- Code repository dedicated to experimenting and research with tiny reasoning language model☆49Nov 24, 2025Updated 4 months ago
- Real Time STT model with GPU by Whisper and VAD(Voice Activity Detector) model☆15Jul 15, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [DICTA 2020] Detection and Tracking of Moving Objects using Recursive Cluster Change Detection☆24Nov 29, 2022Updated 3 years ago
- time-bomb.nvim is a minimal Neovim plugin for timers and Pomodoro cycles to boost developer focus. Features floating timers, 9 progress b…☆32Mar 12, 2026Updated 2 weeks ago
- a minimalistic todo app☆10May 10, 2023Updated 2 years ago
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Feb 12, 2023Updated 3 years ago
- ☆13Mar 7, 2024Updated 2 years ago
- Command line tool for converting images to ASCII art☆20Oct 7, 2025Updated 5 months ago
- JAX implementation of GPTQ quantization algorithm☆10Jul 19, 2023Updated 2 years ago
- Just another static site generator -> あなたが恋しいです。☆11Dec 5, 2023Updated 2 years ago
- A very-minimal command-line parser☆20Jul 28, 2025Updated 8 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Basic C++ Hidden Markov Model functionality implementation with state prediction estimation☆15Apr 29, 2013Updated 12 years ago
- Reinforcement learning in pure JAX.☆13Dec 24, 2025Updated 3 months ago
- replacement of AdamW and Lion optimizer for LLMs☆13May 28, 2023Updated 2 years ago
- ☆15Apr 26, 2025Updated 11 months ago
- Argument execute is xargs alternative that supports arguments ordering☆16Oct 5, 2024Updated last year
- ☆12Mar 19, 2022Updated 4 years ago
- Produce intelligence by means of natural selection without objective/reward optimization☆15Sep 29, 2021Updated 4 years ago
- ☆12Feb 17, 2026Updated last month
- A concurrent pipeline example, written in Rust☆16Aug 7, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Simple Algorithms and Data structures implemented in Zig☆11May 15, 2022Updated 3 years ago
- A hello world app made in FastAPI and HTMX☆10Aug 26, 2023Updated 2 years ago
- Blog posts, mostly about Rust.☆13Jun 7, 2021Updated 4 years ago
- Sudoku game written in Zig, using SDL for graphics.☆12Updated this week
- Neovim Dotfiles☆12Sep 24, 2025Updated 6 months ago
- A Python library for management of configuration settings in TOML format across various applications.☆13Jul 14, 2024Updated last year
- ☆35May 16, 2025Updated 10 months ago
- Blazingly Fast Pseudo Random Number Generator written in Rust☆22Sep 21, 2024Updated last year
- An Awesome List of AdventOfCode Participants☆13Dec 14, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Convert between pulldown parser events for various markup formats☆24Dec 22, 2025Updated 3 months ago
- tmux cheatsheet in terminal friendly text format.☆10Jan 7, 2022Updated 4 years ago
- ☆16Jan 5, 2026Updated 2 months ago
- A working AlphaZero implementation that's simple enough to be able to understand what's going on at a quick glance, without sacrificing t…☆14Mar 23, 2023Updated 3 years ago
- a simple text editor☆11Aug 24, 2025Updated 7 months ago
- Efficient implementation of DeepSeek Ops (Blockwise FP8 GEMM, MoE, and MLA) for AMD Instinct MI300X☆75Feb 11, 2026Updated last month
- Web app for creating animated illustrations with 3D JavaScript engine Zdog☆11Mar 17, 2022Updated 4 years ago