An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment.
☆103 · Jun 4, 2025 · Updated last year
Alternatives and similar repositories for web-rl-playground
Users who are interested in web-rl-playground are comparing it to the libraries listed below.
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies ☆245 · Oct 31, 2025 · Updated 7 months ago
- A GTK graphical interface for chatting with large language models (LLMs) ☆84 · Dec 15, 2025 · Updated 6 months ago
- Official Repository for Task-Circuit Quantization ☆27 · Jun 1, 2025 · Updated last year
- Yet another coding assistant powered by LLM. ☆16 · Sep 11, 2024 · Updated last year
- MetaC provides a read-eval-print loop (a REPL) and notebook interactive development environment (a NIDE) for C programming. MetaC also … ☆12 · Updated this week
- Project code for training LLMs to write better unit tests + code ☆22 · May 19, 2025 · Updated last year
- Implementation of AlphaZero in PyTorch. ☆10 · Apr 19, 2019 · Updated 7 years ago
- Cookbook for Crafting Good Code ☆57 · Mar 19, 2024 · Updated 2 years ago
- ☆17 · Feb 22, 2025 · Updated last year
- Zero Academic Homepage is a clean, modern and responsive theme for academic personal websites. ☆41 · Jun 6, 2025 · Updated last year
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining" ☆28 · Oct 14, 2025 · Updated 8 months ago
- A fully functional and simple Machine Learning library made entirely from scratch with Python. ☆452 · Dec 28, 2025 · Updated 5 months ago
- ☆45 · Jun 10, 2025 · Updated last year
- LEGO-SLAM: Language-Embedded Gaussian Optimization SLAM ☆75 · May 12, 2026 · Updated last month
- Code for the "Hiding Data In Sound" video ☆12 · Jan 19, 2023 · Updated 3 years ago
- Get beautiful, world-class documentation for any repo ☆461 · Apr 3, 2025 · Updated last year
- A Qwen .5B reasoning model trained on OpenR1-Math-220k ☆14 · Oct 11, 2025 · Updated 8 months ago
- Instant Neural Graphics Primitives from scratch, zero dependencies. Learning by doing. ☆10 · Aug 18, 2023 · Updated 2 years ago
- Defeating the Training-Inference Mismatch via FP16 ☆196 · Nov 14, 2025 · Updated 7 months ago
- A simple sample that shows what you need to package an F# app as a flatpak ☆10 · Jul 5, 2023 · Updated 2 years ago
- A graph based approach to type inference written in F# ☆22 · Apr 22, 2026 · Updated last month
- ☆12 · Feb 4, 2024 · Updated 2 years ago
- An AI character interaction system with emotional modeling and advanced memory management ☆17 · Oct 26, 2024 · Updated last year
- 🐛 Learn to make Centipede in Unity. ☆11 · Jul 14, 2024 · Updated last year
- Demonstration and tutorial notebooks for the Higra library ☆13 · Sep 29, 2025 · Updated 8 months ago
- ☆13 · Mar 10, 2023 · Updated 3 years ago
- Roslyn-based static code analysis for pulumi programs written in C# ☆12 · Jun 29, 2022 · Updated 3 years ago
- LLM4HWDesign Starting Toolkit ☆19 · Oct 4, 2024 · Updated last year
- ☆25 · Dec 13, 2024 · Updated last year
- ☆16 · Nov 16, 2024 · Updated last year
- A comprehensive guide for beginners in the field of data management and artificial intelligence. ☆647 · Apr 8, 2025 · Updated last year
- Official implementation for "CONVIQT: Contrastive Video Quality Estimator" ☆25 · Jun 14, 2022 · Updated 4 years ago
- Welcome to my Transformers tutorial series! In this series, I'll be diving into the powerful Transformer architecture and its implementat… ☆10 · May 3, 2023 · Updated 3 years ago
- 📰 Building News Agents to Summarize News with MCP, Q, and tmux ☆319 · Jul 19, 2025 · Updated 11 months ago
- https://no-ocr.com/about ☆185 · Jun 30, 2025 · Updated 11 months ago
- ComfyUI for Audio ☆42 · Sep 21, 2025 · Updated 8 months ago
- Commodore C16 and Plus/4 for MiSTer ☆15 · Jun 4, 2026 · Updated 2 weeks ago
- ☆24 · Jun 30, 2025 · Updated 11 months ago
- A unified multimodal model toolkit ☆128 · May 18, 2026 · Updated last month