An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment.
☆99 Jun 4, 2025 Updated 11 months ago
Alternatives and similar repositories for web-rl-playground
Users that are interested in web-rl-playground are comparing it to the libraries listed below.
- safety analysis for hard-to-specify failures ☆30 Apr 19, 2026 Updated 2 weeks ago
- Yet another coding assistant powered by LLM. ☆16 Sep 11, 2024 Updated last year
- Memory-Based Meta-Learning on Non-Stationary Distributions ☆17 Mar 14, 2024 Updated 2 years ago
- Project code for training LLMs to write better unit tests + code ☆21 May 19, 2025 Updated 11 months ago
- Cookbook for Crafting Good Code ☆57 Mar 19, 2024 Updated 2 years ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining" ☆27 Oct 14, 2025 Updated 6 months ago
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning ☆289 Sep 25, 2025 Updated 7 months ago
- Spitzers Architecture School Urban Lab for Unit 26. This repository explores designing and codifying urban systems from the bottom up in … ☆14 Mar 29, 2022 Updated 4 years ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k ☆14 Oct 11, 2025 Updated 6 months ago
- Defeating the Training-Inference Mismatch via FP16 ☆192 Nov 14, 2025 Updated 5 months ago
- Demonstration and tutorial notebooks for the Higra library ☆13 Sep 29, 2025 Updated 7 months ago
- Roslyn-based static code analysis for pulumi programs written in C# ☆12 Jun 29, 2022 Updated 3 years ago
- LLM4HWDesign Starting Toolkit ☆19 Oct 4, 2024 Updated last year
- A tool for an analysis of LLM generations. ☆42 Oct 13, 2025 Updated 6 months ago
- 📰 Building News Agents to Summarize News with MCP, Q, and tmux ☆316 Jul 19, 2025 Updated 9 months ago
- Simulator of a basic order book flow and order execution ☆18 Mar 22, 2023 Updated 3 years ago
- HIVE: Evaluating the Human Interpretability of Visual Explanations (ECCV 2022) ☆22 Jan 19, 2023 Updated 3 years ago
- Pytorch implementation of OCFGAN-GP (CVPR 2020, Oral). ☆15 Apr 3, 2020 Updated 6 years ago
- ☆59 Dec 12, 2025 Updated 4 months ago
- Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models ☆29 Apr 17, 2025 Updated last year
- ☆17 Aug 1, 2025 Updated 9 months ago
- UNMAINTAINED: See celluloid/celluloid#779 ☆49 Aug 21, 2018 Updated 7 years ago
- Benchmark Python and Cython code ☆13 Jun 13, 2014 Updated 11 years ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag… ☆140 Apr 30, 2026 Updated last week
- Collection of Common Machine Translation Tools ☆11 Jul 26, 2022 Updated 3 years ago
- An encoder-decoder framework for learning from incomplete data ☆46 Jul 6, 2023 Updated 2 years ago
- Basic environment configuration for the 0xFFFF website ☆10 May 14, 2022 Updated 3 years ago
- New examples for EtherCard ENC28J60 library ☆24 Dec 9, 2011 Updated 14 years ago
- OpenGL Paint for OS X ☆26 Aug 21, 2014 Updated 11 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 Oct 12, 2022 Updated 3 years ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨 ☆43 Apr 7, 2024 Updated 2 years ago
- Web framework for GeoSolver ☆14 Feb 18, 2017 Updated 9 years ago
- My solutions for Advanced Python Mastery (course by @dabeaz) ☆11 Jan 29, 2024 Updated 2 years ago
- Unveiling the Economics of SQL Operations ☆10 Apr 21, 2024 Updated 2 years ago
- [𝐍𝐚𝐭𝐮𝐫𝐞 𝐂𝐨𝐦𝐦𝐮𝐧𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬] 🤖💡 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal C… ☆26 Apr 21, 2026 Updated 2 weeks ago
- neon implementation of SegNet ☆13 Jan 3, 2023 Updated 3 years ago
- Interactive visualizations of the geometric intuition behind diffusion models. ☆1,109 Apr 15, 2026 Updated 3 weeks ago
- ☆11 Mar 15, 2017 Updated 9 years ago
- Learning Backtracking Models, ICLR'19 ☆10 Feb 2, 2018 Updated 8 years ago