An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment.
☆98Jun 4, 2025Updated 9 months ago
Alternatives and similar repositories for web-rl-playground
Users that are interested in web-rl-playground are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Repository for Task-Circuit Quantization☆24Jun 1, 2025Updated 9 months ago
- Yet another coding assistant powered by LLM.☆16Sep 11, 2024Updated last year
- Memory-Based Meta-Learning on Non-Stationary Distributions☆17Mar 14, 2024Updated 2 years ago
- Project code for training LLMs to write better unit tests + code☆21May 19, 2025Updated 10 months ago
- Implementation of AlphaZero in PyTorch.☆10Apr 19, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆16Feb 22, 2025Updated last year
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆27Oct 14, 2025Updated 5 months ago
- ☆47Jun 10, 2025Updated 9 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 5 months ago
- A simple sample that shows what you need to package an F# app as a flatpak☆10Jul 5, 2023Updated 2 years ago
- Statistical analysis methods for comparing prompt and model performance in LLM evaluations.☆84Mar 22, 2026Updated last week
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆28Mar 1, 2025Updated last year
- ☆12Feb 4, 2024Updated 2 years ago
- An AI character interaction system with emotional modeling and advanced memory management☆17Oct 26, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ICML-2024 highlight paper "Realistic Unsupervised CLIP Fine-tuning with Universal Entropy Optimization"☆19Jul 18, 2024Updated last year
- ☆12Jul 17, 2023Updated 2 years ago
- ☆13Mar 10, 2023Updated 3 years ago
- Welcome to my Transformers tutorial series! In this series, I'll be diving into the powerful Transformer architecture and its implementat…☆10May 3, 2023Updated 2 years ago
- ☆15May 4, 2024Updated last year
- Bicameral-GPT is an experimental, personalized generative agent trained on journal entries.☆20Jul 25, 2023Updated 2 years ago
- A POC of the Blazor SSR capabilities in .NET 8☆14Apr 5, 2024Updated last year
- The Gödelian Toolkit: Systematically Testing Simple Languages☆16Aug 6, 2025Updated 7 months ago
- Chrome extension to generate tests for solidjs.☆18Oct 12, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Want to use NextJS with TailwindCSS without doing the setup manually? This is for you!☆12Aug 8, 2022Updated 3 years ago
- Small library for creating diffs and applying them☆21May 14, 2020Updated 5 years ago
- Dump complex C declarations visually.☆65Dec 27, 2025Updated 3 months ago
- Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models☆29Apr 17, 2025Updated 11 months ago
- A repo of a modified version of Diffusion Transformer☆49Sep 14, 2025Updated 6 months ago
- Benchmark Python and Cython code☆13Jun 13, 2014Updated 11 years ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆136Feb 21, 2026Updated last month
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- Data and Code for COLM 2025 Paper "MSRS: Evaluating Multi-Source Retrieval-Augmented Generation"☆33Aug 29, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- High-performance LLM operator library built on TileLang.☆96Updated this week
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Oct 12, 2022Updated 3 years ago
- This playground is a collection of notebooks that demonstrate how to use Elasticsearch.NET and NEST clients.☆16Nov 2, 2024Updated last year
- This repo has been migrated to https://code.larus.se/lmas/Damerau-Levenshtein☆11Jul 21, 2023Updated 2 years ago
- Web framework for GeoSolver☆14Feb 18, 2017Updated 9 years ago
- Fmodel demo - Functional and Algebraic Domain modeling - Ktor☆20Mar 19, 2026Updated last week
- Prompt-based software development☆23Aug 25, 2024Updated last year