An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment.
☆98Jun 4, 2025Updated 9 months ago
Alternatives and similar repositories for web-rl-playground
Users who are interested in web-rl-playground are comparing it to the repositories listed below
- ☆16Feb 22, 2025Updated last year
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆187Oct 31, 2025Updated 4 months ago
- Yet another coding assistant powered by LLM.☆16Sep 11, 2024Updated last year
- Cookbook for Crafting Good Code☆57Mar 19, 2024Updated last year
- A tool for an analysis of LLM generations.☆42Oct 13, 2025Updated 4 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆28Mar 1, 2025Updated last year
- LLM4HWDesign Starting Toolkit☆19Oct 4, 2024Updated last year
- Spitzers Architecture School Urban Lab for Unit 26. This repository explores designing and codifying urban systems from the bottom up in …☆14Mar 29, 2022Updated 3 years ago
- Project code for training LLMs to write better unit tests + code☆21May 19, 2025Updated 9 months ago
- High-frequency Bitcoin hedging across two exchanges☆19Dec 5, 2019Updated 6 years ago
- Defeating the Training-Inference Mismatch via FP16☆183Nov 14, 2025Updated 3 months ago
- Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models☆27Apr 17, 2025Updated 10 months ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆27Oct 14, 2025Updated 4 months ago
- ☆24Apr 3, 2025Updated 11 months ago
- ☆17Aug 1, 2025Updated 7 months ago
- ☆22Nov 8, 2021Updated 4 years ago
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆42Jul 21, 2025Updated 7 months ago
- An algorithmic trading robot written in Python.☆28Jun 4, 2017Updated 8 years ago
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆284Sep 25, 2025Updated 5 months ago
- ☆29Oct 24, 2025Updated 4 months ago
- ☆45Jun 10, 2025Updated 8 months ago
- ☆35May 16, 2025Updated 9 months ago
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated last year
- ☆88Updated this week
- This is for the capstone project "Optimal Execution of a VWAP order".☆39Nov 21, 2019Updated 6 years ago
- Code for "Exploring Dynamic Selection of Branch Expansion Orders for Code Generation" (ACL 2021)☆31Apr 11, 2022Updated 3 years ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆133Feb 21, 2026Updated 2 weeks ago
- This repository contains source code and a high-quality test dataset for "Automated Commit Message Generation with Large Language Models.…☆10Nov 6, 2025Updated 4 months ago
- Kinematic and dynamic models of continuum and articulated soft robots.☆15Nov 22, 2025Updated 3 months ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- CrewAI-Agentic-Jira: Enhance your Jira workflows with intelligent agent-driven automation. Powered by the CrewAI framework, this project …☆22Feb 3, 2025Updated last year
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆152Sep 19, 2025Updated 5 months ago
- Code implementation for CoTexT: Multi-task Learning with Code-Text Transformer☆36Sep 14, 2021Updated 4 years ago
- ☆11May 18, 2023Updated 2 years ago
- Solutions to the examples and exercises in 《算法竞赛入门经典》 (Introduction to Competitive Programming), 2nd edition☆10May 10, 2021Updated 4 years ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆30Updated this week
- A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications.☆15Dec 20, 2021Updated 4 years ago
- MATLAB function to fill an area with hatching ~~or speckling~~☆11Mar 4, 2018Updated 8 years ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago