awjuliani / web-rl-playgroundView external linksLinks
An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment.
☆96Jun 4, 2025Updated 8 months ago
Alternatives and similar repositories for web-rl-playground
Users that are interested in web-rl-playground are comparing it to the libraries listed below
Sorting:
- Official Repository for Task-Circuit Quantization☆24Jun 1, 2025Updated 8 months ago
- ☆16Feb 22, 2025Updated 11 months ago
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆182Oct 31, 2025Updated 3 months ago
- Yet another coding assistant powered by LLM.☆16Sep 11, 2024Updated last year
- Cookbook for Crafting Good Code☆57Mar 19, 2024Updated last year
- A tool for an analysis of LLM generations.☆42Oct 13, 2025Updated 4 months ago
- LLM4HWDesign Starting Toolkit☆19Oct 4, 2024Updated last year
- Simulator of a basic order book flow and order execution☆18Mar 22, 2023Updated 2 years ago
- Project code for training LLMs to write better unit tests + code☆21May 19, 2025Updated 8 months ago
- ☆25Dec 13, 2024Updated last year
- 两家交易所做比特币的高频对冲☆19Dec 5, 2019Updated 6 years ago
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆40Jul 21, 2025Updated 6 months ago
- ☆24Apr 3, 2025Updated 10 months ago
- Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models☆27Apr 17, 2025Updated 9 months ago
- ☆17Aug 1, 2025Updated 6 months ago
- ☆22Nov 8, 2021Updated 4 years ago
- Shopify Backend Developer Intern Challenge - Summer 2022☆11Jan 15, 2022Updated 4 years ago
- A GTK graphical interface for chatting with large language models (LLMs)☆83Dec 15, 2025Updated 2 months ago
- ☆45Jun 10, 2025Updated 8 months ago
- Unveiling the Economics of SQL Operations☆10Apr 21, 2024Updated last year
- Search and analyze your text data☆10Nov 19, 2024Updated last year
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated last year
- ☆35May 16, 2025Updated 8 months ago
- ☆86Updated this week
- Code for "Exploring Dynamic Selection of Branch Expansion Orders for Code Generation" (ACL 2021)☆31Apr 11, 2022Updated 3 years ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆128Oct 9, 2025Updated 4 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆151Sep 19, 2025Updated 4 months ago
- This repository contains source code and a high-quality test dataset for "Automated Commit Message Generation with Large Language Models.…☆10Nov 6, 2025Updated 3 months ago
- Kinematic and dynamic models of continuum and articulated soft robots.☆15Nov 22, 2025Updated 2 months ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆22Nov 13, 2025Updated 3 months ago
- CrewAI-Agentic-Jira: Enhance your Jira workflows with intelligent agent-driven automation. Powered by the CrewAI framework, this project …☆21Feb 3, 2025Updated last year
- ☆37Mar 16, 2022Updated 3 years ago
- Code implementation for CoTexT: Multi-task Learning with Code-Text Transformer☆36Sep 14, 2021Updated 4 years ago
- ☆14Apr 14, 2025Updated 10 months ago
- AWS virtual infrastructure simulator for training reinforcement learning based cloud capacity management systems☆11Sep 23, 2020Updated 5 years ago
- code for polite☆11Feb 28, 2024Updated last year
- A Simple Web Crawler from Scratch.☆11Dec 2, 2017Updated 8 years ago
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models.☆99Oct 8, 2024Updated last year
- This project aims to convert the content of GitHub repositories into a structured, machine-readable format, enabling AI models like ChatG…☆12May 13, 2024Updated last year