awjuliani / web-rl-playgroundLinks
An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment.
☆76Updated 4 months ago
Alternatives and similar repositories for web-rl-playground
Users that are interested in web-rl-playground are comparing it to the libraries listed below
Sorting:
- Fetch arxiv data to LLM-friendly text☆125Updated 7 months ago
- ☆77Updated last month
- ☆169Updated last year
- A fully functional and simple Machine Learning library made entirely from scratch with Python.☆298Updated 2 months ago
- A Deep Research agent from scratch☆211Updated 4 months ago
- ☆55Updated 10 months ago
- RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks☆205Updated 3 months ago
- Learning records for building a large language model from scratch☆58Updated 9 months ago
- 📰 Building News Agents to Summarize News with MCP, Q, and tmux☆304Updated 2 months ago
- Curated resources for discovering, reading, and working with arXiv papers☆340Updated 4 months ago
- ☆77Updated 5 months ago
- A transformer-based multimodal model for music.☆29Updated last year
- An AI agent to control drones from your CLI☆134Updated 2 months ago
- Commit0: Library Generation from Scratch☆168Updated 5 months ago
- ☆41Updated last week
- Countdown Game Distill&RL☆47Updated last month
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆45Updated 8 months ago
- DeepSearch Code-Actions Agent (DSCA). Build 🙌 with 🤗 smolagents☆122Updated 2 months ago
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆83Updated 6 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆99Updated 2 months ago
- AlphaXIV open-source alternative: Chat with any arXiv paper.☆83Updated 4 months ago
- support BM25+vecetor☆29Updated 4 months ago
- SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis.☆94Updated 5 months ago
- https://no-ocr.com/about☆164Updated 3 months ago
- ☆258Updated last month
- Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors (ACL Findings 2025)☆83Updated 4 months ago
- ☆49Updated 8 months ago
- An AI-powered interface for exploring and understanding arXiv research papers☆224Updated 2 weeks ago
- Open-source autonomous cleaning & housekeeping robot☆235Updated 2 months ago
- Challenges for general-purpose web-browsing AI agents☆65Updated 4 months ago