awjuliani / web-rl-playground
An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment.
☆60 · Updated 2 weeks ago
Alternatives and similar repositories for web-rl-playground
Users who are interested in web-rl-playground are comparing it to the repositories listed below
- Fetch arxiv data to LLM-friendly text ☆117 · Updated 3 months ago
- Inference Llama 2 in C++ ☆43 · Updated last year
- Commit0: Library Generation from Scratch ☆149 · Updated 3 weeks ago
- Countdown Game Distill&RL ☆47 · Updated last month
- ☆53 · Updated 6 months ago
- A Deep Research agent from scratch ☆178 · Updated 2 weeks ago
- LLM-as-SERP ☆66 · Updated 3 months ago
- ☆68 · Updated last month
- SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis. ☆81 · Updated last month
- ☆48 · Updated 4 months ago
- Learning records for building a large language model from scratch ☆55 · Updated 5 months ago
- ☆57 · Updated 3 months ago
- LLM reads a paper and produces a working prototype ☆57 · Updated last month
- ☆77 · Updated last month
- ☆57 · Updated last week
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click. ☆78 · Updated 2 months ago
- a collection of resources around LLMs, aggregated for the workshop "Mastering LLMs: End-to-End Fine-Tuning and Deployment" by Dan Becker … ☆110 · Updated last year
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts ☆44 · Updated 4 months ago
- Testing paligemma2 finetuning on reasoning dataset ☆18 · Updated 5 months ago
- Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon! ☆45 · Updated last month
- Challenges for general-purpose web-browsing AI agents ☆58 · Updated last week
- ☆165 · Updated last year
- CursorCore: Assist Programming through Aligning Anything ☆123 · Updated 3 months ago
- ☆41 · Updated 5 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**. ☆38 · Updated last month
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM. ☆38 · Updated 8 months ago
- Awesome Code Action - DeepWebSearch AgentKit App. Build with 🤗 Hugging Face smolagents framework ☆40 · Updated this week
- Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library ☆173 · Updated last week
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆67 · Updated 2 months ago
- 📰 Building News Agents to Summarize News with MCP, Q, and tmux ☆254 · Updated 3 weeks ago