mlfoundations / Gelato
🍨 Gelato - From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents
★37 · Updated last month
Alternatives and similar repositories for Gelato
Users who are interested in Gelato are comparing it to the repositories listed below.
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents ★37 · Updated 4 months ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine ★25 · Updated last year
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25] ★34 · Updated 5 months ago
- Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon! ★53 · Updated 10 months ago
- [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs". ★49 · Updated 2 weeks ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning" ★102 · Updated 5 months ago
- ★17 · Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours ★65 · Updated last year
- Code for Paper: Harnessing Webpage UIs For Text Rich Visual Understanding ★53 · Updated last year
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers" ★97 · Updated 3 months ago
- The Library for LLM-based multi-agent applications ★102 · Updated 6 months ago
- ★18 · Updated 9 months ago
- The theory of mind module for the SWE agent ★73 · Updated 3 weeks ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you? ★23 · Updated last year
- ★87 · Updated last year
- Challenges for general-purpose web-browsing AI agents ★67 · Updated 8 months ago
- Lightweight OpenHands CLI in a binary executable ★100 · Updated this week
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents! ★137 · Updated 4 months ago
- ★57 · Updated last year
- LIMI: Less is More for Agency ★160 · Updated 3 months ago
- ★39 · Updated last year
- ★67 · Updated 10 months ago
- Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa… ★11 · Updated last year
- [NAACL2025] LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications ★143 · Updated 7 months ago
- ★88 · Updated 8 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3 ★10 · Updated 4 months ago
- entropix style sampling + GUI ★27 · Updated last year
- C^3-Bench: The Things Real Disturbing LLM based Agent in Multi-Tasking ★37 · Updated 7 months ago
- Get aid from local LLMs right in your PowerShell ★15 · Updated 9 months ago
- ★19 · Updated 11 months ago