TheDuckAI / DuckTrack
Multimodal computer agent data collection program
☆119Updated last year
Alternatives and similar repositories for DuckTrack:
Users that are interested in DuckTrack are comparing it to the libraries listed below
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆140Updated this week
- ☆36Updated last year
- ☆51Updated 6 months ago
- ☆81Updated last year
- Public Inflection Benchmarks☆69Updated 11 months ago
- ☆38Updated 6 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆125Updated 2 months ago
- Multimodal language model benchmark, featuring challenging examples☆158Updated last month
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆46Updated 2 months ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents☆160Updated this week
- Just a bunch of benchmark logs for different LLMs☆119Updated 6 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆167Updated last month
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆47Updated 2 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆81Updated 3 months ago
- Evaluating LLMs with CommonGen-Lite☆88Updated 10 months ago
- LILO: Library Induction with Language Observations☆83Updated 5 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- Data preparation code for CrystalCoder 7B LLM☆44Updated 9 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆137Updated last week
- A lightweight script for processing HTML page to markdown format with support for code blocks☆78Updated 10 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆98Updated 5 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆96Updated last year
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆74Updated last year
- ☆120Updated 8 months ago
- An AI agent for interacting with a computer using the graphical user interface☆74Updated last year
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆37Updated this week
- ☆142Updated 2 months ago
- ☆121Updated last year
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆192Updated last week