[ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control
☆68Jan 7, 2026Updated 2 months ago
Alternatives and similar repositories for Synapse
Users that are interested in Synapse are comparing it to the libraries listed below
Sorting:
- ☆59Jan 9, 2024Updated 2 years ago
- ☆35Jun 20, 2024Updated last year
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)☆255Jul 16, 2024Updated last year
- ☆20Apr 24, 2024Updated last year
- A codebase for "Language Models can Solve Computer Tasks"☆240May 1, 2024Updated last year
- ☆12Oct 5, 2020Updated 5 years ago
- An LSTM model implemented by PyTorch to perform sentiment classification on the Stanford Sentiment Treebank (SST-5) dataset.☆11Sep 13, 2022Updated 3 years ago
- ☆35Jan 12, 2026Updated last month
- VisualWebArena is a benchmark for multimodal agents.☆440Nov 9, 2024Updated last year
- ☆12Aug 8, 2024Updated last year
- This Python package implements algorithms for multiviews (multimodals) learning☆14Sep 26, 2024Updated last year
- The source code used for paper "Effective Seed-Guided Topic Discovery by Integrating Multiple Types of Contexts", in WSDM 2023.☆15May 27, 2023Updated 2 years ago
- SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation☆60Jul 11, 2025Updated 7 months ago
- GUICourse: From General Vision Langauge Models to Versatile GUI Agents☆136Updated this week
- The model, data and code for the visual GUI Agent SeeClick☆469Jul 13, 2025Updated 7 months ago
- TensorFlow implementation of A Fast and Accurate Dependency Parser using Neural Networks☆17Jan 21, 2020Updated 6 years ago
- A weak supervision framework for (partial) labeling functions☆16Jul 15, 2024Updated last year
- Source code of the paper "Synchronous Double-channel Recurrent Network for Aspect-Opinion Pair Extraction, ACL 2020."☆12Aug 10, 2020Updated 5 years ago
- GPT-4V in Wonderland: LMMs as Smartphone Agents☆135Jul 17, 2024Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆16Jan 16, 2024Updated 2 years ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Apr 9, 2025Updated 10 months ago
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Mar 11, 2024Updated last year
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆160Feb 11, 2025Updated last year
- Code Release for the 2023 NeurIPS Paper How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained langua…☆17Dec 6, 2024Updated last year
- [SCIS] MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images☆44Nov 19, 2025Updated 3 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆229Jun 16, 2025Updated 8 months ago
- A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization☆16Dec 22, 2024Updated last year
- ☆19May 19, 2024Updated last year
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆20Jan 11, 2026Updated last month
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆952Nov 5, 2025Updated 4 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆256Apr 24, 2025Updated 10 months ago
- Repo for ICML'23 paper SurCo Learning Linear Surrogates For Combinatorial Nonlinear Optimization Problems☆19Jul 11, 2023Updated 2 years ago
- Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems☆22May 28, 2021Updated 4 years ago
- Entity-Driven Image Search over Multimodal Web Content (EMNLP 2023)☆26Dec 2, 2023Updated 2 years ago
- Building a comprehensive and handy list of papers for GUI agents☆642Oct 27, 2025Updated 4 months ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆1,353Nov 26, 2025Updated 3 months ago
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents☆300Jul 18, 2025Updated 7 months ago
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?☆234Feb 23, 2026Updated last week
- [EMNLP 2021] Dataset and PyTorch Code for ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense Reasoning☆15Nov 5, 2022Updated 3 years ago