OSU-NLP-Group / SeeActChromeExtensionLinks
☆18Updated 11 months ago
Alternatives and similar repositories for SeeActChromeExtension
Users that are interested in SeeActChromeExtension are comparing it to the libraries listed below
Sorting:
- ☆11Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆66Updated last year
- Run SWE-bench evaluations remotely☆46Updated 4 months ago
- [ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant☆42Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆38Updated last year
- ☆32Updated last year
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆19Updated last year
- ☆67Updated 8 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆18Updated last year
- ☆63Updated 5 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Updated 8 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆53Updated last year
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Updated 5 months ago
- Enhancement in Multimodal Representation Learning.☆40Updated last year
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Updated last year
- ☆19Updated 2 months ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆58Updated 9 months ago
- Verifiers for LLM Reinforcement Learning☆80Updated 8 months ago
- ☆24Updated last year
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆35Updated 2 months ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Updated 11 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆33Updated 3 months ago
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"☆28Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated last year
- ☆84Updated last year
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!☆133Updated 2 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆136Updated last year
- ☆86Updated 2 years ago
- ☆40Updated last year
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆20Updated this week