OSU-NLP-Group / SeeActChromeExtension
☆16 Updated 7 months ago
Alternatives and similar repositories for SeeActChromeExtension
Users who are interested in SeeActChromeExtension are comparing it to the repositories listed below
- ☆11 Updated 9 months ago
- ☆67 Updated 5 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆61 Updated 8 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆36 Updated last year
- Code for Paper: Harnessing Webpage UIs for Text-Rich Visual Understanding ☆53 Updated 8 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents ☆30 Updated 2 months ago
- [ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant ☆38 Updated 8 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆29 Updated 8 months ago
- ☆84 Updated last year
- Run SWE-bench evaluations remotely ☆40 Updated 2 weeks ago
- ☆56 Updated 2 months ago
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents! ☆107 Updated last month
- ☆23 Updated 11 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git… ☆15 Updated 4 months ago
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks! ☆54 Updated last month
- Enhancement in Multimodal Representation Learning. ☆40 Updated last year
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme… ☆19 Updated 9 months ago
- ☆28 Updated 5 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆92 Updated 7 months ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents ☆46 Updated 6 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System. ☆19 Updated 10 months ago
- ☆40 Updated 8 months ago
- ☆41 Updated last year
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems" ☆56 Updated 6 months ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024) ☆37 Updated 8 months ago
- Verifiers for LLM Reinforcement Learning ☆71 Updated 4 months ago
- ☆23 Updated 3 weeks ago
- XmodelLM ☆39 Updated 9 months ago
- ☆30 Updated last year
- ☆13 Updated 8 months ago