OSU-NLP-Group / SeeActChromeExtensionLinks
☆18Updated 10 months ago
Alternatives and similar repositories for SeeActChromeExtension
Users that are interested in SeeActChromeExtension are comparing it to the libraries listed below
Sorting:
- ☆11Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆37Updated last year
- ☆62Updated 5 months ago
- ☆67Updated 7 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆66Updated 11 months ago
- [ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant☆41Updated 11 months ago
- ☆32Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated 11 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆53Updated 11 months ago
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!☆132Updated last month
- ☆35Updated 6 months ago
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆54Updated 4 months ago
- ☆41Updated last year
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…☆12Updated last year
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆19Updated last year
- ☆86Updated last year
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Updated 11 months ago
- Run SWE-bench evaluations remotely☆44Updated 3 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆23Updated last year
- ☆88Updated 3 weeks ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆34Updated last month
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆21Updated last year
- CodeNav is an LLM agent that navigates and leverages previously unseen code repositories to solve user queries.☆64Updated last year
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Updated 7 months ago
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Updated 11 months ago
- The Library for LLM-based multi-agent applications☆91Updated 4 months ago
- Interaction-first method for generating demonstrations for web-agents on any website☆51Updated 7 months ago
- Verifiers for LLM Reinforcement Learning☆79Updated 7 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆60Updated 6 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆27Updated 11 months ago