OSU-NLP-Group / SeeActChromeExtension
☆16 Updated 5 months ago
Alternatives and similar repositories for SeeActChromeExtension
Users who are interested in SeeActChromeExtension are comparing it to the libraries listed below
- ☆13 Updated 5 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆56 Updated 5 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆34 Updated last year
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme… ☆17 Updated 6 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models ☆21 Updated 2 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆19 Updated 5 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆16 Updated last year
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044 ☆33 Updated 8 months ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit. ☆15 Updated 11 months ago
- Aioli: A unified optimization framework for language model data mixing ☆27 Updated 4 months ago
- ☆50 Updated last week
- ☆38 Updated 5 months ago
- CodeNav is an LLM agent that navigates and leverages previously unseen code repositories to solve user queries. ☆48 Updated 9 months ago
- ☆20 Updated last month
- Automated Qualitative Analysis of LLMs (ICLR 2025) ☆38 Updated last month
- Enhancement in Multimodal Representation Learning. ☆40 Updated last year
- Verifiers for LLM Reinforcement Learning ☆56 Updated last month
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning" ☆12 Updated 7 months ago
- ☆9 Updated last month
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks! ☆52 Updated 2 months ago
- CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments ☆61 Updated last week
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior… ☆12 Updated last year
- Measuring and Controlling Persona Drift in Language Model Dialogs ☆17 Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆32 Updated last month
- ☆65 Updated 2 months ago
- ☆26 Updated 10 months ago
- ☆40 Updated 10 months ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data" ☆17 Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 Updated 4 months ago
- ☆15 Updated 2 months ago