agential-ai / agential
Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!
☆51 Updated last month
Alternatives and similar repositories for agential:
Users that are interested in agential are comparing it to the libraries listed below
- ☆50 Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆55 Updated 6 months ago
- ☆48 Updated 4 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆75 Updated 5 months ago
- ☆70 Updated last month
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024) ☆35 Updated 2 months ago
- ☆119 Updated 5 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆32 Updated last year
- ☆57 Updated 5 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆72 Updated last week
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated last year
- Small, simple agent task environments for training and evaluation ☆18 Updated 4 months ago
- ☆60 Updated last month
- Learning to route instances for Human vs AI Feedback ☆20 Updated last month
- CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments ☆47 Updated 2 weeks ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆52 Updated 3 months ago
- ☆27 Updated 2 weeks ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆14 Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆43 Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models ☆118 Updated last year
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆66 Updated 8 months ago
- ☆41 Updated 2 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆54 Updated last year
- ☆20 Updated last week
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆76 Updated 4 months ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆23 Updated 2 years ago
- Evaluating LLMs with CommonGen-Lite ☆89 Updated 11 months ago