agential-ai / agential
Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!
★44 · Updated this week
Related projects
Alternatives and complementary repositories for agential
- ★28 · Updated 2 weeks ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ★75 · Updated 3 weeks ago
- ★40 · Updated last month
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ★64 · Updated 4 months ago
- Repository for paper Tools Are Instrumental for Language Agents in Complex Environments ★32 · Updated last month
- ★38 · Updated this week
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A… ★39 · Updated 9 months ago
- Codebase accompanying the Summary of a Haystack paper. ★71 · Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ★48 · Updated 3 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets" ★47 · Updated 5 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ★61 · Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ★46 · Updated 2 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators ★41 · Updated 8 months ago
- ★111 · Updated last month
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ★68 · Updated 3 weeks ago
- ★44 · Updated last month
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ★28 · Updated 8 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ★30 · Updated 9 months ago
- ★14 · Updated this week
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ★38 · Updated last month
- ★41 · Updated last month
- The Library for LLM-based multi-agent applications ★62 · Updated this week
- Repository for the paper Stream of Search: Learning to Search in Language ★84 · Updated 3 months ago
- ★56 · Updated 8 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ★13 · Updated 8 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts? ★34 · Updated 2 weeks ago
- ★76 · Updated 10 months ago
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models" ★71 · Updated 10 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ★20 · Updated last month
- Discovering Data-driven Hypotheses in the Wild ★39 · Updated 2 weeks ago