HazyResearch / eclair-agents
Automating enterprise workflows with multimodal agents
☆105Updated 6 months ago
Alternatives and similar repositories for eclair-agents:
Users that are interested in eclair-agents are comparing it to the libraries listed below
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆77Updated 6 months ago
- AWM: Agent Workflow Memory☆257Updated 2 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆103Updated 4 months ago
- Train your own SOTA deductive reasoning model☆83Updated last month
- ☆120Updated 3 weeks ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆144Updated last year
- Solving data for LLMs - Create quality synthetic datasets!☆145Updated 2 months ago
- Code for ScribeAgent paper☆55Updated last month
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆121Updated last month
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆146Updated 2 months ago
- ☆145Updated last month
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆75Updated last month
- Just a bunch of benchmark logs for different LLMs☆119Updated 8 months ago
- ☆74Updated 5 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆60Updated 9 months ago
- Google Deepmind's PromptBreeder for automated prompt engineering implemented in langchain expression language.☆101Updated 8 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆66Updated 9 months ago
- LLM reads a paper and produce a working prototype☆52Updated this week
- ☆77Updated 10 months ago
- ☆119Updated 8 months ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆48Updated last year
- Verdict is a library for scaling judge-time compute.☆195Updated 3 weeks ago
- Evaluating LLMs with CommonGen-Lite☆89Updated last year
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆49Updated 2 weeks ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆105Updated last week
- WIP - Allows you to create DSPy pipelines using ComfyUI☆187Updated 4 months ago
- Code for ExploreTom☆79Updated 4 months ago
- LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores a…☆50Updated 2 weeks ago
- ☆53Updated 2 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago