thesofakillers / aidemlLinks
AIDE: the Machine Learning CodeGen Agent
β25Updated last year
Alternatives and similar repositories for aideml
Users that are interested in aideml are comparing it to the libraries listed below
Sorting:
- π§ Compare how Agent systems perform on several benchmarks. ππβ103Updated 6 months ago
- β82Updated 3 months ago
- β39Updated last year
- Codebase accompanying the Summary of a Haystack paper.β80Updated last year
- Official Repo for CRMArena and CRMArena-Proβ132Updated this week
- Repository for βPlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makersβ, NAACL24β151Updated last year
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".β69Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo rankerβ126Updated 3 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ115Updated 10 months ago
- Automating enterprise workflows with multimodal agentsβ115Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorerβ46Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Modelsβ101Updated 2 years ago
- β28Updated 10 months ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.β50Updated 2 years ago
- Verifiers for LLM Reinforcement Learningβ80Updated 9 months ago
- Mixing Language Models with Self-Verification and Meta-Verificationβ112Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"β120Updated 3 months ago
- β63Updated last year
- Beating the GAIA benchmark with Transformers Agents. πβ146Updated 11 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [Fβ¦β67Updated last year
- LLM reads a paper and produce a working prototypeβ60Updated 9 months ago
- β54Updated 3 weeks ago
- Writing Blog Posts with Generative Feedback Loops!β50Updated last year
- EcoAssistant: using LLM assistant more affordably and accuratelyβ134Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluatorsβ43Updated last year
- Agent that routes to different tools - LLM classifier SDKβ45Updated last year
- β41Updated last year
- Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.β33Updated last year
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraphβ148Updated last year
- β147Updated last year