thesofakillers / aideml
AIDE: the Machine Learning CodeGen Agent
★24 · Updated 10 months ago
Alternatives and similar repositories for aideml
Users who are interested in aideml are comparing it to the libraries listed below.
- Compare how Agent systems perform on several benchmarks. ★100 · Updated last month
- ★40 · Updated 8 months ago
- ★76 · Updated 7 months ago
- Official Repo for CRMArena and CRMArena-Pro ★110 · Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper. ★79 · Updated 11 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ★111 · Updated 4 months ago
- Verifiers for LLM Reinforcement Learning ★71 · Updated 4 months ago
- Mixing Language Models with Self-Verification and Meta-Verification ★107 · Updated 8 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ★69 · Updated last year
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ★114 · Updated this week
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ★116 · Updated 11 months ago
- Automating enterprise workflows with multimodal agents ★110 · Updated 10 months ago
- Repository for "PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers", NAACL24 ★145 · Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ★88 · Updated 11 months ago
- The first dense retrieval model that can be prompted like an LM ★86 · Updated 3 months ago
- Official page for ICLR 2025 paper "Sufficient Context: A New Lens on Retrieval Augmented Generation Systems" ★50 · Updated last month
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" ★75 · Updated 8 months ago
- ★28 · Updated 5 months ago
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? ★129 · Updated last year
- ★145 · Updated last year
- Testing speed and accuracy of RAG with and without Cross Encoder Reranker. ★48 · Updated last year
- Score LLM pretraining data with classifiers ★55 · Updated last year
- LLM reads a paper and produces a working prototype ★57 · Updated 4 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning" ★55 · Updated 11 months ago
- Agent that routes to different tools - LLM classifier SDK ★44 · Updated last year
- Simple GRPO scripts and configurations. ★59 · Updated 6 months ago
- ★67 · Updated 5 months ago
- Train your own SOTA deductive reasoning model ★105 · Updated 5 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models ★97 · Updated last year
- ★94 · Updated 5 months ago