thesofakillers / aidemlLinks

AIDE: the Machine Learning CodeGen Agent

☆24

Alternatives and similar repositories for aideml

Users that are interested in aideml are comparing it to the libraries listed below

Sorting:

bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆60Updated 2 months ago
aymeric-roucher / agent_reasoning_benchmark
🔧 Compare how Agent systems perform on several benchmarks. 📊🚀
☆98Updated 8 months ago
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆78Updated 9 months ago
dinobby / MAgICoRE
☆24Updated 9 months ago
LiqiangJing / DSBench
DSBench: How Far are Data Science Agents from Becoming Data Science Experts?
☆55Updated 4 months ago
GAIR-NLP / scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
☆42Updated last year
yueqis / API-Based-Agent
☆50Updated 3 weeks ago
miralab-ai / autoreason
☆41Updated 6 months ago
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53Updated 7 months ago
SiliangZeng / Multi-Turn-RL-Agent
☆48Updated 2 weeks ago
phunterlau / paper_without_code
LLM reads a paper and produce a working prototype
☆57Updated 2 months ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆47Updated 4 months ago
wang-research-lab / agentinstruct
Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"
☆112Updated 9 months ago
awslabs / rag-qa-arena
☆45Updated 10 months ago
Nardien / agent-distillation
Official Code Repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools"
☆109Updated 3 weeks ago
facebookresearch / collaborative-reasoner
Source code for the collaborative reasoner research project at Meta FAIR.
☆91Updated 2 months ago
ytyz1307zzh / RefAug
Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"
☆54Updated 8 months ago
zjunlp / KnowSelf
[ACL 2025] Agentic Knowledgeable Self-awareness
☆72Updated last week
rhyang2021 / SELFGOAL
Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".
☆66Updated 11 months ago
du-nlp-lab / MLR-Copilot
☆65Updated 2 months ago
letta-ai / sleep-time-compute
accompanying material for sleep-time compute paper
☆95Updated last month
allenai / infinigram-api
☆61Updated 3 weeks ago
sunblaze-ucb / reasoning_ladder
☆32Updated last month
orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆73Updated last month
axolotl-ai-cloud / grpo_code
A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.
☆30Updated 2 months ago
PrimeIntellect-ai / genesys
☆127Updated 3 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆57Updated 9 months ago
SalesforceAIResearch / CRMArena
Official Repo for CRMArena and CRMArena-Pro
☆92Updated last week
arcee-ai / DAM
☆51Updated 7 months ago
zbambergerNLP / strategic-debate-tot
A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments
☆82Updated 8 months ago