thesofakillers / aidemlLinks
AIDE: the Machine Learning CodeGen Agent
☆24Updated 8 months ago
Alternatives and similar repositories for aideml
Users that are interested in aideml are comparing it to the libraries listed below
Sorting:
- Verifiers for LLM Reinforcement Learning☆60Updated 2 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆98Updated 8 months ago
- Codebase accompanying the Summary of a Haystack paper.☆78Updated 9 months ago
- ☆24Updated 9 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆55Updated 4 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- ☆50Updated 3 weeks ago
- ☆41Updated 6 months ago
- Simple examples using Argilla tools to build AI☆53Updated 7 months ago
- ☆48Updated 2 weeks ago
- LLM reads a paper and produce a working prototype☆57Updated 2 months ago
- ☆47Updated 4 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆112Updated 9 months ago
- ☆45Updated 10 months ago
- Official Code Repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools"☆109Updated 3 weeks ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆91Updated 2 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆54Updated 8 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness☆72Updated last week
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆66Updated 11 months ago
- ☆65Updated 2 months ago
- accompanying material for sleep-time compute paper☆95Updated last month
- ☆61Updated 3 weeks ago
- ☆32Updated last month
- The first dense retrieval model that can be prompted like an LM☆73Updated last month
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆30Updated 2 months ago
- ☆127Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- Official Repo for CRMArena and CRMArena-Pro☆92Updated last week
- ☆51Updated 7 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆82Updated 8 months ago