The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"
☆60 · Jun 21, 2025 · Updated 10 months ago
Alternatives and similar repositories for ML-Agent
Users that are interested in ML-Agent are comparing it to the libraries listed below.
- ☆18 · Feb 20, 2024 · Updated 2 years ago
- [ACL 2025] DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues ☆26 · Jul 10, 2025 · Updated 9 months ago
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting ☆27 · Oct 19, 2025 · Updated 6 months ago
- [NeurIPS 2025 D&B Track] MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research ☆28 · Sep 23, 2025 · Updated 7 months ago
- ☆16 · Mar 6, 2025 · Updated last year
- Official Implementation of "Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts" at EMNLP 202… ☆13 · Oct 27, 2024 · Updated last year
- ☆94 · Oct 30, 2025 · Updated 6 months ago
- ☆12 · Jan 25, 2024 · Updated 2 years ago
- Graph Coarsening with Neural Networks ☆11 · Mar 3, 2022 · Updated 4 years ago
- [ICLR 2026] SR-Scientist: Scientific Equation Discovery With Agentic AI ☆39 · Jan 27, 2026 · Updated 3 months ago
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems ☆13 · May 5, 2025 · Updated 11 months ago
- Gym-Anything: Turn any Software into an Agent Environment ☆184 · Apr 8, 2026 · Updated 3 weeks ago
- ☆33 · Aug 26, 2025 · Updated 8 months ago
- The code of Dynamic Graph Learning Based on Hierarchical Memory for Origin-Destination Demand Prediction ☆14 · Apr 29, 2022 · Updated 4 years ago
- ☆103 · Mar 6, 2026 · Updated last month
- [AAAI 2025] The official code of the paper "InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct"(http… ☆14 · Jul 10, 2024 · Updated last year
- ☆15 · Mar 6, 2024 · Updated 2 years ago
- Radiology Language Evaluations ☆11 · Nov 17, 2023 · Updated 2 years ago
- ☆19 · Dec 20, 2025 · Updated 4 months ago
- ☆13 · Jul 14, 2024 · Updated last year
- Universal preflight security scanner for AI coding agents — Detects hooks injection, credential exfiltration & backdoors in .cursorrules,… ☆68 · Apr 9, 2026 · Updated 3 weeks ago
- Data and codes for MetroGAN ☆16 · Dec 23, 2024 · Updated last year
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization ☆98 · Aug 20, 2024 · Updated last year
- ☆10 · May 10, 2024 · Updated last year
- ☆20 · Apr 23, 2026 · Updated last week
- Sci. Rep. 2025 | Revisiting model scaling with a U-net benchmark for 3D medical image segmentation ☆18 · Aug 21, 2025 · Updated 8 months ago
- Code for ICLR 2025 Paper "GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment" ☆21 · Feb 10, 2025 · Updated last year
- This code accompanies the paper "Bayesian Framework for Information-Theoretic Probing" published in EMNLP 2021. ☆10 · Aug 23, 2021 · Updated 4 years ago
- A PyTorch native library for large model training ☆28 · Apr 1, 2026 · Updated 3 weeks ago
- [EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent ☆35 · Aug 20, 2025 · Updated 8 months ago
- Data for EMNLP 2022 paper "arXivEdits: Understanding the Human Revision Process in Scientific Writing". ☆14 · Sep 30, 2023 · Updated 2 years ago
- Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving" ☆23 · Sep 1, 2025 · Updated 7 months ago
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding. ☆48 · Sep 15, 2025 · Updated 7 months ago
- ☆13 · Jan 14, 2022 · Updated 4 years ago
- A First Look at Conventional Commits Classification ☆13 · Nov 18, 2024 · Updated last year
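- This project designs an electronic keyboard that can produce 21 musical notes. Input comes from a PS2 keyboard, is recognized and processed on the Basys2 board to generate sounds at specific frequencies, and is finally output through the Pmod_AMP module. ☆10 · Jul 21, 2019 · Updated 6 years ago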
- ☆53 · Apr 17, 2026 · Updated last week
- Artifact evaluation of MobiSys25 SynCheck ☆20 · Mar 24, 2025 · Updated last year
- InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery ☆1,284 · Mar 17, 2026 · Updated last month