The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"
☆58Jun 21, 2025Updated 9 months ago
Alternatives and similar repositories for ML-Agent
Users that are interested in ML-Agent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting☆27Oct 19, 2025Updated 5 months ago
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆34Nov 1, 2025Updated 5 months ago
- MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research☆24Sep 23, 2025Updated 6 months ago
- ☆16Mar 6, 2025Updated last year
- Official Implementation of "Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts" at EMNLP 202…☆13Oct 27, 2024Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- R1V, trained with AI feedback, answers open-ended visual questions.☆14Apr 12, 2025Updated 11 months ago
- Universal preflight security scanner for AI coding agents — Detects hooks injection, credential exfiltration & backdoors in .cursorrules,…☆55Updated this week
- ☆12Jan 25, 2024Updated 2 years ago
- ☆39Jun 14, 2025Updated 9 months ago
- A Simple Active-and-Adaptive Baseline for Cross-Domain 3D Semantic Segmentation☆13Dec 22, 2022Updated 3 years ago
- ☆33Aug 26, 2025Updated 7 months ago
- ☆80Mar 6, 2026Updated last month
- ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback☆121Sep 20, 2025Updated 6 months ago
- Official implementation of “Response Attack: Exploiting Contextual Priming to Jailbreak Large Language Models” (AAAI 2026).☆35Mar 22, 2026Updated 2 weeks ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems☆121Jul 13, 2025Updated 8 months ago
- A Shared Nearest Neighbors clustering implementation. This code is basically a wrapper of sklearn DBSCAN, implementing the neighborhood s…☆16Jan 10, 2022Updated 4 years ago
- ☆13Jul 14, 2024Updated last year
- Scaling Preference Data Curation via Human-AI Synergy☆146Jul 3, 2025Updated 9 months ago
- Data and codes for MetroGAN☆16Dec 23, 2024Updated last year
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆97Aug 20, 2024Updated last year
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated last year
- Sci. Rep. 2025 | Revisiting model scaling with a U-net benchmark for 3D medical image segmentation☆18Aug 21, 2025Updated 7 months ago
- Code for ICLR 2025 Paper "GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment"☆21Feb 10, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Using Large Language Models for Hyperparameter Optimization☆28May 13, 2024Updated last year
- This code accompanies the paper "Bayesian Framework for Information-Theoretic Probing" published in EMNLP 2021.☆10Aug 23, 2021Updated 4 years ago
- [NeurIPS 2023 AI4Science] "A Transformer Model for Symbolic Regression towards Scientific Discovery"☆17Dec 16, 2023Updated 2 years ago
- A Pytorch implement of paper "Anomaly detection in dynamic graphs via transformer" (TADDY).☆60Nov 17, 2021Updated 4 years ago
- Container-free RL framework for training software engineering agents☆50Mar 4, 2026Updated last month
- DMALab's reading group slides and papers.☆16Jun 8, 2021Updated 4 years ago
- [EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent☆34Aug 20, 2025Updated 7 months ago
- Data for EMNLP 2022 paper "arXivEdits: Understanding the Human Revision Process in Scientific Writing".☆14Sep 30, 2023Updated 2 years ago
- Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving"☆23Sep 1, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆49Sep 15, 2025Updated 6 months ago
- ☆13Jan 14, 2022Updated 4 years ago
- A First Look at Conventional Commits Classification☆13Nov 18, 2024Updated last year
- ☆38Jan 25, 2026Updated 2 months ago
- ☆20May 24, 2025Updated 10 months ago
- ☆88Aug 16, 2025Updated 7 months ago
- InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery☆1,269Mar 17, 2026Updated 3 weeks ago