MLSysOps / MLE-agent
MLE-Agent: Your intelligent companion for seamless AI engineering and research. Integrates with arXiv and Papers with Code to provide better code/research plans. OpenAI, Anthropic, Gemini, Ollama, etc. supported. Code RAG.
★1,288, updated 2 weeks ago
Alternatives and similar repositories for MLE-agent
Users who are interested in MLE-agent are comparing it to the repositories listed below.
- We introduced a new model designed for the code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4… (★844, updated 10 months ago)
- [ICLR 2025] Automated Design of Agentic Systems (★1,315, updated 4 months ago)
- AIDE: AI-Driven Exploration in the Space of Code. State-of-the-art machine learning engineering agent that automates AI R&D. (★917, updated last month)
- OO for LLMs (★779, updated last week)
- A reading list on LLM based Synthetic Data Generation (★1,287, updated 2 weeks ago)
- Reaching LLaMA2 Performance with 0.1M Dollars (★979, updated 10 months ago)
- Open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig… (★918, updated 4 months ago)
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app (★1,799, updated this week)
- ★1,785, updated last week
- Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL (★2,568, updated last week)
- Synthetic data curation for post-training and structured data extraction (★1,364, updated this week)
- Prompt optimization scratch (★746, updated last month)
- OpenResearcher, an advanced Scientific Research Assistant (★450, updated 7 months ago)
- System 2 Reasoning Link Collection (★835, updated 2 months ago)
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine (★461, updated 4 months ago)
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though (★547, updated last week)
- Deploy your agentic workflows to production (★2,010, updated this week)
- Windows Agent Arena (WAA) is a scalable OS platform for testing and benchmarking of multi-modal AI agents. (★701, updated last month)
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering (★728, updated 2 weeks ago)
- ★722, updated 2 weeks ago
- A collection of notebooks/recipes showcasing use cases of open-source models with Together AI. (★920, updated last week)
- The easiest way to deploy agents, models, RAG, pipelines and more. No MLOps. No YAML. (★3,176, updated this week)
- Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-api (★1,120, updated this week)
- Autonomous Agents (LLMs) research papers. Updated Daily. (★819, updated last week)
- ★2,952, updated 8 months ago
- A Self-adaptation Framework that adapts LLMs for unseen tasks in real-time! (★1,066, updated 4 months ago)
- ★437, updated 8 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you (★1,404, updated 5 months ago)
- Proof-of-concept prototype for generating and querying against an ever-expanding knowledge graph with AI (★900, updated last year)
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a… (★1,391, updated 4 months ago)