MiroTrain is an efficient and algorithm-first framework research agent.
☆137Aug 27, 2025Updated 6 months ago
Alternatives and similar repositories for MiroTrain
Users that are interested in MiroTrain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆240Aug 27, 2025Updated 6 months ago
- MiroMind-M1 is a fully open-source series of reasoning language models built on Qwen-2.5, focused on advancing mathematical reasoning.☆266Aug 12, 2025Updated 7 months ago
- The official implementation of ADDP (ICLR 2024)☆12Mar 27, 2024Updated last year
- 本插件包含一些有趣的Word小工具,如规划Pre时间、提取Word中图片的原图、便捷的API翻译和GPT for Word。☆11Mar 13, 2025Updated last year
- Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library☆50Aug 20, 2025Updated 7 months ago
- DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation.☆130Feb 10, 2026Updated last month
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆115Jul 9, 2025Updated 8 months ago
- Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent w…☆101Sep 8, 2025Updated 6 months ago
- ☆31Jun 29, 2022Updated 3 years ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆50Mar 2, 2026Updated 3 weeks ago
- LLM智能路由网关、 Enterprise Intelligent AI-API Distribution Gateway☆13Jan 24, 2025Updated last year
- The official code repository for the FullFront benchmark☆27May 16, 2025Updated 10 months ago
- [ICLR2025] γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models☆43Oct 28, 2025Updated 4 months ago
- [CVPR 2023]Implementation of Siamese Image Modeling for Self-Supervised Vision Representation Learning☆41Jun 6, 2024Updated last year
- Official Implementation of "Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach"☆32Updated this week
- ☆32Sep 19, 2025Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆39Jan 26, 2025Updated last year
- [ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding☆58Dec 13, 2024Updated last year
- Fine-tune of Florence-2 for shot categorization.☆26Mar 6, 2025Updated last year
- This project provides the source code for “Collaborative Unsupervised Domain Adaptation for Medical Image Diagnosis (IEEE TIP 2020)”.☆11Jun 30, 2021Updated 4 years ago
- The dataset used in COVID-DA: Deep Domain Adaptation from Typical Pneumonia to COVID-19☆12Nov 22, 2022Updated 3 years ago
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training☆104Jul 18, 2025Updated 8 months ago
- CoCo: CoCo as CoT for Text-to-Image Preview and Rare Concept Generation☆49Mar 10, 2026Updated 2 weeks ago
- A research project exploring fine-tuning BERT-style models for text generation☆39Nov 30, 2025Updated 3 months ago
- Environments, tools, and benchmarks for general computer agents☆14Dec 3, 2024Updated last year
- Code for the MTEB leaderboard☆30Feb 4, 2025Updated last year
- InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencie…☆419Aug 21, 2025Updated 7 months ago
- [ICLR 2026 Blogpost Track Poster] JustRL: Scaling a 1.5B LLM with a Simple RL Recipe☆257Mar 11, 2026Updated last week
- FreeVA: Offline MLLM as Training-Free Video Assistant☆69Jun 9, 2024Updated last year
- (ICLR 2025 spotlight) "Poison-splat: Computation Cost Attack on 3D Gaussian Splatting"☆78Feb 13, 2025Updated last year
- [ICLR2026] "Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models"☆30Feb 4, 2026Updated last month
- ☆17Dec 11, 2024Updated last year
- ☆14Mar 10, 2020Updated 6 years ago
- ☆16Jul 23, 2024Updated last year
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆35Jun 7, 2024Updated last year
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆73Sep 8, 2025Updated 6 months ago
- init☆11May 25, 2025Updated 9 months ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- ☆16Jul 17, 2025Updated 8 months ago