MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model
☆1,328Jan 8, 2026Updated 5 months ago
Alternatives and similar repositories for MiMo-V2-Flash
Users that are interested in MiMo-V2-Flash are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining☆2,158Jun 5, 2025Updated last year
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- [ACL 2026 Findings, ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆120Apr 8, 2026Updated 2 months ago
- Github repo for ICLR-2025 paper, Fine-tuning Large Language Models with Sparse Matrices☆26Feb 2, 2026Updated 4 months ago
- ☆87Sep 25, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICML 2026 Spotlight] On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models☆153Updated this week
- Eureka-Audio: A 1.7B lightweight audio–language model that matches 7B–30B models on ASR, audio understanding, and paralinguistic reasonin…☆40Apr 11, 2026Updated 2 months ago
- [AAAI 2026 & ACL 2026] The official implementation of the DIFFA series for dLLM-based large audio language model☆81Apr 7, 2026Updated 2 months ago
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆43Oct 29, 2025Updated 7 months ago
- ASID-Caption: Attribute-Structured and Quality-Verified Audiovisual Instruction Dataset and Training Pipeline for Fine-Grained Video Unde…☆65Mar 3, 2026Updated 3 months ago
- Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025☆18Nov 24, 2024Updated last year
- ☆44Feb 26, 2026Updated 3 months ago
- mKernel: fast multi-node, multi-GPU fused kernels☆231Updated this week
- ICTNet: a novel network for semantic segmentation with the underlying architecture of a fully convolutional network, infused with feature…☆10May 27, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Quartet II Official Code☆74May 1, 2026Updated last month
- ☆812Jun 9, 2025Updated last year
- ☆71Dec 30, 2025Updated 5 months ago
- ☆14Sep 30, 2024Updated last year
- ☆50Nov 9, 2025Updated 7 months ago
- 🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation.☆81May 2, 2026Updated last month
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- 📚 Some notes and projects of courses in Shanghai Jiao Tong University☆15Jan 30, 2026Updated 4 months ago
- ☆71May 2, 2026Updated last month
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Website for CSE 234, Winter 2025☆15Mar 24, 2025Updated last year
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆21Dec 14, 2025Updated 6 months ago
- Self Evolving Large Multimodal Models with Continuous Rewards☆24Updated this week
- Holistic Evaluation of Multimodal LLMs on Spatial Intelligence☆112May 11, 2026Updated last month
- This is the official repo for the paper "LongCat-Flash-Omni Technical Report"☆492May 9, 2026Updated last month
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆45Nov 26, 2025Updated 6 months ago
- A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention☆298Dec 1, 2025Updated 6 months ago
- [ICML 2026] Transform Trained Transformer for Accelerating Native 4K Video Generation☆41Dec 16, 2025Updated 5 months ago
- Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…☆98Dec 27, 2025Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICML 2026] d3LLM: Ultra-Fast Diffusion LLM 🚀☆140May 1, 2026Updated last month
- ☆25Mar 17, 2026Updated 2 months ago
- Data recipes and robust infrastructure for training AI agents☆163Updated this week
- MiMo-Audio: Audio Language Models are Few-Shot Learners☆1,045Mar 3, 2026Updated 3 months ago
- [NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆42Apr 10, 2026Updated 2 months ago
- [ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models"☆22Jan 16, 2025Updated last year
- Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving stat…☆1,580Jun 14, 2025Updated last year