MiniMax-AI / MiniMax-M2.1
MiniMax M2.1, a SOTA model for real-world dev & agents.
☆491 Updated last week
Alternatives and similar repositories for MiniMax-M2.1
Users who are interested in MiniMax-M2.1 are comparing it to the repositories listed below
- ☆865 Updated 4 months ago
- Fast, Sharp & Reliable Agentic Intelligence ☆492 Updated this week
- WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups ov… ☆597 Updated 3 weeks ago
- Moonshot's most powerful model ☆293 Updated last week
- ☆1,283 Updated 2 months ago
- This is the official repo for the paper "LongCat-Flash-Omni Technical Report" ☆470 Updated 2 weeks ago
- OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language. ☆631 Updated 3 months ago
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression" ☆558 Updated 3 months ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed. ☆738 Updated 8 months ago
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B ☆569 Updated 2 months ago
- Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support… ☆800 Updated 3 months ago
- ☆1,306 Updated last week
- A reproduction of the Deepseek-OCR model including training ☆209 Updated 2 months ago
- Tencent Hunyuan A13B (short as Hunyuan-A13B), an innovative and open-source LLM built on a fine-grained MoE architecture. ☆812 Updated 6 months ago
- MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model ☆1,041 Updated 3 weeks ago
- MiMo-Audio: Audio Language Models are Few-Shot Learners ☆965 Updated 4 months ago
- Official implementation of "Continuous Autoregressive Language Models" ☆726 Updated 2 months ago
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation. ☆833 Updated last month
- A Scientific Multimodal Foundation Model ☆629 Updated 4 months ago
- GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters ☆724 Updated last month
- AudioStory: Generating Long-Form Narrative Audio with Large Language Models ☆301 Updated 4 months ago
- The official repository of the dots.llm1 base and instruct models proposed by rednote-hilab. ☆487 Updated 5 months ago
- Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its… ☆378 Updated 2 weeks ago
- OpenTinker is an RL-as-a-Service infrastructure for foundation models ☆618 Updated last week
- Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud. ☆411 Updated 3 months ago
- ☆1,300 Updated this week
- PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning ☆296 Updated 3 weeks ago
- MiniMax-M2, a model built for Max coding & agentic workflows. ☆2,338 Updated 2 months ago
- ☆507 Updated last week
- GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation. ☆734 Updated this week