MiniMax-AI / MiniMax-01
☆1,973Updated last week
Alternatives and similar repositories for MiniMax-01:
Users that are interested in MiniMax-01 are comparing it to the libraries listed below
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding☆1,005Updated last week
- An Open Large Reasoning Model for Real-World Solutions☆1,410Updated 2 months ago
- Next-Token Prediction is All You Need☆1,976Updated 3 months ago
- LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning☆1,768Updated last week
- ☆1,150Updated 2 months ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆801Updated last week
- veRL: Volcano Engine Reinforcement Learning for LLM☆1,135Updated this week
- Sky-T1: Train your own O1 preview model within $450☆2,214Updated this week
- ☆1,374Updated last month
- Codebase for Aria - an Open Multimodal Native MoE☆978Updated last week
- ☆3,316Updated 3 months ago
- Large Reasoning Models☆801Updated last month
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models☆1,126Updated last year
- Scalable RL solution for advanced reasoning of language models☆974Updated this week
- Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆4,328Updated this week
- ✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction☆2,011Updated last week
- Training Large Language Model to Reason in a Continuous Latent Space☆735Updated this week
- VideoSys: An easy and efficient system for video generation☆1,891Updated 3 weeks ago
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆1,913Updated 6 months ago
- DataComp for Language Models☆1,209Updated last month
- DeepSeek-VL: Towards Real-World Vision-Language Understanding☆2,556Updated 9 months ago
- Recipes to scale inference-time compute of open models☆971Updated last week
- prime is a framework for efficient, globally distributed training of AI models over the internet.☆626Updated this week
- Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.☆875Updated this week
- [NeurIPS'24 Spotlight, ICLR'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which r…☆890Updated this week
- An open-sourced end-to-end VLM-based GUI Agent☆628Updated last week