InternLM / Intern-S1Links
A Scientific Multimodal Foundation Model
☆629Updated 4 months ago
Alternatives and similar repositories for Intern-S1
Users that are interested in Intern-S1 are comparing it to the libraries listed below
Sorting:
- ☆491Updated last month
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆746Updated last month
- Fully Open Framework for Democratized Multimodal Training☆703Updated last month
- OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.☆349Updated 8 months ago
- 🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal rei…☆194Updated last month
- MiMo-VL☆622Updated 5 months ago
- [NeurIPS 2025] Thinkless: LLM Learns When to Think☆250Updated 4 months ago
- ☆865Updated 4 months ago
- PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning☆292Updated 2 weeks ago
- Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization☆349Updated 3 weeks ago
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"☆553Updated 2 months ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆384Updated 5 months ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆251Updated 2 months ago
- The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning☆330Updated 8 months ago
- Survey and paper list on efficiency-guided LLM agents (memory, tool learning, planning).☆122Updated this week
- Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its…☆378Updated last week
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆546Updated 2 months ago
- A reproduction of the Deepseek-OCR model including training☆206Updated 2 months ago
- Step-DeepResearch☆479Updated last week
- WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups ov…☆588Updated 2 weeks ago
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆570Updated 4 months ago
- ☆517Updated last month
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆532Updated 4 months ago
- GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr…☆318Updated 2 months ago
- Explore the Multimodal “Aha Moment” on 2B Model☆622Updated 10 months ago
- An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"☆178Updated last month
- MiroTrain is an efficient and algorithm-first framework research agent.☆132Updated 5 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆875Updated 6 months ago
- Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"☆394Updated 4 months ago
- Towards a Unified View of Large Language Model Post-Training☆199Updated 4 months ago