InternLM/Intern-S1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/InternLM/Intern-S1)

InternLM / Intern-S1

A Scientific Multimodal Foundation Model

☆838

Alternatives and similar repositories for Intern-S1

Users that are interested in Intern-S1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

InternLM / POLAR
View on GitHub
Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.
☆166Sep 23, 2025Updated 10 months ago
OpenIXCLab / CODA
View on GitHub
CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning
☆37Aug 28, 2025Updated 10 months ago
ByteDance-Seed / Seed1.5-VL
View on GitHub
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving stat…
☆1,583Jun 14, 2025Updated last year
OpenEvaluation / VLMEvalKit
View on GitHub
☆23Apr 11, 2026Updated 3 months ago
InternScience / InternAgent
View on GitHub
InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery
☆1,379Jun 10, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
InternLM / InternBootcamp
View on GitHub
Official implement on InternBootCamp
☆348Updated this week
InternScience / Awesome-Scientific-Datasets-and-LLMs
View on GitHub
A curated collection of papers, datasets, and resources on Scientific Datasets and Large Language Models (LLMs)
☆457Oct 3, 2025Updated 9 months ago
zai-org / GLM-V
View on GitHub
GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
☆2,355Updated this week
ByteDance-Seed / seed-oss
View on GitHub
☆888Sep 15, 2025Updated 10 months ago
MoonshotAI / Kimi-VL
View on GitHub
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
☆1,206Jul 15, 2025Updated last year
MoonshotAI / Kimi-Linear
View on GitHub
☆1,481Nov 17, 2025Updated 8 months ago
PhoenixZ810 / OmniAlign-V
View on GitHub
Official Repository of ACL 2025 paper OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference
☆144Apr 2, 2026Updated 3 months ago
InternLM / xtuner
View on GitHub
A Next-Generation Training Engine Built for Ultra-Large MoE Models
☆5,163Updated this week
InternLM / ARC-VL
View on GitHub
[CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"
☆46Nov 26, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
open-compass / VLMEvalKit
View on GitHub
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
☆4,295Updated this week
XiaomiMiMo / MiMo-VL
View on GitHub
MiMo-VL
☆642Aug 21, 2025Updated 11 months ago
InternLM / OREAL
View on GitHub
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
☆190Mar 20, 2025Updated last year
InternScience / SciEvalKit
View on GitHub
A unified evaluation toolkit and leaderboard for rigorously assessing the scientific intelligence of large language and vision–language m…
☆85Jun 17, 2026Updated last month
InternScience / SGI-Bench
View on GitHub
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
☆167Jun 2, 2026Updated last month
OpenGVLab / InternVL-U
View on GitHub
InternVL-U is a 4B-parameter unified multimodal model (UMM) that brings multimodal understanding, reasoning, image generation, image edit…
☆291Mar 21, 2026Updated 4 months ago
InternLM / Agent-FLAN
View on GitHub
[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
☆361Mar 22, 2024Updated 2 years ago
ByteDance-Seed / Bagel
View on GitHub
Open-source unified multimodal model
☆6,109May 4, 2026Updated 2 months ago
OpenGVLab / InternVL
View on GitHub
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
☆10,099Sep 22, 2025Updated 10 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
baaivision / Emu3.5
View on GitHub
Native Multimodal Models are World Learners
☆1,537Dec 30, 2025Updated 6 months ago
InternLM / InternLM-XComposer
View on GitHub
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
☆2,921May 26, 2025Updated last year
ByteDance-Seed / Seed-Thinking-v1.5
View on GitHub
☆811Jun 9, 2025Updated last year
PhoenixZ810 / RISEBench
View on GitHub
[NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing
☆155May 18, 2026Updated 2 months ago
open-compass / TextEdit
View on GitHub
We provide TextEdit, a high-quality, multi-scenario text editing benchmark for generation models.
☆20Mar 16, 2026Updated 4 months ago
Alibaba-NLP / DeepResearch
View on GitHub
Tongyi Deep Research, the Leading Open-source Deep Research Agent
☆19,705Feb 27, 2026Updated 4 months ago
inclusionAI / Ming
View on GitHub
Ming - facilitating advanced multimodal understanding and generation capabilities built upon the Ling LLM.
☆664Mar 17, 2026Updated 4 months ago
ByteDance-Seed / Seed-1.8
View on GitHub
☆219Dec 19, 2025Updated 7 months ago
open-mmlab / mmeval
View on GitHub
A unified evaluation library for multiple machine learning libraries
☆269Mar 29, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
QwenLM / Qwen3-Omni
View on GitHub
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im…
☆3,903Apr 23, 2026Updated 3 months ago
InternLM / InternLM
View on GitHub
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
☆7,247Oct 30, 2025Updated 8 months ago
studio-dots-ai / dots.vlm1
View on GitHub
The official repository of the dots.vlm1 instruct models proposed by rednote-hilab.
☆288Sep 26, 2025Updated 9 months ago
Mini-o3 / Mini-o3
View on GitHub
Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"
☆422Jan 29, 2026Updated 5 months ago
opendatalab / REST
View on GitHub
☆34Jul 15, 2025Updated last year
open-compass / opencompass
View on GitHub
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …
☆7,230Updated this week
yfzhang114 / Thyme
View on GitHub
✨✨ [ICLR 2026] Think Beyond Images
☆583Sep 23, 2025Updated 10 months ago