Qwen3.5 is the large language model series developed by Qwen team, Alibaba Cloud.
☆2,236Mar 2, 2026Updated 3 weeks ago
Alternatives and similar repositories for Qwen3.5
Users that are interested in Qwen3.5 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆18,753Jan 30, 2026Updated last month
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆16Mar 18, 2026Updated last week
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆35Nov 19, 2025Updated 4 months ago
- MiMo-VL☆629Aug 21, 2025Updated 7 months ago
- [CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding☆45Mar 16, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code release for "Generative Modeling of Weights: Generalization or Memorization?"☆19Mar 16, 2026Updated last week
- Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving stat…☆1,558Jun 14, 2025Updated 9 months ago
- Solve Visual Understanding with Reinforced VLMs☆5,872Mar 12, 2026Updated 2 weeks ago
- LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion☆76Mar 18, 2026Updated last week
- Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks☆3,958Updated this week
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆26,969Jan 9, 2026Updated 2 months ago
- Official Repo for Self-Forcing++ High Quality Long Video Generation☆245Oct 13, 2025Updated 5 months ago
- Open-source unified multimodal model☆5,761Oct 27, 2025Updated 5 months ago
- Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)☆702Sep 24, 2025Updated 6 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im…☆3,537Jan 8, 2026Updated 2 months ago
- Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities☆1,168Jul 15, 2025Updated 8 months ago
- [NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration☆115Dec 3, 2025Updated 3 months ago
- Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe…☆3,959Jun 12, 2025Updated 9 months ago
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆4,748Mar 10, 2026Updated 2 weeks ago
- Training Autoregressive Image Generation models via Reinforcement Learning☆51Nov 26, 2025Updated 4 months ago
- A fork to add multimodal model training to open-r1☆1,507Feb 8, 2025Updated last year
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 4 months ago
- Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex☆744Mar 19, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning☆143Aug 21, 2025Updated 7 months ago
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆39Nov 26, 2025Updated 4 months ago
- ☆4,607Sep 14, 2025Updated 6 months ago
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆9,904Sep 22, 2025Updated 6 months ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆20Sep 24, 2025Updated 6 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,940Aug 15, 2024Updated last year
- ☆1,122Feb 2, 2026Updated last month
- Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’☆2,317Oct 29, 2025Updated 4 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, …☆13,263Mar 20, 2026Updated last week
- Witness the aha moment of VLM with less than $3.☆4,041May 19, 2025Updated 10 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆21Dec 22, 2025Updated 3 months ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,762May 11, 2025Updated 10 months ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆68Mar 20, 2026Updated last week
- ☆18Jun 10, 2023Updated 2 years ago
- The repo for: TriHuman: A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis☆19Nov 15, 2025Updated 4 months ago