deepseek-ai / Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
☆17,117 Updated 2 months ago
Alternatives and similar repositories for Janus:
Users who are interested in Janus are comparing it to the repositories listed below.
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆9,844 Updated last week
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding ☆4,713 Updated last month
- DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence ☆5,634 Updated 6 months ago
- DeepSeek Coder: Let the Code Write Itself ☆21,343 Updated 11 months ago
- ☆95,761 Updated last week
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud. ☆16,726 Updated last month
- ☆88,609 Updated last week
- DeepSeek LLM: Let there be answers ☆6,303 Updated last year
- DeepSeek-VL: Towards Real-World Vision-Language Understanding ☆3,777 Updated 11 months ago
- Fully open reproduction of DeepSeek-R1 ☆24,020 Updated this week
- ☆3,284 Updated last month
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. ☆17,910 Updated 3 weeks ago
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model ☆4,870 Updated 6 months ago
- Official inference repo for FLUX.1 models ☆21,357 Updated 2 months ago
- HunyuanVideo: A Systematic Framework For Large Video Generation Model ☆9,700 Updated this week
- Finetune Llama 4, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥 ☆37,183 Updated this week
- Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud. ☆4,832 Updated last month
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models ☆2,634 Updated last year
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆22,237 Updated 8 months ago
- FlashMLA: Efficient MLA decoding kernels ☆11,448 Updated last month
- Wan: Open and Advanced Large-Scale Video Generative Models ☆10,234 Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆45,116 Updated this week
- SGLang is a fast serving framework for large language models and vision language models. ☆13,368 Updated this week
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention ☆2,514 Updated last week
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer ☆3,998 Updated last week
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance. ☆7,627 Updated this week
- Integrate the DeepSeek API into popular software ☆31,632 Updated last week
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud. ☆5,782 Updated 8 months ago
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team. ☆19,626 Updated last month
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension. ☆6,564 Updated this week