QwenLM / Qwen3
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
☆20,762Updated this week
Alternatives and similar repositories for Qwen3
Users that are interested in Qwen3 are comparing it to the libraries listed below
Sorting:
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆18,214Updated 2 weeks ago
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆10,262Updated last week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆48,565Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆8,313Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆46,848Updated this week
- Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥☆38,553Updated this week
- Janus-Series: Unified Multimodal Understanding and Generation Models☆17,232Updated 3 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆14,188Updated this week
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆6,542Updated 3 weeks ago
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆13,971Updated this week
- Fully open reproduction of DeepSeek-R1☆24,340Updated this week
- The official Meta Llama 3 GitHub site☆28,671Updated 3 months ago
- Retrieval and Retrieval-augmented LLMs☆9,610Updated last month
- Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.☆4,908Updated last month
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…☆7,802Updated this week
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆5,878Updated 9 months ago
- DeepSeek Coder: Let the Code Write Itself☆21,474Updated 11 months ago
- ☆96,547Updated last month
- Open-Sora: Democratizing Efficient Video Production for All☆26,401Updated 2 weeks ago
- Memory for AI Agents; SOTA in AI Agent Memory, beating OpenAI Memory in accuracy by 26% - https://mem0.ai/research☆29,071Updated this week
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.☆139,991Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆21,960Updated last month
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆25,078Updated last week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆22,462Updated 9 months ago
- ☆89,167Updated last month
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆4,887Updated 7 months ago
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) an…☆7,450Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆17,258Updated last week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆8,188Updated last week
- a state-of-the-art-level open visual language model | 多模态预训练模型☆6,531Updated 11 months ago