QwenLM / Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
☆15,277Updated last week
Alternatives and similar repositories for Qwen2.5:
Users that are interested in Qwen2.5 are comparing it to the libraries listed below
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆16,825Updated 2 weeks ago
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆7,261Updated this week
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆4,736Updated 4 months ago
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆5,777Updated 3 weeks ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆40,309Updated this week
- Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥☆29,153Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆37,867Updated this week
- Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.☆4,442Updated this week
- Janus-Series: Unified Multimodal Understanding and Generation Models☆15,784Updated 2 weeks ago
- SGLang is a fast serving framework for large language models and vision language models.☆9,541Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆16,189Updated this week
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆18,430Updated this week
- DeepSeek Coder: Let the Code Write Itself☆19,418Updated 8 months ago
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆5,861Updated 3 weeks ago
- The official Meta Llama 3 GitHub site☆28,292Updated 2 weeks ago
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding☆3,592Updated last week
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆5,667Updated this week
- Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 15…☆5,418Updated this week
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆5,441Updated 6 months ago
- DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence☆5,062Updated 4 months ago
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆18,676Updated 4 months ago
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆5,563Updated this week
- A series of large language models trained from scratch by developers @01-ai☆7,815Updated 2 months ago
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆7,001Updated last month
- DeepSeek-VL: Towards Real-World Vision-Language Understanding☆3,425Updated 9 months ago