deepseek-ai / DeepSeek-LLM
DeepSeek LLM: Let there be answers
☆1,850Updated 11 months ago
Alternatives and similar repositories for DeepSeek-LLM:
Users that are interested in DeepSeek-LLM are comparing it to the libraries listed below
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆4,035Updated 3 months ago
- A lightweight framework for building LLM-based agents☆1,978Updated this week
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆5,169Updated this week
- An Open Large Reasoning Model for Real-World Solutions☆1,378Updated last month
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding☆797Updated this week
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models☆1,083Updated last year
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)☆4,158Updated this week
- ☆1,044Updated this week
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models☆954Updated 9 months ago
- GPT4V-level open-source multi-modal model based on Llama3-8B☆2,204Updated 4 months ago
- AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:☆1,885Updated 2 weeks ago
- SGLang is a fast serving framework for large language models and vision language models.☆7,353Updated this week
- DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence☆2,797Updated 3 months ago
- DeepSeek-VL: Towards Real-World Vision-Language Understanding☆2,306Updated 8 months ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,015Updated 3 weeks ago
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,383Updated last year
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆5,320Updated this week
- ☆1,137Updated last month
- A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI☆765Updated last year
- LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalabili…☆2,762Updated this week
- InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions☆2,709Updated 3 weeks ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆2,326Updated 2 months ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆5,315Updated 5 months ago
- Scalable RL solution for advanced reasoning of language models☆873Updated this week
- Data and tools for generating and inspecting OLMo pre-training data.☆1,060Updated this week
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆4,620Updated this week
- Large Reasoning Models☆787Updated last month
- Janus-Series: Unified Multimodal Understanding and Generation Models☆1,327Updated 2 months ago
- ☆902Updated 6 months ago
- YaRN: Efficient Context Window Extension of Large Language Models☆1,398Updated 9 months ago