deepseek-ai / DeepSeek-R1
☆88,609Updated 2 weeks ago
Alternatives and similar repositories for DeepSeek-R1:
Users that are interested in DeepSeek-R1 are comparing it to the libraries listed below
- ☆95,761Updated 2 weeks ago
- DeepSeek Coder: Let the Code Write Itself☆21,343Updated 11 months ago
- Janus-Series: Unified Multimodal Understanding and Generation Models☆17,140Updated 2 months ago
- Fully open reproduction of DeepSeek-R1☆24,020Updated this week
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.☆16,726Updated last month
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding☆4,740Updated last month
- DeepSeek LLM: Let there be answers☆6,316Updated last year
- Integrate the DeepSeek API into popular softwares☆31,802Updated last week
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆17,910Updated 3 weeks ago
- FlashMLA: Efficient MLA decoding kernels☆11,448Updated last month
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.☆137,814Updated this week
- Wan: Open and Advanced Large-Scale Video Generative Models☆10,234Updated this week
- No fortress, purely open ground. OpenManus is Coming.☆43,678Updated this week
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆13,611Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆90,546Updated this week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆49,576Updated this week
- Finetune Llama 4, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥☆37,364Updated this week
- Make websites accessible for AI agents☆56,568Updated this week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆43,054Updated this week
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆15,697Updated this week
- DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence☆5,634Updated 6 months ago
- DeepEP: an efficient expert-parallel communication library☆7,446Updated last week
- The AI Code Editor☆29,397Updated 6 months ago
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆74,545Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p…☆41,142Updated this week
- A Gradio web UI for Large Language Models with support for multiple inference backends.☆43,274Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆47,163Updated this week
- The Memory layer for AI Agents☆27,790Updated this week
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆9,844Updated last week
- Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, m…☆92,934Updated this week