deepseek-ai / DeepSeek-V3
☆19,214 Updated last week
Alternatives and similar repositories for DeepSeek-V3:
Users who are interested in DeepSeek-V3 are comparing it to the repositories listed below
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud. ☆11,800 Updated this week
- Open-Sora: Democratizing Efficient Video Production for All ☆23,110 Updated 3 weeks ago
- DeepSeek Coder: Let the Code Write Itself ☆10,177 Updated 7 months ago
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! ☆35,268 Updated this week
- Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… ☆15,910 Updated this week
- The official Meta Llama 3 GitHub site ☆27,957 Updated 5 months ago
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. ☆15,373 Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆33,809 Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆22,748 Updated this week
- Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud. ☆3,878 Updated this week
- Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory ☆20,611 Updated this week
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model ☆4,035 Updated 3 months ago
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚ ☆18,680 Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆21,705 Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ☆37,496 Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data. ☆38,057 Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆36,255 Updated this week
- Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accele… ☆6,946 Updated last week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆21,096 Updated 5 months ago
- A high-quality, one-stop open-source data extraction tool for converting PDF to Markdown and JSON. ☆24,558 Updated this week
- SOTA Open Source TTS ☆18,396 Updated this week
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone ☆13,445 Updated this week
- A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut… ☆37,558 Updated this week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) ☆5,719 Updated last week
- llama3 implementation one matrix multiplication at a time ☆14,030 Updated 7 months ago
- HunyuanVideo: A Systematic Framework For Large Video Generation Model ☆7,425 Updated this week
- A series of large language models trained from scratch by developers @01-ai ☆7,784 Updated last month
- A generative world for general-purpose robotics & embodied AI learning. ☆22,786 Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) ☆38,227 Updated this week
- Inference code for Llama models ☆57,227 Updated 5 months ago