hpcaitech / Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
☆26,355Updated this week
Alternatives and similar repositories for Open-Sora:
Users who are interested in Open-Sora are comparing it to the libraries listed below.
- This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.☆11,950Updated last month
- Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆11,329Updated last month
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆18,149Updated last week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆22,354Updated 8 months ago
- Generative Models by Stability AI☆25,802Updated last month
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,281Updated 7 months ago
- Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥☆37,861Updated this week
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆19,346Updated 2 months ago
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆19,634Updated this week
- Generate high-definition short videos with one click using large AI models.☆26,626Updated last week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆43,502Updated this week
- SOTA Open Source TTS☆20,921Updated 3 weeks ago
- Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)☆25,657Updated 8 months ago
- Making large AI models cheaper, faster and more accessible☆40,842Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆38,502Updated 3 weeks ago
- Official inference repo for FLUX.1 models☆21,541Updated 2 months ago
- Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,077Updated 3 months ago
- A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut…☆44,034Updated this week
- The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.☆75,825Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆38,206Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆46,456Updated this week
- Latest Advances on Multimodal Large Language Models☆14,909Updated last week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆92,548Updated this week
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!☆37,822Updated this week
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,585Updated 9 months ago
- Enjoy the magic of Diffusion models!☆8,509Updated this week
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆5,842Updated 8 months ago
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.☆139,442Updated this week
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.☆7,966Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆24,899Updated this week