PKU-YuanGroup / Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
☆11,247Updated this week
Related projects: ⓘ
- Open-Sora: Democratizing Efficient Video Production for All☆21,609Updated last month
- MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone☆11,907Updated this week
- Create Magic Story!☆5,787Updated last month
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆19,294Updated last month
- Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)☆30,812Updated this week
- MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.☆6,824Updated last week
- ☆7,075Updated last month
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆13,305Updated 2 weeks ago
- FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU neede…☆8,451Updated 5 months ago
- ModelScope: bring the notion of Model-as-a-Service to life.☆6,794Updated this week
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆8,881Updated last month
- Your image is almost there!☆7,207Updated last month
- a state-of-the-art-level open visual language model | 多模态预训练模型☆5,871Updated 3 months ago
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆7,267Updated this week
- Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.☆7,468Updated this week
- The official Meta Llama 3 GitHub site☆26,122Updated last month
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆5,491Updated last week
- Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory☆15,611Updated this week
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆10,850Updated 2 months ago
- An Autonomous LLM Agent for Complex Task Solving☆8,033Updated last month
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆10,156Updated last week
- A series of large language models trained from scratch by developers @01-ai☆7,598Updated last week
- Enjoy the magic of Diffusion models!☆6,349Updated this week
- High-speed Large Language Model Serving on PCs with Consumer-grade GPUs☆7,877Updated last week
- [CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model☆10,359Updated 2 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,498Updated 2 months ago
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation☆9,465Updated last month
- 利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.☆16,112Updated last month
- Generative Models by Stability AI☆24,064Updated 2 weeks ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,175Updated 4 months ago