lyogavin / train_your_own_soraView external linksLinks
☆204Mar 7, 2024Updated last year
Alternatives and similar repositories for train_your_own_sora
Users that are interested in train_your_own_sora are comparing it to the libraries listed below
Sorting:
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,917Oct 30, 2025Updated 3 months ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆20Jan 26, 2025Updated last year
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆673Oct 25, 2024Updated last year
- ☆10Apr 24, 2024Updated last year
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆162Apr 7, 2024Updated last year
- NeurIPS 2024☆395Sep 26, 2024Updated last year
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆51Feb 13, 2025Updated last year
- Simple agent framework using Ollama tool calling☆10Aug 27, 2024Updated last year
- Official repository for "Regularization by Texts for Latent Diffusion Inverse Solvers" (ICLR2025 spotlight)☆17Mar 17, 2025Updated 10 months ago
- ☆13Feb 2, 2024Updated 2 years ago
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,627Mar 27, 2025Updated 10 months ago
- Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)☆481Oct 18, 2024Updated last year
- [TMM 2025] StableIdentity: Inserting Anybody into Anywhere at First Sight 🔥☆260Dec 26, 2024Updated last year
- Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]☆648Oct 29, 2024Updated last year
- This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.☆12,131Oct 29, 2025Updated 3 months ago
- Codes for ID-Specific Video Customized Diffusion☆462Feb 22, 2024Updated last year
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,251Feb 16, 2025Updated 11 months ago
- LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry☆45Oct 9, 2025Updated 4 months ago
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆65Oct 16, 2024Updated last year
- Query, ask and chat with a document-index via transformer models!☆17Jun 22, 2023Updated 2 years ago
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.☆191Aug 4, 2024Updated last year
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,876Jan 8, 2026Updated last month
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆3,159Dec 21, 2024Updated last year
- ☆468Feb 12, 2024Updated 2 years ago
- comfyui的InternVL2插件,InternVL2是当前不错的开源多模态大语言模型,在文档vqa上表现很好☆13Aug 10, 2024Updated last year
- [IEEE PCS 2022 best paper finalist] "FloLPIPS: A Bespoke Video Quality Metric for Frame Interpoation", Duolikun Danier, Fan Zhang, David …☆22Mar 9, 2024Updated last year
- ☆16Aug 16, 2025Updated 5 months ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…☆1,116Feb 7, 2025Updated last year
- ✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL☆1,113Jan 23, 2024Updated 2 years ago
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models☆947Nov 13, 2024Updated last year
- MiniSora: A community aims to explore the implementation path and future development direction of Sora.☆1,281Feb 18, 2025Updated 11 months ago
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,153Jan 10, 2025Updated last year
- Let's finetune video generation models!☆539Sep 15, 2025Updated 5 months ago
- Open-Sora: Democratizing Efficient Video Production for All☆28,510Apr 30, 2025Updated 9 months ago
- A Raspberry Pi Local AI Chatbot with a twist☆52Sep 3, 2025Updated 5 months ago
- implementation code for 'PLATE: A Prompt-Enhanced Paradigm for Multi-Scenario Recommendations' in SIGIR 2023☆13Sep 27, 2024Updated last year
- Detecting Lesion Bounding Ellipses With Gaussian Proposal Networks☆18Mar 4, 2019Updated 6 years ago
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆17Sep 3, 2024Updated last year