☆219Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for train_your_own_sora
Users that are interested in train_your_own_sora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,943Oct 30, 2025Updated 7 months ago
- ☆10Apr 24, 2024Updated 2 years ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆14Nov 4, 2023Updated 2 years ago
- LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry☆54Oct 9, 2025Updated 8 months ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆21Jan 26, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- NeurIPS 2024☆396Sep 26, 2024Updated last year
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆692Oct 25, 2024Updated last year
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆51Feb 13, 2025Updated last year
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆36Jan 2, 2026Updated 5 months ago
- Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)☆485Oct 18, 2024Updated last year
- Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]☆654Oct 29, 2024Updated last year
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆163Apr 7, 2024Updated 2 years ago
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,631Mar 27, 2025Updated last year
- Papers and codes collection for customized, personalized and editable generative models☆28Oct 1, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,251Feb 16, 2025Updated last year
- [TMLR] Video Generation Models: A Survey of Post-Training and Alignment | 🔥 A continuously updated collection of papers, datasets, and b…☆159Jun 10, 2026Updated last week
- This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.☆12,160Mar 8, 2026Updated 3 months ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆3,193Dec 21, 2024Updated last year
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.☆191Aug 4, 2024Updated last year
- Let's finetune video generation models!☆547Sep 15, 2025Updated 9 months ago
- Codes for ID-Specific Video Customized Diffusion☆459Feb 22, 2024Updated 2 years ago
- ☆470Feb 12, 2024Updated 2 years ago
- An unofficial PyTorch implementation of "Learning Loss for Active Learning"☆10Apr 3, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This respository contains the code for the CVPR 2024 paper AVID: Any-Length Video Inpainting with Diffusion Model.☆177Feb 27, 2024Updated 2 years ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆81Apr 10, 2024Updated 2 years ago
- Official repository for "Regularization by Texts for Latent Diffusion Inverse Solvers" (ICLR2025 spotlight)☆18Mar 17, 2025Updated last year
- [TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators☆1,343Apr 14, 2026Updated 2 months ago
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models☆952Nov 13, 2024Updated last year
- ✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL☆1,113Jan 23, 2024Updated 2 years ago
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…☆1,130Feb 7, 2025Updated last year
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆434Nov 10, 2024Updated last year
- [CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models☆296May 17, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it☆142Jan 22, 2024Updated 2 years ago
- Open-Sora: Democratizing Efficient Video Production for All☆29,095Apr 9, 2026Updated 2 months ago
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection☆17Mar 23, 2025Updated last year
- [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.☆1,049Aug 21, 2024Updated last year
- [TMM 2025] StableIdentity: Inserting Anybody into Anywhere at First Sight 🔥☆260Dec 26, 2024Updated last year
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,953Jan 8, 2026Updated 5 months ago
- Stable Video Diffusion Training Code and Extensions.☆732Jul 25, 2024Updated last year