☆214Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for train_your_own_sora
Users that are interested in train_your_own_sora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,931Oct 30, 2025Updated 5 months ago
- ☆10Apr 24, 2024Updated last year
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆14Nov 4, 2023Updated 2 years ago
- LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry☆49Oct 9, 2025Updated 6 months ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆21Jan 26, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- NeurIPS 2024☆397Sep 26, 2024Updated last year
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆685Oct 25, 2024Updated last year
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆51Feb 13, 2025Updated last year
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆35Jan 2, 2026Updated 3 months ago
- Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)☆484Oct 18, 2024Updated last year
- Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]☆650Oct 29, 2024Updated last year
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆163Apr 7, 2024Updated 2 years ago
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,631Mar 27, 2025Updated last year
- Papers and codes collection for customized, personalized and editable generative models☆28Oct 1, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,252Feb 16, 2025Updated last year
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆3,182Dec 21, 2024Updated last year
- This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.☆12,154Mar 8, 2026Updated last month
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.☆191Aug 4, 2024Updated last year
- Let's finetune video generation models!☆547Sep 15, 2025Updated 7 months ago
- ☆81May 14, 2025Updated 11 months ago
- Official implementation of Tabular Transfer Learning via Prompting LLMs (COLM 2024).☆13Aug 6, 2024Updated last year
- ☆470Feb 12, 2024Updated 2 years ago
- Codes for ID-Specific Video Customized Diffusion☆461Feb 22, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An unofficial PyTorch implementation of "Learning Loss for Active Learning"☆10Apr 3, 2022Updated 4 years ago
- This respository contains the code for the CVPR 2024 paper AVID: Any-Length Video Inpainting with Diffusion Model.☆177Feb 27, 2024Updated 2 years ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆81Apr 10, 2024Updated 2 years ago
- Official repository for "Regularization by Texts for Latent Diffusion Inverse Solvers" (ICLR2025 spotlight)☆17Mar 17, 2025Updated last year
- [TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators☆1,344Mar 8, 2026Updated last month
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models☆952Nov 13, 2024Updated last year
- ✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL☆1,113Jan 23, 2024Updated 2 years ago
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…☆1,131Feb 7, 2025Updated last year
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆433Nov 10, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models☆296May 17, 2025Updated 11 months ago
- A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it☆143Jan 22, 2024Updated 2 years ago
- Open-Sora: Democratizing Efficient Video Production for All☆28,880Apr 9, 2026Updated last week
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection☆17Mar 23, 2025Updated last year
- [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.☆1,048Aug 21, 2024Updated last year
- [TMM 2025] StableIdentity: Inserting Anybody into Anywhere at First Sight 🔥☆260Dec 26, 2024Updated last year
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,910Jan 8, 2026Updated 3 months ago