☆211Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for train_your_own_sora
Users that are interested in train_your_own_sora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,929Oct 30, 2025Updated 4 months ago
- ☆10Apr 24, 2024Updated last year
- LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry☆47Oct 9, 2025Updated 5 months ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆13Nov 4, 2023Updated 2 years ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆20Jan 26, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- NeurIPS 2024☆395Sep 26, 2024Updated last year
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆676Oct 25, 2024Updated last year
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆35Jan 2, 2026Updated 2 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆51Feb 13, 2025Updated last year
- Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)☆482Oct 18, 2024Updated last year
- Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]☆650Oct 29, 2024Updated last year
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆162Apr 7, 2024Updated last year
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,630Mar 27, 2025Updated last year
- Papers and codes collection for customized, personalized and editable generative models☆28Oct 1, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,253Feb 16, 2025Updated last year
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆3,174Dec 21, 2024Updated last year
- This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.☆12,149Mar 8, 2026Updated 2 weeks ago
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.☆191Aug 4, 2024Updated last year
- Let's finetune video generation models!☆547Sep 15, 2025Updated 6 months ago
- ☆470Feb 12, 2024Updated 2 years ago
- Official implementation of Tabular Transfer Learning via Prompting LLMs (COLM 2024).☆13Aug 6, 2024Updated last year
- Codes for ID-Specific Video Customized Diffusion☆462Feb 22, 2024Updated 2 years ago
- This respository contains the code for the CVPR 2024 paper AVID: Any-Length Video Inpainting with Diffusion Model.☆177Feb 27, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An unofficial PyTorch implementation of "Learning Loss for Active Learning"☆10Apr 3, 2022Updated 3 years ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆81Apr 10, 2024Updated last year
- End-to-end natural language to SQL system: schema-aware model fine-tuning, retrieval-augmented prompting, and production-grade CLI, power…☆22Aug 1, 2025Updated 7 months ago
- Official repository for "Regularization by Texts for Latent Diffusion Inverse Solvers" (ICLR2025 spotlight)☆17Mar 17, 2025Updated last year
- [TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators☆1,344Mar 8, 2026Updated 2 weeks ago
- ✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL☆1,113Jan 23, 2024Updated 2 years ago
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models☆951Nov 13, 2024Updated last year
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…☆1,131Feb 7, 2025Updated last year
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆434Nov 10, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models☆296May 17, 2025Updated 10 months ago
- A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it☆144Jan 22, 2024Updated 2 years ago
- Open-Sora: Democratizing Efficient Video Production for All☆28,728Apr 30, 2025Updated 10 months ago
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection☆16Mar 23, 2025Updated last year
- [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.☆1,050Aug 21, 2024Updated last year
- [TMM 2025] StableIdentity: Inserting Anybody into Anywhere at First Sight 🔥☆260Dec 26, 2024Updated last year
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,903Jan 8, 2026Updated 2 months ago