kyegomez / Sora
Implementation of the premier Text to Video model from OpenAI
☆57 Updated 4 months ago
Alternatives and similar repositories for Sora:
Users who are interested in Sora are comparing it to the repositories listed below:
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think! ☆78 Updated 3 weeks ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions ☆129 Updated last year
- Implementation of SmoothCache, a project aimed at speeding up Diffusion Transformer (DiT) based GenAI models with error-guided caching. ☆41 Updated last week
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing" ☆85 Updated last year
- Video-Infinity generates long videos quickly using multiple GPUs without extra training. ☆174 Updated 7 months ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified … ☆68 Updated 3 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models. ☆48 Updated last month
- Modern Stable Diffusion models family - Fluently ☆29 Updated 9 months ago
- Inference-time scaling of diffusion-based image and video generation models. ☆117 Updated 3 weeks ago
- PyTorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group ☆133 Updated 5 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models" ☆100 Updated 8 months ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models" ☆14 Updated 4 months ago
- ☆53 Updated 2 years ago
- Faster parallel inference of the mochi-1 video generation model ☆112 Updated last month
- [IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model ☆102 Updated this week
- An attempt at an SVD inpainting pipeline ☆51 Updated last year
- Official Implementation of weights2weights ☆140 Updated 3 weeks ago
- Scripts to teach Flux the task of image editing from language with the Flux Control framework. ☆60 Updated this week
- Distilling Diversity and Control in Diffusion Models ☆29 Updated 2 weeks ago
- ☆83 Updated 7 months ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators" ☆79 Updated 8 months ago
- Official PyTorch implementation of TokenSet. ☆88 Updated last week
- Let's try and finetune the OpenAI consistency decoder to work for SDXL ☆23 Updated last year
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO). ☆71 Updated 9 months ago
- Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research ☆51 Updated 2 months ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder ☆53 Updated 2 months ago
- Official Implementation of GrounDiT (NeurIPS 2024) ☆49 Updated 3 months ago
- Paint by Inpaint: Learning to Add Image Objects by Removing Them First ☆98 Updated last week
- Collection of scripts to build small-scale datasets for fine-tuning video generation models. ☆48 Updated last week
- ☆36 Updated 6 months ago