kyegomez / Sora
Implementation of the premier Text to Video model from OpenAI
☆57Updated 6 months ago
Alternatives and similar repositories for Sora
Users that are interested in Sora are comparing it to the libraries listed below
Sorting:
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.☆179Updated 9 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆109Updated 2 months ago
- Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D…☆35Updated 3 months ago
- ☆74Updated 7 months ago
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora☆41Updated last year
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆129Updated last year
- Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group☆132Updated 7 months ago
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆86Updated last year
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Updated 6 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated 3 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)☆168Updated 3 weeks ago
- An attempt at a SVD inpainting pipeline☆51Updated last year
- A simple reproducible template to implement AI research papers☆24Updated 8 months ago
- A light-weight and high-efficient training framework for accelerating diffusion tasks.☆47Updated 8 months ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆68Updated 5 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 5 months ago
- Distilling Diversity and Control in Diffusion Models☆39Updated 2 weeks ago
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆38Updated 10 months ago
- Simple large-scale training of stable diffusion with multi-node support.☆132Updated 2 years ago
- Scripts to teach Flux the task of image editing from language with the Flux Control framework.☆74Updated last month
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.☆43Updated last month
- ☆30Updated last year
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆158Updated last year
- Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research☆51Updated 3 months ago
- ☆25Updated 11 months ago
- A multi-modal AI Model that can generate high quality novel videos with text, images, or video clips.☆65Updated last year
- ☆83Updated 8 months ago
- Implementation of a framework for Genie2 in Pytorch☆146Updated 4 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆55Updated last month
- [ICLR 2025] Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆101Updated last year