kyegomez / Sora
Implementation of the premier Text to Video model from OpenAI
☆57Updated 3 months ago
Alternatives and similar repositories for Sora:
Users who are interested in Sora are comparing it to the repositories listed below
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆84Updated last year
- Implementation of SmoothCache, a project aimed at speeding up Diffusion Transformer (DiT) based GenAI models with error-guided caching.☆38Updated 3 weeks ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆68Updated 2 months ago
- The official implementation of PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/☆110Updated last month
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Updated 3 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆44Updated 2 months ago
- A lightweight and highly efficient training framework for accelerating diffusion tasks.☆46Updated 5 months ago
- Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs☆74Updated 3 months ago
- Modern Stable Diffusion models family - Fluently☆29Updated 8 months ago
- ☆25Updated 8 months ago
- [ICLR 2025] Official PyTorch implementation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆99Updated 11 months ago
- ☆32Updated 3 months ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆129Updated last year
- Official Implementation of weights2weights☆138Updated 2 months ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆50Updated 3 weeks ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆40Updated 7 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆47Updated last week
- An attempt at an SVD inpainting pipeline☆51Updated last year
- ☆30Updated last year
- Simple large-scale training of stable diffusion with multi-node support.☆128Updated last year
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆68Updated 8 months ago
- Synthetic data generator for image, video and 3D models☆30Updated 6 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)☆157Updated 4 months ago
- Let's try and finetune the OpenAI consistency decoder to work for SDXL☆23Updated last year
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'☆102Updated 2 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆99Updated 7 months ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆77Updated 6 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆232Updated 6 months ago