kyegomez / Sora
Implementation of the premier Text to Video model from OpenAI
☆57Updated 7 months ago
Alternatives and similar repositories for Sora
Users who are interested in Sora are comparing it to the libraries listed below
- [ICLR 2025] Official PyTorch implementation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆103Updated last year
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated 4 months ago
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices☆116Updated 4 months ago
- Distilling Diversity and Control in Diffusion Models☆41Updated last month
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.☆44Updated 3 months ago
- Faster parallel inference of the mochi-1 video generation model☆121Updated 4 months ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆129Updated last year
- Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research☆51Updated 5 months ago
- Fine-tune of Florence-2 for shot categorization.☆24Updated 3 months ago
- ☆24Updated last year
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆86Updated last year
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆113Updated 3 months ago
- Official code for infimm-hd☆16Updated 9 months ago
- An official implementation of SwapAnyone.☆62Updated 3 months ago
- imagetokenizer is a Python package that helps you encode visuals and generate visual token IDs from a codebook; supports both image and video…☆34Updated last year
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆15Updated 7 months ago
- Official PyTorch implementation of TokenSet.☆121Updated 3 months ago
- A minimalistic, hackable code base to finetune Wan video generation model☆40Updated 2 months ago
- A lightweight, highly efficient training framework for accelerating diffusion tasks.☆48Updated 9 months ago
- Simple large-scale training of stable diffusion with multi-node support.☆133Updated 2 years ago
- PyTorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group☆133Updated 8 months ago
- An attempt at an SVD inpainting pipeline☆50Updated last year
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 7 months ago
- Modern Stable Diffusion models family - Fluently☆32Updated last year
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆47Updated 3 months ago
- The official implementation of Latte: Latent Diffusion Transformer for Video Generation.☆33Updated 4 months ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆69Updated 6 months ago
- [AAAI 2025] Official PyTorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆158Updated last year
- [IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model☆105Updated 3 months ago
- Scripts to teach Flux the task of image editing from language with the Flux Control framework.☆87Updated 3 months ago