kyegomez / SoraLinks
Implementation of the premier Text to Video model from OpenAI
☆56Updated 6 months ago
Alternatives and similar repositories for Sora
Users that are interested in Sora are comparing it to the libraries listed below
Sorting:
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆13Updated 6 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated 3 months ago
- Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group☆133Updated 8 months ago
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆37Updated 11 months ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆128Updated last year
- ☆24Updated last year
- ☆85Updated last year
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆85Updated last year
- Finetune any model on HF in less than 30 seconds☆58Updated 2 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 6 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆110Updated 3 months ago
- ☆75Updated 8 months ago
- Inference-time scaling of diffusion-based image and video generation models.☆144Updated 3 months ago
- An official implementation of SwapAnyone.☆62Updated 2 months ago
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆34Updated 11 months ago
- A light-weight and high-efficient training framework for accelerating diffusion tasks.☆48Updated 8 months ago
- Official PyTorch implementation of TokenSet.☆121Updated 2 months ago
- ☆60Updated last year
- A simple reproducible template to implement AI research papers☆24Updated 8 months ago
- ☆71Updated 7 months ago
- Simple large-scale training of stable diffusion with multi-node support.☆133Updated 2 years ago
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.☆180Updated 10 months ago
- Minimal Differentiable Image Reward Functions☆57Updated last month
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Updated 11 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆278Updated 2 months ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆70Updated 6 months ago
- Tools for content datamining and NLP at scale☆43Updated 11 months ago
- An attempt at a SVD inpainting pipeline☆49Updated last year
- ☆171Updated last year
- Video-LlaVA fine-tune for CinePile evaluation☆51Updated 9 months ago