pschaldenbrand / Text2Video
A fast approach for translating a series of text prompts into a video (2022 NeurIPS Workshop on Machine Learning for Creativity and Design).
☆32 Updated 2 years ago
Alternatives and similar repositories for Text2Video
Users who are interested in Text2Video are comparing it to the libraries listed below:
- Controlling diffusion-based image generation with just a few strokes ☆64 Updated last year
- My implementation of "Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML ☆26 Updated last year
- Code and data for the paper "SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data" ☆34 Updated last year
- [NeurIPS 2022: Score-Based Modeling Workshop] Multiresolution Textual Inversion ☆99 Updated 2 years ago
- ☆65 Updated 2 months ago
- Retrieval augmented diffusion from CompVis. ☆53 Updated 3 years ago
- ☆28 Updated last year
- A tool for benchmarking image generation models. ☆33 Updated 2 years ago
- ☆73 Updated 2 years ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified … ☆69 Updated 8 months ago
- RS-IMLE ☆42 Updated 8 months ago
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing" ☆86 Updated last year
- CLIP Guided Diffusion ☆69 Updated last year
- Inverts CLIP text embeds to image embeds and visualizes with deep-image-prior. ☆35 Updated 3 years ago
- ☆24 Updated 2 years ago
- ☆14 Updated 2 years ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models. ☆49 Updated 6 months ago
- Repository with which to explore k-diffusion and diffusers, and within which changes to said packages may be tested. ☆53 Updated last year
- JAX implementation of ViT-VQGAN ☆83 Updated 2 years ago
- Official implementation of the paper "The Hidden Language of Diffusion Models" ☆74 Updated last year
- ☆46 Updated 2 weeks ago
- ☆16 Updated last year
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models" ☆14 Updated 9 months ago
- Implementation of Transframer, DeepMind's U-net + Transformer architecture for up to 30 seconds video generation, in PyTorch ☆72 Updated 3 years ago
- LoRA fine-tuned Stable Diffusion Deployment ☆31 Updated 2 years ago
- ☆26 Updated last year
- ☆29 Updated 2 years ago
- Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0. ☆87 Updated 2 years ago
- ☆32 Updated 9 months ago
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP ☆40 Updated 2 years ago