openai / consistency_models
Official repo for consistency models.
☆6,239 · Updated 10 months ago
Alternatives and similar repositories for consistency_models:
Users who are interested in consistency_models are comparing it to the repositories listed below:
- ImageBind: One Embedding Space to Bind Them All ☆8,504 · Updated 6 months ago
- An open-source framework for training large multimodal models. ☆3,816 · Updated 5 months ago
- Using Low-rank adaptation to quickly fine-tune diffusion models. ☆7,187 · Updated 10 months ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters ☆5,807 · Updated 11 months ago
- LAVIS - A One-stop Library for Language-Vision Intelligence ☆10,229 · Updated 2 months ago
- An open source implementation of CLIP. ☆10,958 · Updated last month
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆9,033 · Updated this week
- PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation ☆5,008 · Updated 6 months ago
- ☆7,742 · Updated 10 months ago
- PyTorch code and models for the DINOv2 self-supervised learning method. ☆9,788 · Updated 6 months ago
- Painter & SegGPT Series: Vision Foundation Models from BAAI ☆2,547 · Updated 2 months ago
- An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All. ☆8,345 · Updated this week
- [NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once" ☆4,479 · Updated 5 months ago
- ☆3,219 · Updated 9 months ago
- [CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS. ☆3,158 · Updated 3 weeks ago
- ☆6,521 · Updated 7 months ago
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" ☆11,275 · Updated last month
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" ☆6,802 · Updated 8 months ago
- Karras et al. (2022) diffusion models for PyTorch ☆2,387 · Updated last month
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion" ☆1,396 · Updated last year
- Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and … ☆15,712 · Updated 5 months ago
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine. ☆3,880 · Updated 6 months ago
- Taming Transformers for High-Resolution Image Synthesis ☆6,000 · Updated 6 months ago
- Open-Set Grounded Text-to-Image Generation ☆2,072 · Updated 11 months ago
- High-Resolution Image Synthesis with Latent Diffusion Models ☆12,304 · Updated 11 months ago
- Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated) ☆3,859 · Updated 8 months ago
- Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI… ☆6,606 · Updated 8 months ago
- [ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators ☆4,116 · Updated last year
- [ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation ☆4,293 · Updated last year
- Consistency Distilled Diff VAE ☆2,155 · Updated last year