kvablack / LLaVA-serverLinks
☆22Updated last year
Alternatives and similar repositories for LLaVA-server
Users that are interested in LLaVA-server are comparing it to the libraries listed below
Sorting:
- ☆65Updated last month
- Reproduction of DDPO paper (RLHF for diffusion)☆90Updated 2 years ago
- Official implementation of the paper The Hidden Language of Diffusion Models☆74Updated last year
- JAX implementation ViT-VQGAN☆83Updated 3 years ago
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆83Updated 9 months ago
- ☆54Updated 11 months ago
- ☆31Updated last year
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆65Updated 3 years ago
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆85Updated last year
- Train vector quantized CLIP models using pytorch lightning☆20Updated last year
- Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py…☆85Updated 3 years ago
- ☆121Updated 7 months ago
- Fast and controllable text-to-image model.☆40Updated 2 years ago
- Iterable datapipelines for pytorch training.☆88Updated last year
- Official codebase for the Paper “Retrieval-Augmented Diffusion Models”☆136Updated 2 years ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction☆68Updated 3 months ago
- Training code for CLIP-FlanT5☆29Updated last year
- Simple large-scale training of stable diffusion with multi-node support.☆134Updated 2 years ago
- A Video Tokenizer Evaluation Dataset☆133Updated 8 months ago
- ☆50Updated last year
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆94Updated 9 months ago
- Retrieval augmented diffusion from CompVis.☆53Updated 3 years ago
- ☆73Updated 2 years ago
- ☆86Updated last year
- VQVAE for video prediction☆27Updated 3 years ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆82Updated last year
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆39Updated last year
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆34Updated last year
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆299Updated 10 months ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text…☆120Updated 2 years ago