guyyariv / vLMIG
This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Generation
☆17 · Updated last year
Alternatives and similar repositories for vLMIG
Users who are interested in vLMIG are comparing it to the repositories listed below.
- LVAS-Agent Code Base · ☆22 · Updated 9 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling. · ☆34 · Updated last year
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign) · ☆80 · Updated 9 months ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) · ☆45 · Updated 2 years ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner" · ☆72 · Updated 3 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation" · ☆65 · Updated last year
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective · ☆79 · Updated last year
- Train vector quantized CLIP models using pytorch lightning · ☆20 · Updated last year
- Code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models" · ☆48 · Updated last year
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models. · ☆19 · Updated last year
- UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model · ☆22 · Updated last year
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement" · ☆51 · Updated last year
- Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024] · ☆34 · Updated 2 years ago
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge" · ☆32 · Updated 2 months ago
- Explore how to get VQ-VAE models efficiently! · ☆67 · Updated 6 months ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models · ☆42 · Updated 10 months ago
- Official PyTorch Implementation for the "Recovering the Pre-Fine-Tuning Weights of Generative Models" paper (ICML 2024). · ☆85 · Updated 9 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model · ☆42 · Updated last year
- Official PyTorch implementation of the paper "Equivariant Image Modeling" (https://arxiv.org/abs/2503.18948) · ☆34 · Updated 6 months ago
- The official repo of continuous speculative decoding · ☆31 · Updated 10 months ago
- An official PyTorch implementation for CLIPPR · ☆30 · Updated 2 years ago
- The official implementation of our paper "CoRe^2: Collect, Reflect and Refine to Generate Better and Faster". · ☆30 · Updated 10 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr… · ☆79 · Updated last year
- [NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation · ☆74 · Updated 4 months ago