jsonBackup / Latent-CLIP-DemoLinks
☆14Updated 4 months ago
Alternatives and similar repositories for Latent-CLIP-Demo
Users that are interested in Latent-CLIP-Demo are comparing it to the libraries listed below
Sorting:
- ☆21Updated 7 months ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆18Updated 6 months ago
- Code for Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model☆25Updated 3 weeks ago
- [AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?☆17Updated 6 months ago
- RS-IMLE☆41Updated 7 months ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Updated 8 months ago
- A curated list of papers and resources for text-to-image evaluation.☆29Updated last year
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Updated 8 months ago
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆12Updated 4 months ago
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network☆100Updated 3 weeks ago
- ☆26Updated 9 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆34Updated last year
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis☆33Updated 5 months ago
- [CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"☆18Updated 4 months ago
- 👆Pytorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion"☆27Updated 8 months ago
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆30Updated 2 months ago
- [ECCV 2024] BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion☆21Updated last year
- The official repo of continuous speculative decoding☆27Updated 3 months ago
- [ICIP 2025] Scribble-Guided Diffusion for Training-free Text-to-Image Generation☆22Updated 9 months ago
- ☆21Updated 2 years ago
- Official Implementation of Nabla-GFlowNet (ICLR 2025)☆24Updated 2 months ago
- Official code release for the paper Trapped in texture bias? A large scale comparison of deep instance segmentation, accepted at ECCV 202…☆16Updated last year
- Official implementation of the WACV 2025 paper "3D Part Segmentation via Geometric Aggregation of 2D Visual Features"☆19Updated last month
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆18Updated last year
- Code of our paper "A Unified Agentic Framework for Evaluating Conditional Image Generation".☆25Updated 3 months ago
- [CVPR 2024] VidToMe: Video Token Merging for Zero-Shot Video Editing☆19Updated last year
- ☆15Updated last year
- Official repo for StyleMe3D☆23Updated 2 months ago
- ☆70Updated 8 months ago
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge"☆31Updated 3 months ago