idansc / discriminative_class_tokensView external linksLinks
☆41Mar 27, 2024Updated last year
Alternatives and similar repositories for discriminative_class_tokens
Users that are interested in discriminative_class_tokens are comparing it to the libraries listed below
Sorting:
- ☆15Apr 12, 2024Updated last year
- ☆41Sep 25, 2023Updated 2 years ago
- ☆31Jan 7, 2024Updated 2 years ago
- DiG-IN: Diffusion Guidance for Investigating Networks - Uncovering Classifier Differences, Neuron Visualisations, and Visual Counterfactu…☆10Oct 9, 2024Updated last year
- ☆72Oct 14, 2023Updated 2 years ago
- Official implementation of SGDiff (ACM MM '23)☆37Nov 26, 2023Updated 2 years ago
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"☆13Aug 6, 2024Updated last year
- ☆11Nov 30, 2025Updated 2 months ago
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis☆46Mar 5, 2024Updated last year
- ☆10Nov 21, 2023Updated 2 years ago
- Official Implementation of the paper: A Complete Recipe for Diffusion Generative Models☆31Nov 1, 2024Updated last year
- ☆24Sep 12, 2023Updated 2 years ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆109Nov 24, 2025Updated 2 months ago
- Implementation of P+: Extended Textual Conditioning in Text-to-Image Generation☆49Mar 26, 2023Updated 2 years ago
- ☆11Oct 2, 2023Updated 2 years ago
- ViT models pretrained with up to ~5k hours of human-like video data☆14Aug 10, 2023Updated 2 years ago
- Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting☆14Dec 19, 2025Updated last month
- [TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent…☆53Nov 29, 2024Updated last year
- [ACL 2025 🔥] Time Travel is a Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts☆18May 22, 2025Updated 8 months ago
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate provides☆16Jun 2, 2024Updated last year
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆33Aug 11, 2022Updated 3 years ago
- [TACL] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆16Nov 22, 2024Updated last year
- ☆16Feb 21, 2025Updated 11 months ago
- LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models☆67Aug 14, 2024Updated last year
- [ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".☆12Oct 11, 2024Updated last year
- ☆16Jun 14, 2024Updated last year
- ☆14Oct 16, 2023Updated 2 years ago
- This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Gener…☆17Jul 1, 2024Updated last year
- ☆17Jul 23, 2024Updated last year
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Jul 22, 2025Updated 6 months ago
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax☆18Nov 23, 2023Updated 2 years ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Aug 4, 2024Updated last year
- Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models☆21May 29, 2025Updated 8 months ago
- [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆113Apr 18, 2024Updated last year
- Official PyTorch implementation of "Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis…☆46Nov 2, 2023Updated 2 years ago
- Implementation of "SVDiff: Compact Parameter Space for Diffusion Fine-Tuning"☆384Jan 24, 2024Updated 2 years ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆311Nov 1, 2024Updated last year
- TraDiffusion: Trajectory-Based Training-Free Image Generation☆54Nov 10, 2024Updated last year
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆19Dec 27, 2024Updated last year