eric-ai-lab/Discffusion

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/eric-ai-lab/Discffusion)

eric-ai-lab / Discffusion

Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"

☆29

Alternatives and similar repositories for Discffusion

Users that are interested in Discffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

eric-ai-lab / ComCLIP
View on GitHub
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
☆38Aug 18, 2024Updated last year
uvavision / SyViC
View on GitHub
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13Sep 30, 2023Updated 2 years ago
arubique / OCCAM
View on GitHub
This is an implementation of the paper "Are We Done with Object-Centric Learning?"
☆12Apr 12, 2026Updated last month
VITA-Group / instant_soup
View on GitHub
[ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…
☆11Nov 28, 2023Updated 2 years ago
HanSolo9682 / CounterCurate
View on GitHub
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆19Jun 27, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
yahoojapan / srgd
View on GitHub
Official implementation of "Real-SRGD: Enhancing Real-World Image Super-Resolution with Classifier-Free Guided Diffusion" [ACCV2024]
☆19Dec 9, 2024Updated last year
camenduru / marigold-lcm-hf
View on GitHub
☆12Mar 25, 2024Updated 2 years ago
aiming-lab / ReAgent-V
View on GitHub
[NeurIPS'25] ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding
☆52Sep 21, 2025Updated 8 months ago
ugorsahin / Generative-Negative-Mining
View on GitHub
[WACV 2024] Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining, WACV 2024
☆13Jan 3, 2024Updated 2 years ago
BatsResearch / ex2
View on GitHub
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions
☆17Apr 4, 2024Updated 2 years ago
Nanne / ProtoSim
View on GitHub
Code and instructions accompanying ICCV'23 paper Protoype-based Dataset Comparison
☆18Dec 15, 2023Updated 2 years ago
filtir / awesome-AI-fact-checking
View on GitHub
A collection of papers tackling automatic fact-checking (particularly of AI-generated content)
☆13Nov 3, 2023Updated 2 years ago
SivanDoveh / TSVLC
View on GitHub
Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models
☆47Sep 25, 2023Updated 2 years ago
deeplearning-wisc / mllmshift-emi
View on GitHub
Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach"
☆12May 27, 2025Updated 11 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
ajd12342 / why-winoground-hard
View on GitHub
Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022
☆31May 29, 2023Updated 2 years ago
ttchengab / continuous_3d_words_code
View on GitHub
☆66Jun 27, 2024Updated last year
haoningwu3639 / SimpleSDM-Video
View on GitHub
A simple and flexible PyTorch implementation of Video StableDiffusion (ZeroScope_v2) based on diffusers.
☆20Feb 15, 2024Updated 2 years ago
eric-ai-lab / MMWorld
View on GitHub
Official repo of the ICLR 2025 paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"
☆28Jul 15, 2025Updated 10 months ago
umd-huang-lab / perceptionCLIP
View on GitHub
Code for our ICLR 2024 paper "PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts"
☆80May 5, 2024Updated 2 years ago
BatsResearch / csp
View on GitHub
Learning to compose soft prompts for compositional zero-shot learning.
☆95Sep 13, 2025Updated 8 months ago
ykarmesh / stable-control-representations
View on GitHub
Code for Stable Control Representations
☆26Apr 5, 2025Updated last year
ethanlshen / HierNet
View on GitHub
Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…
☆23Nov 8, 2023Updated 2 years ago
FactoDeepLearning / MultitaskVLFM
View on GitHub
☆25Aug 1, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hassanhub / R3Transformer
View on GitHub
Official python implementation of R3-Transformer
☆15Nov 30, 2020Updated 5 years ago
eric-ai-lab / CPL
View on GitHub
Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"
☆35Dec 5, 2022Updated 3 years ago
chingyaoc / debias_vl
View on GitHub
Code for Debiasing Vision-Language Models via Biased Prompts
☆60May 16, 2023Updated 3 years ago
yandex-research / vqdm
View on GitHub
Official repository for VQDM:Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization paper
☆34Sep 17, 2024Updated last year
cloneofsimo / repa-rf
View on GitHub
☆33Nov 4, 2024Updated last year
kdariina / CLIP-not-BoW-unimodally
View on GitHub
Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"
☆29Feb 27, 2026Updated 2 months ago
weathon / VSF
View on GitHub
Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip
☆38Jan 27, 2026Updated 3 months ago
locuslab / llava-token-compression
View on GitHub
☆47Nov 8, 2024Updated last year
ylqi / GL-RG
View on GitHub
The code of IJCAI22 paper "GL-RG: Global-Local Representation Granularity for Video Captioning".
☆18May 10, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
mihirp1998 / AlignProp
View on GitHub
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…
☆320Nov 1, 2024Updated last year
XmYx / tinyvae-flux
View on GitHub
☆33Aug 9, 2024Updated last year
mrwu-mac / ControlMLLM
View on GitHub
[NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'
☆209Jul 17, 2025Updated 10 months ago
arijitray1993 / COLA
View on GitHub
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆25May 14, 2026Updated last week
divyakraman / HawkI2024
View on GitHub
Codebase for the paper HawkI: HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View
☆13Jun 5, 2024Updated last year
NeuralSamurAI / ComfyUI-Dimensional-Latent-Perlin
View on GitHub
Create Latents with Perlin Noise in any shape (dimensionality). Works with Flux, SD3 and other 16d latent models.
☆33Aug 6, 2024Updated last year
sIncerass / MVLPT
View on GitHub
code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720
☆57Jun 5, 2024Updated last year