Official Codebase for "Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers"
☆26Jun 7, 2025Updated 11 months ago
Alternatives and similar repositories for SAVs
Users that are interested in SAVs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Oct 2, 2024Updated last year
- Official Implementation of "IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models"☆17Jun 5, 2025Updated 11 months ago
- Source-free Domain Generalization☆16Sep 24, 2024Updated last year
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- ☆13May 17, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13Apr 5, 2026Updated last month
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 3 years ago
- Welcome to the official repository of Emotion-Qwen.☆26Jun 10, 2025Updated 11 months ago
- An Autonomous Curriculum Reinforcement Learning framework that steers agents to continually learn in specific environments with zero huma…☆34May 13, 2026Updated last week
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 6 months ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations☆19Jan 19, 2025Updated last year
- Implementation for paper "Link Prediction on Heterophilic Graphs via Disentangled Representation Learning"☆13Aug 26, 2022Updated 3 years ago
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation"☆22Jun 5, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official implementation for paper "CEPrompt: Cross-Modal Emotion-Aware Prompting for Facial Expression Recognition" (accepted to IEEE TC…☆17Oct 20, 2025Updated 7 months ago
- CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning☆30Apr 10, 2026Updated last month
- A simple Computer Vision Framework, mainly based on PyTorch. Including distributed training, logging and so on.☆12Dec 2, 2023Updated 2 years ago
- LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning☆77May 23, 2025Updated last year
- Flash attention implementation Minimal CUDA implementation of Flash Attention with tiled computation and online softmax. Educational imp…☆21Dec 27, 2025Updated 4 months ago
- 河海大学每日健康打卡☆12Dec 4, 2021Updated 4 years ago
- ☆13Jan 1, 2018Updated 8 years ago
- A CNN + Sequence to Sequence model for detecting handwriting on air☆11May 25, 2017Updated 9 years ago
- Membership Inference Attack against Graph Neural Networks☆12Nov 9, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆13Feb 17, 2018Updated 8 years ago
- [ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance☆18Sep 11, 2024Updated last year
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆25Jul 30, 2025Updated 9 months ago
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆373Apr 20, 2025Updated last year
- The codes and data of paper "cST-ML: Continuous Spatial-Temporal Meta-Learning for Traffic Dynamics Prediction"☆10Aug 28, 2020Updated 5 years ago
- This is Pytorch implementation of our paper "LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition".☆11Sep 23, 2024Updated last year
- Nonparametric part-transfer for fine-grained recognition☆13May 13, 2014Updated 12 years ago
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…☆22May 16, 2026Updated last week
- Official repository for the Monte Carlo guided Diffusion for Bayesian linear inverse problems paper:☆21Dec 16, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code accompanying our NeurIPS 2020 traffic4cast challenge☆14Oct 4, 2021Updated 4 years ago
- ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs☆29Aug 15, 2025Updated 9 months ago
- Official implemention for Diffusion Models Are Innate One-Step Generators☆26Jun 25, 2025Updated 11 months ago
- ☆27Mar 3, 2025Updated last year
- ☆22May 4, 2023Updated 3 years ago
- ICML2024-ReconBoost: Boosting Can Achieve Modality Reconcilement☆30May 2, 2025Updated last year
- Evaluate robustness of adaptation methods on large vision-language models☆19Aug 23, 2023Updated 2 years ago