[AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?
☆25Aug 5, 2025Updated 6 months ago
Alternatives and similar repositories for DisCLIP
Users that are interested in DisCLIP are comparing it to the libraries listed below
Sorting:
- ☆13Jun 3, 2024Updated last year
- [WACV 2025] DistillDIFT: Distillation of Diffusion Features for Semantic Correspondence☆35Jul 10, 2025Updated 7 months ago
- ☆23Oct 15, 2024Updated last year
- ☆31Dec 8, 2023Updated 2 years ago
- ☆21Jun 3, 2023Updated 2 years ago
- WIP☆94Aug 13, 2024Updated last year
- ☆17Aug 7, 2024Updated last year
- Fine-Grained Subject-Specific Attribute Expression Control in T2I Models☆134Feb 27, 2025Updated last year
- Free-form flows are a generative model training a pair of neural networks via maximum likelihood☆50Jun 26, 2025Updated 8 months ago
- Source code for the paper "Improving Deep Metric Learning byDivide and Conquer"☆21Dec 10, 2021Updated 4 years ago
- Rough LLM Interpreter of ComfyUI☆28Jan 23, 2025Updated last year
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆62Dec 10, 2024Updated last year
- ☆107Oct 23, 2024Updated last year
- MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation☆27Mar 4, 2025Updated 11 months ago
- ☆171Jan 8, 2026Updated last month
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆24Nov 6, 2024Updated last year
- Modelling complex vector drawings with Stroke-Clouds☆27Apr 30, 2024Updated last year
- [ECCV 2024, Oral] FMBoost: Boosting Latent Diffusion with Flow Matching☆256Oct 17, 2025Updated 4 months ago
- Wildly unsound and experimental sampling for ComfyUI☆29Aug 9, 2025Updated 6 months ago
- Official model implementation and benchmark evaluation repository of <AnyEdit: Unified High-Quality Image Edit with Any Idea>☆31Jul 18, 2025Updated 7 months ago
- ☆123Oct 14, 2024Updated last year
- 🚀 A powerful library for efficient training of Neural Fields at scale.☆30Jan 22, 2024Updated 2 years ago
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆32Feb 26, 2025Updated last year
- An official implementation of CVPR 2023 "Self-Guided Diffusion Models"☆28Jun 2, 2023Updated 2 years ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- Janky implementation of DiffuseHigh for ComfyUI☆36May 6, 2025Updated 9 months ago
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specific…☆80Sep 13, 2024Updated last year
- DepthFM: Fast Monocular Depth Estimation with Flow Matching☆87May 22, 2024Updated last year
- ☆35Nov 25, 2025Updated 3 months ago
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆39Jun 20, 2024Updated last year
- ☆11Dec 23, 2024Updated last year
- ☆17Feb 4, 2026Updated 3 weeks ago
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- Concurrency library☆16Oct 13, 2024Updated last year
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆46Dec 1, 2024Updated last year
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆42Mar 11, 2025Updated 11 months ago
- [SCIS] MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images☆44Nov 19, 2025Updated 3 months ago
- Code release for "Weakly Supervised Open-Vocabulary Object Detection", AAAI2024☆35Sep 9, 2024Updated last year
- Models for packages and the resources they contain.☆14Mar 10, 2024Updated last year