AAPL: Adding Attributes to Prompt Learning for Vision-Language Models (CVPRw 2024)
☆34 · May 8, 2024 · Updated 2 years ago
Alternatives and similar repositories for AAPL
Users that are interested in AAPL are comparing it to the libraries listed below.
- HazeFlow: Revisit Haze Physical Model as ODE and Non-Homogeneous Haze Generation for Real-World Dehazing [ICCV 2025] ☆30 · Feb 9, 2026 · Updated 3 months ago
- The official implementation of paper Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model. If you find our code or paper use… ☆50 · Jul 18, 2023 · Updated 2 years ago
- [Pattern Recognition 2024] Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models, Dong Li, Jiandon… ☆18 · Jan 18, 2025 · Updated last year
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S… ☆38 · Jul 19, 2024 · Updated last year
- The official PyTorch implementation of our IJCV-2025 paper "Learning with Enriched Inductive Biases for Vision-Language Models". ☆15 · Mar 26, 2025 · Updated last year
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork" ☆33 · Nov 29, 2023 · Updated 2 years ago
- Neural network for creating distortion while keeping embeddings as close as possible ☆20 · Feb 6, 2024 · Updated 2 years ago
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs". ☆17 · Sep 15, 2024 · Updated last year
- AlignCLIP: Improving Cross-Modal Alignment in CLIP (ICLR 2025) ☆64 · Mar 1, 2025 · Updated last year
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? ☆19 · Jun 3, 2025 · Updated 11 months ago
- ☆37 · Feb 6, 2023 · Updated 3 years ago
- Packages whisper.cpp into pre-built, pip-installable wheels, for macOS and Linux. ☆179 · Jun 10, 2024 · Updated last year
- ☆32 · Sep 3, 2024 · Updated last year
- CLIPCleaner: Cleaning Noisy Labels with CLIP (ACM MM2024) ☆14 · Apr 28, 2025 · Updated last year
- A simple python package to stretch audio files and change their speed ☆12 · Feb 18, 2026 · Updated 2 months ago
- Official PyTorch implementation for "MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens… ☆48 · Jun 12, 2025 · Updated 10 months ago
- Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion ☆48 · Feb 21, 2025 · Updated last year
- ☆25 · Sep 19, 2023 · Updated 2 years ago
- ☆13 · Apr 13, 2026 · Updated 3 weeks ago
- Awesome Vision-Language Pretraining Papers ☆42 · Jan 15, 2025 · Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings. ☆90 · Jul 21, 2024 · Updated last year
- ☆10 · Jan 29, 2019 · Updated 7 years ago
- Multiple Transformation Function Estimation for Image Enhancement ☆22 · Oct 20, 2024 · Updated last year
- [ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models ☆81 · Mar 6, 2026 · Updated 2 months ago
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe… ☆43 · May 20, 2024 · Updated last year
- Simple program to manually caption your images (or any other file types) so you can use them for AI training ☆37 · Mar 20, 2023 · Updated 3 years ago
- Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20… ☆15 · Aug 12, 2024 · Updated last year
- ☆20 · Apr 23, 2024 · Updated 2 years ago
- Implementation of BadCLIP https://arxiv.org/pdf/2311.16194.pdf ☆24 · Mar 23, 2024 · Updated 2 years ago
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges ☆30 · Sep 24, 2023 · Updated 2 years ago
- multimodal change detection ☆48 · Sep 20, 2024 · Updated last year
- This is an official implementation in PyTorch of PTH-Net: Dynamic Facial Expression Recognition without Face Detection and Alignment. ☆14 · Jul 1, 2025 · Updated 10 months ago
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models ☆20 · Jul 17, 2024 · Updated last year
- [IJCV2025] https://arxiv.org/abs/2304.04521 ☆15 · Jan 22, 2025 · Updated last year
- [ICLR 2023] Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning ☆15 · Aug 2, 2023 · Updated 2 years ago
- The official implementation of Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion [AAAI'2… ☆16 · Feb 2, 2026 · Updated 3 months ago
- [MICCAI2023 Early Accept] Multi-scale Cross-restoration Framework for Electrocardiogram Anomaly Detection ☆68 · Nov 15, 2024 · Updated last year
- Wave Partial Differential Equation Solver in Python ☆14 · Jun 5, 2024 · Updated last year
- Guided Interpretable Facial Expression Recognition via Spatial Action Unit Cues ☆19 · Sep 16, 2024 · Updated last year