AAPL: Adding Attributes to Prompt Learning for Vision-Language Models (CVPRw 2024)
☆34May 8, 2024Updated 2 years ago
Alternatives and similar repositories for AAPL
Users that are interested in AAPL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- HazeFlow: Revisit Haze Physical Model as ODE and Non-Homogeneous Haze Generation for Real-World Dehazing [ICCV 2025]☆32Feb 9, 2026Updated 3 months ago
- The official implementation of paper Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model. If you find our code or paper use…☆50Jul 18, 2023Updated 2 years ago
- ☆21Nov 9, 2025Updated 6 months ago
- [Pattern Recognition 2024] Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models, Dong Li, Jiandon…☆18Jan 18, 2025Updated last year
- ☆17Oct 1, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…☆38Jul 19, 2024Updated last year
- [ICLR 2026] Official code of "Segment any Events with Language"☆48Apr 10, 2026Updated last month
- The official pytorch implemention of our IJCV-2025 paper "Learning with Enriched Inductive Biases for Vision-Language Models".☆15Mar 26, 2025Updated last year
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Nov 29, 2023Updated 2 years ago
- AlignCLIP: Improving Cross-Modal Alignment in CLIP (ICLR 2025)☆64Mar 1, 2025Updated last year
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆19Jun 3, 2025Updated 11 months ago
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers☆114Sep 13, 2024Updated last year
- VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models☆36Apr 9, 2025Updated last year
- ☆32Sep 3, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…☆45Dec 6, 2025Updated 5 months ago
- Official PyTorch implementation for "MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens…☆48Jun 12, 2025Updated 11 months ago
- Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion☆48Feb 21, 2025Updated last year
- ☆25Sep 19, 2023Updated 2 years ago
- ☆13Apr 13, 2026Updated last month
- Source code to reproduce results from Panoptic Swiftnet paper.☆16Oct 18, 2022Updated 3 years ago
- Awesome Vision-Language Pretraining Papers☆42Jan 15, 2025Updated last year
- This is the repository for "SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Recognition"☆16Oct 8, 2024Updated last year
- ☆10Jan 29, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Multiple Transformation Function Estimation for Image Enhancement☆22Oct 20, 2024Updated last year
- [ICCV 2025] SALAD -- Semantics-Aware Logical Anomaly Detection☆45Oct 3, 2025Updated 7 months ago
- [ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models☆82Mar 6, 2026Updated 2 months ago
- Simple program to manually caption your images (or any other file types) so you can use them for AI training☆37Mar 20, 2023Updated 3 years ago
- multimodal change detection☆48Sep 20, 2024Updated last year
- Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20…☆15Aug 12, 2024Updated last year
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models☆20Jul 17, 2024Updated last year
- Browser automation for creating new pages in WordPress☆13Jun 7, 2025Updated 11 months ago
- [IJCV2025] https://arxiv.org/abs/2304.04521☆15Jan 22, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A collection of deep semi-supervised learning resources☆13Jul 1, 2022Updated 3 years ago
- [ICLR 2023] Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning☆15Aug 2, 2023Updated 2 years ago
- LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks☆56Mar 7, 2026Updated 2 months ago
- The official implementation of Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion [AAAI'2…☆16Feb 2, 2026Updated 3 months ago
- [MICCAI2023 Early Accept] Multi-scale Cross-restoration Framework for Electrocardiogram Anomaly Detection☆68Nov 15, 2024Updated last year
- [IJCV 2025] MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning☆78May 30, 2025Updated 11 months ago
- Guided Interpretable Facial Expression Recognition via Spatial Action Unit Cues☆19Sep 16, 2024Updated last year