MsterDC / CVM-DL_Base
Basic topics in deep learning, including papers, notes, and code, shared in the hope of helping newcomers.
☆20Updated last year
Alternatives and similar repositories for CVM-DL_Base
Users that are interested in CVM-DL_Base are comparing it to the libraries listed below
- Easy wrapper for inserting LoRA layers in CLIP.☆35Updated last year
- [ICCV 2025] Official PyTorch Code for "Advancing Textual Prompt Learning with Anchored Attributes"☆83Updated 3 weeks ago
- Beginner tasks for a vision lab☆155Updated last year
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆43Updated last year
- ☆250Updated last year
- Review materials for the USTC Digital Image Analysis course (Zhou Wengang, Li Houqiang, et al.), Fall 2022 semester☆17Updated 2 years ago
- The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima…☆14Updated 4 months ago
- This repository is the official code for the paper "AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation" (NeurIPS 2024).☆13Updated 10 months ago
- Official code for "DiffX: Guide Your Layout to Cross-Modal Generative Modeling"☆22Updated 5 months ago
- Uni-OVSeg is a weakly supervised open-vocabulary segmentation framework that leverages unpaired mask-text pairs.☆52Updated last year
- ☆67Updated 3 months ago
- A Collection of AIGC Research Groups☆78Updated 2 weeks ago
- [CVPR 2024] LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion.☆46Updated 6 months ago
- [CVPR 2025] RAP: Retrieval-Augmented Personalization☆64Updated last week
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆321Updated last week
- ☆135Updated last year
- [ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation☆134Updated last month
- A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-of…☆109Updated this week
- A Collection of Papers and Codes for CVPR2025/ICCV2025/CVPR2024/ECCV2024 AIGC☆574Updated 2 weeks ago
- ☆94Updated 2 years ago
- Unifies the anonymous and camera-ready versions; hope everyone gets an ACCEPT☆260Updated last month
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆268Updated last week
- Grabbing (occupying) GPUs☆75Updated 9 months ago
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆149Updated 4 months ago
- [ECCV 2024] SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation☆37Updated 4 months ago
- A collection of plotting tools I wrote for deep learning; if you find them helpful, feel free to give the repo a star.☆41Updated 2 years ago
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆636Updated 2 weeks ago
- [AAAI2025] Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark☆15Updated 3 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation☆145Updated 6 months ago
- [ACMMM 2024] Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors☆24Updated 9 months ago