MsterDC / CVM-DL_BaseLinks
Some basic topics in the field of deep learning, including papers, notes and codes, etc., hope to be helpful to later people.
☆20Updated last year
Alternatives and similar repositories for CVM-DL_Base
Users that are interested in CVM-DL_Base are comparing it to the libraries listed below
Sorting:
- Easy wrapper for inserting LoRA layers in CLIP.☆40Updated last year
- assistant tools for attention visualization in deep learning☆29Updated 3 years ago
- ☆257Updated 2 years ago
- 中科大数字图像分析(周文罡、李厚强等)2022秋学期复习资料☆23Updated 2 years ago
- A Collection of AIGC Research Groups☆89Updated 2 months ago
- ☆138Updated last year
- ☆282Updated last year
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆53Updated 4 months ago
- [CVPR 2025] PyTorch implementation of T-CORE, introduced in "When the Future Becomes the Past: Taming Temporal Correspondence for Self-su…☆18Updated 2 months ago
- 视觉实验室新手任务☆156Updated last year
- [ICCV 2023 Oral] Official Implementation of "Denoising Diffusion Autoencoders are Unified Self-supervised Learners"☆184Updated 2 months ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆102Updated 3 weeks ago
- 这里陈列了我编写的一些关于深度学习的画图工具,如果觉得有帮助可以给个star.☆42Updated 3 years ago
- The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima…☆19Updated 9 months ago
- Collection of Highlight papers☆42Updated last year
- Official PyTorch Code for Anchor Token Guided Prompt Learning Methods: [ICCV 2025] ATPrompt and [Arxiv 2511.21188] AnchorOPT☆121Updated last month
- ☆94Updated 2 years ago
- Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.☆80Updated last year
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆154Updated 10 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆347Updated 3 weeks ago
- Unofficial code for VPT(Visual Prompt Tuning) paper of arxiv 2203.12119☆164Updated 2 years ago
- Watch for idle GPUs and run your jobs: launches jobs in tmux, keeps logs/status and sends start/finish emails..☆81Updated 4 months ago
- Official code for "Top-Down Visual Attention from Analysis by Synthesis" (CVPR 2023 highlight)☆167Updated 2 years ago
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification☆49Updated 10 months ago
- [CVPR2025] Code Release of Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception☆20Updated 7 months ago
- Idempotent Generative Network's unofficial pytorch implementation☆46Updated 2 years ago
- This repository maintains a collection of important papers on conditional image synthesis with diffusion models (Survey Paper published…☆175Updated 7 months ago
- [AAAI2025] Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark☆24Updated 3 weeks ago
- Uni-OVSeg is a weakly supervised open-vocabulary segmentation framework that leverages unpaired mask-text pairs.☆53Updated last year
- [CVPR 2025] RAP: Retrieval-Augmented Personalization☆78Updated 2 months ago