jinxiang-liu / UFE-AVS
Official code for the CVPR 2024 paper "Audio-Visual Segmentation via Unlabeled Frame Exploitation"
☆18 · Jul 7, 2024 · Updated last year
Alternatives and similar repositories for UFE-AVS
Users that are interested in UFE-AVS are comparing it to the libraries listed below
- [BMVC 2023] Zero-shot Composed Text-Image Retrieval ☆55 · Nov 26, 2024 · Updated last year
- Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation" ☆37 · Oct 11, 2024 · Updated last year
- Official code base for "Long-Tailed Diffusion Models With Oriented Calibration" ICLR2024 ☆15 · Jul 11, 2024 · Updated last year
- A Simple Plugin for Transforming Images to Arbitrary Scales ☆19 · Feb 9, 2023 · Updated 3 years ago
- [CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-… ☆40 · Apr 20, 2025 · Updated 9 months ago
- Semantic Line Combination Detector, CVPR 2024. ☆24 · Oct 29, 2025 · Updated 3 months ago
- Code of the CDMAD: Class-Distribution-Mismatch-Aware Debiasing for Class-Imbalanced Semi-Supervised Learning (2024 CVPR accepted paper) ☆24 · Mar 18, 2024 · Updated last year
- (CVPR2024) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization ☆19 · Jun 11, 2024 · Updated last year
- Official implementation of the CVPR 2024 paper "Unsupervised Semantic Segmentation Through Depth-Guided Feature Correlation and Sampling" ☆23 · Jun 16, 2024 · Updated last year
- Complementary Patch for Weakly Supervised Semantic Segmentation, ICCV21 (poster) ☆24 · Nov 8, 2021 · Updated 4 years ago
- [CVPR 2024 Oral] Official code for LTGC: Long-Tail Recognition via Leveraging LLMs-driven Generated Content ☆22 · Apr 16, 2024 · Updated last year
- [NeurIPS 2023 Spotlight] Combating Representation Learning Disparity with Geometric Harmonization ☆24 · May 14, 2025 · Updated 9 months ago
- [CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant ☆176 · Jul 7, 2025 · Updated 7 months ago
- This is the official code of "Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation, NeurIPS 23" ☆26 · Dec 7, 2023 · Updated 2 years ago
- [CVPR 2023] Modeling Inter-Class and Intra-Class Constraints in Novel Class Discovery ☆27 · Dec 31, 2023 · Updated 2 years ago
- Universal Video Temporal Grounding with Generative Multi-modal Large Language Models ☆46 · Nov 25, 2025 · Updated 2 months ago
- ☆27 · Jul 18, 2025 · Updated 6 months ago
- Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors ☆31 · Jun 2, 2024 · Updated last year
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer ☆73 · Mar 6, 2025 · Updated 11 months ago
- This repository is a collection of awesome things about vision prompts, including papers, code, etc. ☆40 · Dec 22, 2023 · Updated 2 years ago
- Code for MOVE: Unsupervised Movable Object Segmentation and Detection; NeurIPS 2022 ☆27 · Jan 25, 2023 · Updated 3 years ago
- MUG-V 10B: High-efficiency Training Pipeline for Large Video Generation Models ☆94 · Dec 8, 2025 · Updated 2 months ago
- Source code for IEEE TPAMI 2024 "Hypergraph-Based Multi-Modal Representation for Open-Set 3D Object Retrieval" ☆39 · Feb 2, 2024 · Updated 2 years ago
- [ICLR2024] SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning ☆37 · Apr 9, 2025 · Updated 10 months ago
- Code for TMLR 2023 paper "OpenCon: Open-world Contrastive Learning" ☆39 · May 11, 2023 · Updated 2 years ago
- Includes FSC-147-D and the code for training and testing the CounTX model from the paper Open-world Text-specified Object Counting. ☆41 · Sep 27, 2024 · Updated last year
- Official implementation of "Data-Efficient Unsupervised Interpolation Without Any Intermediate Frame for 4D Medical Images" (CVPR 2024) i… ☆35 · Nov 28, 2024 · Updated last year
- Training and testing code from our CVPR 2023 paper "Are Deep Neural Networks SMARTer than Second Graders?" ☆11 · Aug 10, 2023 · Updated 2 years ago
- EMIT: Enhancing MLLMs for Industrial Anomaly Detection via Difficulty-Aware GRPO ☆19 · Jan 24, 2026 · Updated 3 weeks ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation ☆13 · Jun 17, 2024 · Updated last year
- Visual Concept Connectome ☆15 · Jun 23, 2024 · Updated last year
- Unified Multi-modal IAA Baseline and Benchmark ☆93 · Sep 27, 2024 · Updated last year
- [ECCV 2024] Towards Multimodal Open-Set Domain Generalization and Adaptation through Self-supervision ☆43 · May 23, 2025 · Updated 8 months ago
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository. ☆44 · Jul 11, 2024 · Updated last year
- [CVPR 2025] Official PyTorch code of "Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation". ☆54 · May 25, 2025 · Updated 8 months ago
- Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias". ☆45 · Dec 24, 2024 · Updated last year
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding ☆60 · Jul 9, 2025 · Updated 7 months ago
- ☆12 · Dec 15, 2022 · Updated 3 years ago
- [ICCV 2021] Multimodal Knowledge Expansion ☆10 · Aug 28, 2021 · Updated 4 years ago