jinxiang-liu/UFE-AVS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jinxiang-liu/UFE-AVS)

jinxiang-liu / UFE-AVS

Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation""

☆19

Alternatives and similar repositories for UFE-AVS

Users that are interested in UFE-AVS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jinxiang-liu / anno-free-AVS
View on GitHub
Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"
☆38Oct 11, 2024Updated last year
Code-kunkun / ZS-CIR
View on GitHub
[BMVC 2023] Zero-shot Composed Text-Image Retrieval
☆55Nov 26, 2024Updated last year
Lzq5 / Video-Text-Alignment
View on GitHub
☆28Jul 18, 2025Updated last year
Lipurple / ARIS
View on GitHub
A Simple Plugin for Transforming Images to Arbitrary Scales
☆19Feb 9, 2023Updated 3 years ago
MediaBrain-SJTU / OC_LT
View on GitHub
Official code base for "Long-Tailed Diffusion Models With Oriented Calibration" ICLR2024
☆19Jul 11, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
xuchengjian632 / UMind
View on GitHub
A Unified MultItask Network for zero-shot M/EEG visual Decoding (referred to UMind), including visual stimulus retrieval, classification,…
☆19Jun 11, 2026Updated last month
SII-Ferenas / PGSeg
View on GitHub
This is the official code of "Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation, NeurIPS 23"
☆27Dec 7, 2023Updated 2 years ago
Code-kunkun / LamRA
View on GitHub
[CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
☆182Jul 7, 2025Updated last year
yannqi / COMBO-AVS
View on GitHub
[CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…
☆40Apr 20, 2025Updated last year
pedro-morgado / AVSpatialAlignment
View on GitHub
☆31Jun 14, 2022Updated 4 years ago
VisualAIKHU / NoPrior_MultiSSL
View on GitHub
Official Repository for "Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge" (CVPR 2024)
☆17Sep 1, 2024Updated last year
WikiChao / DAVIS
View on GitHub
[🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …
☆33Mar 30, 2026Updated 3 months ago
zhengrongz / AoTD
View on GitHub
[CVPR 2025] Official PyTorch code of "Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation".
☆58Updated this week
Lzq5 / UniTime
View on GitHub
Universal Video Temporal Grounding with Generative Multi-modal Large Language Models
☆56May 20, 2026Updated 2 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
leonsick / depthg
View on GitHub
Official implementation of the CVPR 2024 paper "Unsupervised Semantic Segmentation Through Depth-Guided Feature Correlation and Sampling"
☆24Jun 16, 2024Updated 2 years ago
zyxia1009 / CVPR2024-TSPNet
View on GitHub
(CVPR2024) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization
☆20Jun 11, 2024Updated 2 years ago
meder411 / Spherical-Package
View on GitHub
The backend code for my projects associated with spherical images
☆17Oct 12, 2021Updated 4 years ago
FanZhichen / NCD-IIC
View on GitHub
[CVPR 2023] Modeling Inter-Class and Intra-Class Constraints in Novel Class Discovery
☆27Dec 31, 2023Updated 2 years ago
vvvb-github / AVSegFormer
View on GitHub
[AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer
☆74Mar 6, 2025Updated last year
p-sachin / audio_reconstruction_project
View on GitHub
A research based project which uses steganography and ML/deep learning algorithm to reconstruct the lost audio signals from a corrupted f…
☆12Dec 5, 2022Updated 3 years ago
haoningwu3639 / SimpleSDM-3
View on GitHub
A simple and flexible PyTorch implementation of StableDiffusion-3 based on diffusers for DIY and finetuning.
☆27May 28, 2025Updated last year
shan18 / Depth-Estimation-Segmentation
View on GitHub
Model for Monocular Depth Estimation and Image Segmentation
☆14Jul 31, 2021Updated 4 years ago
lxa9867 / QSD
View on GitHub
[CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"
☆12Feb 27, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
HCIS-Lab / Action-slot
View on GitHub
[CVPR 2024] Action-slot: Visual Action-centric Representations for Atomic Activity Recognition in Traffic Scenes
☆25Apr 28, 2025Updated last year
qirui-chen / RGA3-release
View on GitHub
[ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring
☆24Aug 8, 2025Updated 11 months ago
SII-Ferenas / CPN
View on GitHub
Complementary Patch for Weakly Supervised Semantic Segmentation, ICCV21 (poster)
☆24Nov 8, 2021Updated 4 years ago
mrwu-mac / ControlMLLM
View on GitHub
[NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'
☆211Jul 17, 2025Updated last year
chenyiyuu / CA-Jaccard
View on GitHub
[CVPR 2024] CA-Jaccard: Camera-aware Jaccard Distance for Person Re-identification
☆30Oct 28, 2024Updated last year
RyannDaGreat / peekaboo
View on GitHub
Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors
☆31Jun 2, 2024Updated 2 years ago
YYX666660 / LAVSS
View on GitHub
Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation
☆19Feb 25, 2025Updated last year
cjddny / cocos2d_guardCarrot
View on GitHub
cocos2d-x 保卫萝卜 C++
☆11Mar 1, 2016Updated 10 years ago
nerves-project-attic / nerves_network_interface
View on GitHub
Discover, setup, and get stats on network interfaces
☆11Nov 17, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
FannyChao / AVS360_audiovisual_saliency_360
View on GitHub
Towards Audio-Visual Saliency Prediction for Omnidirectional Video with Spatial Audio
☆20Dec 28, 2021Updated 4 years ago
merlresearch / SMART
View on GitHub
Training and testing code from our CVPR 2023 paper "Are Deep Neural Networks SMARTer than Second Graders?"
☆11Aug 10, 2023Updated 2 years ago
donghao51 / AEO
View on GitHub
[ICLR 2025] Towards Robust Multimodal Open-set Test-time Adaptation via Adaptive Entropy-aware Optimization
☆27May 23, 2025Updated last year
LinfengYuan1997 / LoSh
View on GitHub
[CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
☆13Jun 17, 2024Updated 2 years ago
yunbeizhang / DPCore
View on GitHub
[ICML 2025] DPCore: Dynamic Prompt Coreset for Continual Test-Time Adaptation
☆30Feb 27, 2026Updated 5 months ago
deeplearning-wisc / opencon
View on GitHub
Code for TMLR 2023 paper "OpenCon: Open-world Contrastive Learning"
☆38May 11, 2023Updated 3 years ago
victkk / 3DGS_SLAM_mobile_app
View on GitHub
☆18Oct 18, 2025Updated 9 months ago