Cross-modal active contrastive coding
☆22 Mar 17, 2021 Updated 4 years ago
Alternatives and similar repositories for CM-ACC
Users who are interested in CM-ACC are comparing it to the repositories listed below
- ☆14 Oct 7, 2021 Updated 4 years ago
- ☆11 Apr 30, 2025 Updated 10 months ago
- ☆30 Jun 14, 2022 Updated 3 years ago
- Cross-modal background suppression for audio-visual event localization ☆36 Mar 18, 2022 Updated 3 years ago
- Automatic Dance-driven Music Generation ☆15 Jul 29, 2021 Updated 4 years ago
- The audio-visual fusion method for FFIA ☆26 Aug 5, 2024 Updated last year
- Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models". ☆57 Apr 20, 2023 Updated 2 years ago
- ☆31 Sep 20, 2021 Updated 4 years ago
- A Python algorithm to change the pitch of the voice in real time ☆13 Dec 13, 2020 Updated 5 years ago
- Official repository for "Unveiling Opinion Evolution via Prompting and Diffusion for Short Video Fake News Detection", ACL Findings 2024. ☆14 Apr 25, 2025 Updated 10 months ago
- The project is an official implementation of our paper "RSGNet: Relation based Skeleton Graph Network for Crowded Scenes Pose Estimation… ☆10 Dec 9, 2020 Updated 5 years ago
- PyTorch code for "Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes" (CVPR, 2022… ☆32 Jul 8, 2024 Updated last year
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation” ☆15 Jan 3, 2025 Updated last year
- Anki add-on that adds Pinyin and Zhuyin readings above Chinese characters in any field. ☆12 Sep 23, 2025 Updated 5 months ago
- Convert an image to stereographic projection (Polar Coordinates) ☆10 Oct 15, 2022 Updated 3 years ago
- ☆11 Jan 13, 2023 Updated 3 years ago
- Documentation at ☆14 Mar 27, 2025 Updated 11 months ago
- [2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line ☆42 Jul 5, 2022 Updated 3 years ago
- PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos" ☆40 Dec 15, 2020 Updated 5 years ago
- Cross-Modal Center Loss for 3D Cross-Modal Retrieval (CVPR2021) ☆35 Apr 4, 2021 Updated 4 years ago
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025) ☆13 Sep 6, 2024 Updated last year
- DAVIS web repo ☆10 Jan 26, 2023 Updated 3 years ago
- POM: Occupancy map estimation for people detection ☆10 Aug 5, 2014 Updated 11 years ago
- This repo hosts the code and model of MAViL. ☆45 Jul 24, 2023 Updated 2 years ago
- ☆13 Aug 27, 2020 Updated 5 years ago
- ☆16 Sep 29, 2025 Updated 5 months ago
- ☆12 Jun 2, 2025 Updated 9 months ago
- Training code repo of the paper "DeepDance: Music-to-Dance Motion Choreography with Adversarial Learning" ☆11 May 18, 2021 Updated 4 years ago
- Recording of Kinect V2 Streams at 30 fps. ☆10 Jul 5, 2017 Updated 8 years ago
- ☆14 Jun 13, 2024 Updated last year
- Vision Transformers are Parameter-Efficient Audio-Visual Learners ☆106 Aug 11, 2023 Updated 2 years ago
- Instructions to 1) download Kinetics-400/Kinetics-600, 2) resize the videos, and 3) prepare annotations. ☆11 Jan 19, 2022 Updated 4 years ago
- Motion-conditional image animation for video editing ☆20 Dec 2, 2023 Updated 2 years ago
- Learning CUDA and GPU programming, mainly from the book "CUDA Programming: A Developer's Guide to Parallel C… ☆11 Aug 14, 2017 Updated 8 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi… ☆10 Oct 12, 2023 Updated 2 years ago
- ☆14 Jul 27, 2022 Updated 3 years ago
- Audio-Visual Speech Recognition ☆20 Jul 7, 2025 Updated 8 months ago
- This repo contains the implementation of deep reinforcement learning (DRL) algorithms for virtual machine rescheduling in data centers. ☆12 Dec 2, 2022 Updated 3 years ago
- Implementation of Attention-based Fusion for Multi-source Human Image Generation, S. Lathuilière, E. Sangineto, A. Siarohin, N. Sebe, WAC… ☆10 Oct 9, 2020 Updated 5 years ago