Code for the paper: Audio-Visual Model Distillation Using Acoustic Images
☆21Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for acoustic-images-distillation
Users that are interested in acoustic-images-distillation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tensorflow code for the paper 'Modality Distillation with Multiple Stream Networks for Action Recognition', ECCV 2018☆19May 2, 2019Updated 7 years ago
- End to End Multiview Lip Reading☆10Jan 26, 2018Updated 8 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Sep 13, 2023Updated 2 years ago
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)☆31Apr 13, 2020Updated 6 years ago
- This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality☆13Jul 2, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆14Apr 18, 2019Updated 7 years ago
- ☆34Jul 25, 2018Updated 7 years ago
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018)☆50Sep 24, 2019Updated 6 years ago
- ☆17Jul 17, 2017Updated 8 years ago
- Evaluation metrics and submission file creation scripts the Action Recognition challenge☆15Feb 9, 2026Updated 3 months ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Jan 22, 2021Updated 5 years ago
- ☆10Jun 1, 2023Updated 2 years ago
- Python scripts to help ACs with OpenReview☆11Feb 7, 2026Updated 3 months ago
- ☆23Dec 5, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Kaggle Competition Dstl Satellite Imagery Feature Detection☆10Apr 1, 2017Updated 9 years ago
- Official implementation of 'Bag of Color Features For Color Constancy' (BoCF) accepted in IEEE Transactions on Image Processing (TIP) 202…☆13Mar 7, 2022Updated 4 years ago
- ☆13Sep 29, 2023Updated 2 years ago
- Code for ''A Simple Baseline for Audio-Visual Scene-Aware Dialog``☆27May 26, 2020Updated 6 years ago
- Unified Algorithm Collection☆10Apr 28, 2019Updated 7 years ago
- ☆24Feb 20, 2024Updated 2 years ago
- Official implementation for AVGN☆41Mar 24, 2023Updated 3 years ago
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆84Jul 10, 2020Updated 5 years ago
- 给定一张身份证正、反面,识别身份证上的所有文字信息☆10Sep 4, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Pytorch implementation of audio-visual fusion video captioning model☆27Jul 26, 2018Updated 7 years ago
- Implementation of an out-of-distribution detection method for geospatial deployments and its related experiments.☆29Jan 8, 2025Updated last year
- Jupyter notebook for DCASE 2020 challenge Task 1☆20Jun 24, 2020Updated 5 years ago
- An implementation of capsule routing for sound event detection☆15Jan 29, 2019Updated 7 years ago
- Detects lip movement and check if a person is speaking☆19May 4, 2018Updated 8 years ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 3 years ago
- ☆27Apr 30, 2025Updated last year
- Unsupervised Domain Adaptation Using Generative Adversarial Networks for Semantic Segmentation of Aerial Images☆14May 31, 2020Updated 5 years ago
- Densely Guided Knowledge Distillation using Multiple Teacher Assistants☆11Oct 10, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An OpenCV demo on detecting whether a person is speaking or not.☆23Mar 21, 2012Updated 14 years ago
- Repository for MarioQA: Answering Questions by Watching Gameplay Videos in ICCV 2017☆10Oct 28, 2025Updated 7 months ago
- ☆55Jun 3, 2020Updated 5 years ago
- This is my attempt at the ActivityNet Challenge 2017. Thanks to the organizers for providing the boilerplate code and annotated datasets.…☆10Jul 19, 2017Updated 8 years ago
- RBM+BP神经网络识别手写数字和英文字符☆11Mar 25, 2023Updated 3 years ago
- 端到端的中文场景文字识别。☆12Jun 27, 2022Updated 3 years ago
- Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…☆14Jul 2, 2020Updated 5 years ago