Code for the paper: Audio-Visual Model Distillation Using Acoustic Images
☆21Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for acoustic-images-distillation
Users that are interested in acoustic-images-distillation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Sep 13, 2023Updated 2 years ago
- Audio Visual Speech Recognition☆23Aug 9, 2017Updated 8 years ago
- This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality☆13Jul 2, 2019Updated 6 years ago
- ☆14Apr 18, 2019Updated 7 years ago
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018)☆50Sep 24, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆17Jul 17, 2017Updated 8 years ago
- Evaluation metrics and submission file creation scripts the Action Recognition challenge☆15Feb 9, 2026Updated 2 months ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Jan 22, 2021Updated 5 years ago
- ☆23Dec 5, 2023Updated 2 years ago
- Baseline of dcase 2019 task 4☆62Sep 2, 2022Updated 3 years ago
- Road extraction with deep learning from high resolution satellite images.☆13Sep 16, 2021Updated 4 years ago
- Official implementation for AVGN☆41Mar 24, 2023Updated 3 years ago
- 给定一张身份证正、反面,识别身份证上的所有文字信息☆10Sep 4, 2019Updated 6 years ago
- The official implementation of CVPR 2021 Paper: Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation.☆12Oct 15, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Jupyter notebook for DCASE 2020 challenge Task 1☆20Jun 24, 2020Updated 5 years ago
- UPC Deep Learning for Speech and Language 2018☆17Feb 26, 2018Updated 8 years ago
- Acoustic Scene Classification Using Deep Residual Networks with Late Fusion of Separated High and Low Frequency Paths - McDonnell and Gao…☆22Jul 3, 2024Updated last year
- Excitation Backprop for RNNs☆15Jul 25, 2018Updated 7 years ago
- Scripts to download and explore the How2Sign dataset. If you have any questions, please contact: amanda.duarte@upc.edu☆26Jan 25, 2023Updated 3 years ago
- Detects lip movement and check if a person is speaking☆19May 4, 2018Updated 7 years ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- ☆11May 31, 2020Updated 5 years ago
- Repository for MarioQA: Answering Questions by Watching Gameplay Videos in ICCV 2017☆10Oct 28, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆55Jun 3, 2020Updated 5 years ago
- ☆42Aug 8, 2021Updated 4 years ago
- This is my attempt at the ActivityNet Challenge 2017. Thanks to the organizers for providing the boilerplate code and annotated datasets.…☆10Jul 19, 2017Updated 8 years ago
- RBM+BP神经网络识别手写数字和英文字符☆11Mar 25, 2023Updated 3 years ago
- 端到端的中文场景文字识别。☆12Jun 27, 2022Updated 3 years ago
- Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…☆15Jul 2, 2020Updated 5 years ago
- Code for the paper Learning Unbiased Representations via Mutual Information Backpropagation☆21Mar 23, 2020Updated 6 years ago
- ☆17Feb 14, 2020Updated 6 years ago
- Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur. M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.☆15Nov 5, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Beyond Universal Saliency: Personalized Saliency Prediction with Multi-task CNN (IJCAI 2017 and TPAMI)☆11Jan 17, 2019Updated 7 years ago
- Third-party toolkit for Rope3D dataset☆13Jun 13, 2022Updated 3 years ago
- Implementation of "Encoraging LSTMs to Anticipate Actions Very Early", ICCV 2017☆19Mar 25, 2018Updated 8 years ago
- A TensorFlow implementation of dependency-based word embeddings (dependency-based word2vec)☆12Jan 26, 2016Updated 10 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- Multimodal Speech Recognition for phoneme level prediction using Audio-Visual data from TCDTIMIT dataset implementing RNNs with LSTMs for…☆15Jul 27, 2023Updated 2 years ago
- Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)☆15May 27, 2020Updated 5 years ago