Code for the paper: Audio-Visual Model Distillation Using Acoustic Images
☆21Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for acoustic-images-distillation
Users that are interested in acoustic-images-distillation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- End to End Multiview Lip Reading☆10Jan 26, 2018Updated 8 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Sep 13, 2023Updated 2 years ago
- Audio Visual Speech Recognition☆23Aug 9, 2017Updated 8 years ago
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)☆31Apr 13, 2020Updated 5 years ago
- This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality☆13Jul 2, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆54Nov 18, 2019Updated 6 years ago
- ☆14Apr 18, 2019Updated 6 years ago
- ☆34Jul 25, 2018Updated 7 years ago
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018)☆50Sep 24, 2019Updated 6 years ago
- ☆17Jul 17, 2017Updated 8 years ago
- Evaluation metrics and submission file creation scripts the Action Recognition challenge☆15Feb 9, 2026Updated last month
- Implementation of "With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition, BMVC, 2021" in PyTorch☆20Dec 16, 2021Updated 4 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Jan 22, 2021Updated 5 years ago
- ☆27May 4, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆23Dec 5, 2023Updated 2 years ago
- Baseline of dcase 2019 task 4☆62Sep 2, 2022Updated 3 years ago
- Official implementation for AVGN☆40Mar 24, 2023Updated 3 years ago
- ☆12Mar 8, 2023Updated 3 years ago
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆83Jul 10, 2020Updated 5 years ago
- 给定一张身份证正、反面,识别身份证上的所有文字信息。☆10Sep 4, 2019Updated 6 years ago
- Jupyter notebook for DCASE 2020 challenge Task 1☆20Jun 24, 2020Updated 5 years ago
- Code for "Lifting Monocular Events to 3D Human Poses" - CVPRw 2021☆17Aug 30, 2024Updated last year
- Acoustic Scene Classification Using Deep Residual Networks with Late Fusion of Separated High and Low Frequency Paths - McDonnell and Gao…☆22Jul 3, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- An implementation of capsule routing for sound event detection☆15Jan 29, 2019Updated 7 years ago
- Excitation Backprop for RNNs☆15Jul 25, 2018Updated 7 years ago
- An OpenCV demo on detecting whether a person is speaking or not.☆23Mar 21, 2012Updated 14 years ago
- ☆11May 31, 2020Updated 5 years ago
- RBM+BP神经网络识别手写数字和英文字符☆11Mar 25, 2023Updated 3 years ago
- ☆12Nov 23, 2020Updated 5 years ago
- Tensorflow implementation of "Hide-and-Seek: Forcing a Network to be Meticulous for Weakly-supervised Object and Action Localization"[ICC…☆13Mar 29, 2019Updated 7 years ago
- PocketSphinx_Speech_Recognition☆10Aug 5, 2021Updated 4 years ago
- Web app created to collect audios for course project☆10Apr 6, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆17Feb 14, 2020Updated 6 years ago
- Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur. M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.☆15Nov 5, 2022Updated 3 years ago
- Third-party toolkit for Rope3D dataset☆13Jun 13, 2022Updated 3 years ago
- Implementation of "Encoraging LSTMs to Anticipate Actions Very Early", ICCV 2017☆19Mar 25, 2018Updated 8 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification☆37Aug 23, 2018Updated 7 years ago
- Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)☆15May 27, 2020Updated 5 years ago