afperezm/acoustic-images-distillation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/afperezm/acoustic-images-distillation)

afperezm / acoustic-images-distillation

Code for the paper: Audio-Visual Model Distillation Using Acoustic Images

☆21

Alternatives and similar repositories for acoustic-images-distillation

Users that are interested in acoustic-images-distillation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ankitshah009 / WALNet-Weak_Label_Analysis
View on GitHub
Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.
☆32Sep 13, 2023Updated 2 years ago
lzuwei / ip-avsr
View on GitHub
Audio Visual Speech Recognition
☆23Aug 9, 2017Updated 8 years ago
V-Sense / 360AudioVisual
View on GitHub
This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality
☆13Jul 2, 2019Updated 7 years ago
yujmo / CZU_MHAD
View on GitHub
CZU-MHAD: A multimodal dataset for human action recognition utilizing a depth camera and 10 wearable inertial sensors
☆26Jun 2, 2022Updated 4 years ago
hudaAlamri / DSTC7-Audio-Visual-Scene-Aware-Dialog-AVSD-Challenge
View on GitHub
☆54Nov 18, 2019Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
qiuqiangkong / sampleRNN_acoustic_scene_generation
View on GitHub
☆14Apr 18, 2019Updated 7 years ago
georgesterpu / Taris
View on GitHub
Transformer-based online speech recognition system with TensorFlow 2
☆26Jan 22, 2021Updated 5 years ago
firasl / BoCF
View on GitHub
Official implementation of 'Bag of Color Features For Color Constancy' (BoCF) accepted in IEEE Transactions on Image Processing (TIP) 202…
☆13Mar 7, 2022Updated 4 years ago
jgomezpe / unalcol
View on GitHub
Unified Algorithm Collection
☆10Apr 28, 2019Updated 7 years ago
dialogtekgeek / AudioVisualSceneAwareDialog
View on GitHub
☆27May 4, 2020Updated 6 years ago
arunmallya / openreview_helper
View on GitHub
Python scripts to help ACs with OpenReview
☆11Feb 7, 2026Updated 5 months ago
PiercingDan / kaggle-dstl
View on GitHub
Kaggle Competition Dstl Satellite Imagery Feature Detection
☆10Apr 1, 2017Updated 9 years ago
ricvolpi / certified-distributional-robustness
View on GitHub
(Unofficial) Code for the paper "Certifying Some Distributional Robustness with Principled Adversarial Training"
☆13May 31, 2018Updated 8 years ago
Orion-AI-Lab / PrototypeInSAR
View on GitHub
☆13Sep 29, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
idansc / simple-avsd
View on GitHub
Code for ''A Simple Baseline for Audio-Visual Scene-Aware Dialog``
☆27May 26, 2020Updated 6 years ago
turpaultn / DCASE2019_task4
View on GitHub
Baseline of dcase 2019 task 4
☆61Sep 2, 2022Updated 3 years ago
zhangzhao156 / Human-Activity-Recognition-Codes-Datasets
View on GitHub
The comparsion methods code
☆12Mar 7, 2022Updated 4 years ago
zhoujuncc1 / shenjingcat
View on GitHub
☆14Mar 8, 2023Updated 3 years ago
epic-kitchens / C1-Action-Recognition
View on GitHub
Evaluation metrics and submission file creation scripts the Action Recognition challenge
☆15Feb 9, 2026Updated 5 months ago
sbargal / Caffe-ExcitationBP-RNNs
View on GitHub
Excitation Backprop for RNNs
☆15Jul 25, 2018Updated 8 years ago
jhuang81 / weak-sup-visual-grounding
View on GitHub
The official implementation of CVPR 2021 Paper: Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation.
☆12Oct 15, 2021Updated 4 years ago
mdfahimhasan / Global-Subsidence-Groundwater
View on GitHub
☆17Nov 9, 2023Updated 2 years ago
telecombcn-dl / 2018-dlsl
View on GitHub
UPC Deep Learning for Speech and Language 2018
☆17Feb 26, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
bilel-bj / unsupervised-domain-adaptation-gan
View on GitHub
Unsupervised Domain Adaptation Using Generative Adversarial Networks for Semantic Segmentation of Aerial Images
☆14May 31, 2020Updated 6 years ago
wonchulSon / DGKD
View on GitHub
Densely Guided Knowledge Distillation using Multiple Teacher Assistants
☆11Oct 10, 2021Updated 4 years ago
SSahuDS / Lipreading-Using-Mutimodal-Speech-Recognition
View on GitHub
Multimodal Speech Recognition for phoneme level prediction using Audio-Visual data from TCDTIMIT dataset implementing RNNs with LSTMs for…
☆15Jul 27, 2023Updated 3 years ago
WisleyWang / DC-AI-LipReading
View on GitHub
☆11May 31, 2020Updated 6 years ago
yashshah / LipReader
View on GitHub
An OpenCV demo on detecting whether a person is speaking or not.
☆23Mar 21, 2012Updated 14 years ago
JonghwanMun / MarioQA
View on GitHub
Repository for MarioQA: Answering Questions by Watching Gameplay Videos in ICCV 2017
☆10Oct 28, 2025Updated 9 months ago
qiuqiangkong / sound_event_detection_dcase2017_task4
View on GitHub
☆55Jun 3, 2020Updated 6 years ago
arpane4c5 / ActivityNet
View on GitHub
This is my attempt at the ActivityNet Challenge 2017. Thanks to the organizers for providing the boilerplate code and annotated datasets.…
☆10Jul 19, 2017Updated 9 years ago
svip-lab / Weekly_Group_Meeting_Paper_List
View on GitHub
☆42Aug 8, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
luanshiyinyang / ChineseOCR
View on GitHub
端到端的中文场景文字识别。
☆12Jun 27, 2022Updated 4 years ago
Lenvia / RBM-BP-character-recognition
View on GitHub
RBM+BP神经网络识别手写数字和英文字符
☆11Mar 25, 2023Updated 3 years ago
William-N-Havard / SpeechCoco
View on GitHub
☆12Nov 23, 2020Updated 5 years ago
LeeYongHyeok / DCM_vgg_transformer
View on GitHub
Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…
☆14Jul 2, 2020Updated 6 years ago
goddoe / hide-and-seek
View on GitHub
Tensorflow implementation of "Hide-and-Seek: Forcing a Network to be Meticulous for Weakly-supervised Object and Action Localization"[ICC…
☆13Mar 29, 2019Updated 7 years ago
rugrag / learn-unbiased
View on GitHub
Code for the paper Learning Unbiased Representations via Mutual Information Backpropagation
☆21Mar 23, 2020Updated 6 years ago
AI-Research-BD / Keyword-MLP
View on GitHub
Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur. M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.
☆15Nov 5, 2022Updated 3 years ago