NTU-CCA / EE6401Links
EE6401 Advanced Digital Signal Processing
☆16Updated 4 years ago
Alternatives and similar repositories for EE6401
Users that are interested in EE6401 are comparing it to the libraries listed below
Sorting:
- The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"☆437Updated last month
- soundnet and localize sound source☆11Updated 4 years ago
- Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge☆55Updated 7 months ago
- Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023)☆71Updated 7 months ago
- ☆16Updated 4 months ago
- DeepEar: Sound Localization with Binaural Microphones☆12Updated last year
- PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)☆31Updated last year
- [WACV 2023] Audio-Visual Efficient Conformer (AVEC) for Robust Speech Recognition☆99Updated 2 years ago
- ☆86Updated 2 years ago
- A curated list of audio-visual learning methods and datasets.☆271Updated 10 months ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆43Updated 5 months ago
- ☆34Updated 11 months ago
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆214Updated 2 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆303Updated 10 months ago
- Official repository of NeXt-TDNN for speaker verification☆78Updated last year
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆146Updated last month
- Speech Separation☆76Updated last year
- The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.☆31Updated last year
- Official repository supporting the L3DAS23 IEEE ICASSP Grand Challenge☆16Updated 2 years ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆152Updated 3 years ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆48Updated last year
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆45Updated 3 years ago
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.☆50Updated 2 years ago
- ☆37Updated last year
- ☆66Updated last year
- Learning discriminative and robust time-frequency representations for environmental sound classification: Convolutional neural networks (…☆30Updated 5 years ago
- This repo aims to perform sound localization in complex audiovisual scenes, where there multiple objects making sounds.☆88Updated 4 years ago
- Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation☆16Updated 7 months ago
- Official implement of SpeechFormer written in Python (PyTorch).☆81Updated 2 years ago
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆35Updated 2 years ago