AgnesMayYao / Infant-Crying-DetectionLinks
☆31Updated 2 years ago
Alternatives and similar repositories for Infant-Crying-Detection
Users that are interested in Infant-Crying-Detection are comparing it to the libraries listed below
Sorting:
- This repository includes the code to reproduce our paper "End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification A…☆90Updated last year
- Time-domain synthetic speech detection net (TSSDNet), having the classic ResNet and Inception Net style structures (Res-TSSDNet and Inc-T…☆70Updated 3 years ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆138Updated 2 weeks ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆151Updated 3 years ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆177Updated last year
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆62Updated last year
- General purpose sound recognition demo☆158Updated last year
- Repository for code and paper submitted for APSIPA 2019, Lanzhou, China☆23Updated last year
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆135Updated 7 months ago
- Official implementation of the SPL paper "One-class Learning Towards Synthetic Voice Spoofing Detection"☆126Updated 11 months ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆93Updated 3 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆293Updated 8 months ago
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆39Updated last year
- ☆109Updated 3 years ago
- ☆50Updated last year
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆20Updated last year
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆211Updated 2 years ago
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆90Updated 2 years ago
- ☆57Updated 11 months ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆44Updated 3 years ago
- Toolkit for downloading and processing Google's AudioSet dataset.☆170Updated 2 years ago
- 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition.☆42Updated 4 years ago
- A two-stage polyphonic sound event detection and localization method for both SED and DOA.☆114Updated 2 years ago
- This repository includes the code to reproduce our paper "End-to-end anti-spoofing with RawNet2" (https://arxiv.org/abs/2011.01108) publi…☆60Updated 2 years ago
- PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models (…☆60Updated last year
- Implementation of the paper: Replay and Synthetic Speech Detection with Res2Net architecture (ICASSP 2021) https://arxiv.org/abs/2010.150…☆82Updated 3 years ago
- This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.☆205Updated 2 years ago
- ☆17Updated 8 months ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆147Updated 2 years ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆44Updated last year