AgnesMayYao / Infant-Crying-DetectionLinks
☆30Updated 2 years ago
Alternatives and similar repositories for Infant-Crying-Detection
Users that are interested in Infant-Crying-Detection are comparing it to the libraries listed below
Sorting:
- Source code for Consistent ensemble distillation for audio tagging☆34Updated 2 weeks ago
- This repository contains official pytorch implementation and pre-trained models for the MR-RawNet.☆16Updated last year
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆50Updated 2 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆49Updated 6 years ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆44Updated 3 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆34Updated 3 years ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Updated last year
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆66Updated 3 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆91Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago
- ☆65Updated 9 months ago
- ☆52Updated 2 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆142Updated 2 years ago
- General purpose sound recognition demo☆157Updated last year
- Speech Separation☆64Updated last year
- ☆26Updated 3 years ago
- Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家☆44Updated last year
- A summary of speech data augment algorithms☆69Updated 4 years ago
- PyTorch implementation of LiMuSE☆31Updated 2 years ago
- ☆84Updated 2 years ago
- VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer☆34Updated 2 years ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆54Updated 2 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆20Updated 11 months ago
- ☆13Updated last year
- ☆30Updated 2 years ago
- Baseline method for sound event localization task of DCASE 2023 challenge☆53Updated 2 years ago
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆90Updated 2 years ago
- Speech Dereverberation using Fully Convolutional Networks☆72Updated 4 years ago
- Repo associated to the DESED dataset, download and creation of data☆138Updated 11 months ago
- A two-stage polyphonic sound event detection and localization method for both SED and DOA.☆114Updated 2 years ago