AgnesMayYao / Infant-Crying-DetectionLinks
☆29Updated 2 years ago
Alternatives and similar repositories for Infant-Crying-Detection
Users that are interested in Infant-Crying-Detection are comparing it to the libraries listed below
Sorting:
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆34Updated 3 years ago
- This repository contains official pytorch implementation and pre-trained models for the MR-RawNet.☆16Updated 11 months ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆91Updated 3 years ago
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆46Updated 3 years ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆60Updated 10 months ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆21Updated 2 years ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆150Updated 3 years ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆174Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆132Updated 5 months ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆131Updated this week
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆11Updated 2 years ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆39Updated 11 months ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆21Updated 10 months ago
- ☆108Updated 2 years ago
- Official implement of SpeechFormer written in Python (PyTorch).☆80Updated 2 years ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Updated last year
- 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition.☆39Updated 4 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆41Updated 2 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆67Updated 3 years ago
- ☆30Updated last year
- Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT s…☆56Updated last year
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆89Updated 2 years ago
- ☆65Updated 8 months ago
- Source code for Consistent ensemble distillation for audio tagging☆32Updated 10 months ago
- This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"☆28Updated last year
- Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023)☆67Updated 2 months ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 4 years ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆44Updated 3 years ago