oscarknagg / raw-audio-gender-classification
Machine learning experiment to perform gender classification from raw audio.
☆23Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for raw-audio-gender-classification
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 2 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 3 years ago
- A Kaldi/ESPnet based approach to perform automatic speech recognition on low resource languages☆9Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆24Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆71Updated 3 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- Various algorithms for voice activity detection☆22Updated 7 years ago
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆34Updated 4 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆35Updated 4 years ago
- Official Implementation of Mockingjay in Pytorch☆52Updated last year
- The codebase for Data-driven general-purpose voice activity detection.☆93Updated last year
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆36Updated last year
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 2 years ago
- ☆38Updated 2 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Updated 4 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆64Updated 2 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 3 years ago
- ☆37Updated 3 years ago
- Clustering-based methods for overlapping diarization☆70Updated 10 months ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆57Updated 3 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆38Updated 3 years ago
- Efficient Speech Processing Tookit for Automatic Speaker Recognition☆17Updated last year