A deep learning model for classifying audio frames into [SPEECH, KCHI, CHI, MAL, FEM] classes.
☆50 · Nov 22, 2025 · Updated 3 months ago
Alternatives and similar repositories for voice-type-classifier
Users who are interested in voice-type-classifier are comparing it to the repositories listed below.
- ACLEW Diarization Virtual Machine ☆34 · Jul 29, 2019 · Updated 6 years ago
- Behavioral probing of language acquisition models at the lexical and syntactic level ☆18 · Jul 17, 2023 · Updated 2 years ago
- This repository was created for the NHN ASR hackathon competition. ☆11 · Sep 20, 2023 · Updated 2 years ago
- A Python algorithm to change the pitch of the voice in real time ☆13 · Dec 13, 2020 · Updated 5 years ago
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates ☆12 · Mar 13, 2024 · Updated 2 years ago
- ☆19 · Nov 27, 2024 · Updated last year
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based on the neural network architec… ☆46 · Oct 3, 2023 · Updated 2 years ago
- Code for the paper "Speech Recognition and Multi-Speaker Diarization of Long Conversations" ☆38 · Jun 12, 2023 · Updated 2 years ago
- Materials for LOT School 2023, "Language Learning: A Data-Driven Approach" ☆14 · Aug 14, 2024 · Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present… ☆26 · Oct 5, 2022 · Updated 3 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook ☆26 · Jun 12, 2023 · Updated 2 years ago
- Research_speech_speaker_verification_nist_sre2010 ☆12 · Mar 1, 2016 · Updated 10 years ago
- The Additive Margin MobileNet1D is a new lightweight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi… ☆30 · Oct 3, 2023 · Updated 2 years ago
- Materials for KA2 (花京院と青葉2) 『その問題,やっぱり数理モデルが解決します』 ("That problem is, after all, solved by a mathematical model") ☆35 · Aug 7, 2022 · Updated 3 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi… ☆92 · Jun 6, 2021 · Updated 4 years ago
- Quaternion Neural Networks for 3D Sound Source Localization in Reverberant Environments. ☆19 · Nov 21, 2022 · Updated 3 years ago
- Python package for combining diarization system outputs. ☆92 · Oct 12, 2023 · Updated 2 years ago
- Lightweight CNN for Robust Voice Activity Detection ☆20 · Jun 30, 2023 · Updated 2 years ago
- IRL implementation based on Norvig's AIMA code. ☆14 · May 2, 2014 · Updated 11 years ago
- A simple pyaudio microphone interface ☆11 · Jul 27, 2018 · Updated 7 years ago
- Tutorial session material on Pytest from PyCon KR 2019 ☆10 · Apr 11, 2020 · Updated 5 years ago
- Example Python scripts to evaluate various ASR methods ☆11 · Dec 22, 2021 · Updated 4 years ago
- PyTorch implementation of the paper "A Global-local Attention Framework for Weakly Labelled Audio Tagging". ☆13 · Feb 6, 2021 · Updated 5 years ago
- Permutation invariant training in PyTorch ☆13 · Oct 2, 2020 · Updated 5 years ago
- Automatically set up the AISHELL-4 and MSDWild datasets for use with pyannote-database (and pyannote-audio) ☆15 · Oct 22, 2025 · Updated 4 months ago
- To access the dataset, please send a short bio and the objective of your study to Dr. Theerawit Wilaiprasitporn (theerawit dot w at v… ☆14 · Apr 29, 2021 · Updated 4 years ago
- An implementation of the Wav2Letter Speech-to-Text model using PyTorch. ☆14 · Mar 8, 2023 · Updated 3 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning. ☆23 · Nov 23, 2018 · Updated 7 years ago
- Sort vim folds based on their first line. ☆13 · May 19, 2019 · Updated 6 years ago
- ☆13 · Jan 10, 2017 · Updated 9 years ago
- A deep neural network for finding text-independent speaker embeddings, written in TensorFlow and Tensorpack ☆10 · Feb 19, 2018 · Updated 8 years ago
- ☆52 · Oct 17, 2023 · Updated 2 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit ☆13 · Nov 18, 2022 · Updated 3 years ago
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset ☆12 · Dec 11, 2019 · Updated 6 years ago
- Wav2kws is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Google Speech Commands datasets V1 and V2. ☆13 · Jun 11, 2021 · Updated 4 years ago
- This is a JavaScript toolbox to perform online rating studies with auditory material. ☆18 · Nov 18, 2024 · Updated last year
- A list of papers for child ASR ☆52 · Oct 8, 2024 · Updated last year
- Proof-of-concept conversation orchestrator with a speech-language model ☆20 · Oct 19, 2024 · Updated last year
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A… ☆12 · Jan 24, 2024 · Updated 2 years ago