A deep learning model for classifying audio frames into [SPEECH, KCHI, CHI, MAL, FEM] classes.
☆51Nov 22, 2025Updated 6 months ago
Alternatives and similar repositories for voice-type-classifier
Users who are interested in voice-type-classifier are comparing it to the libraries listed below.
- ACLEW Diarization Virtual Machine☆34Jul 29, 2019Updated 6 years ago
- Behavioral probing of language acquisition models at the lexical and syntactic level☆20Jul 17, 2023Updated 2 years ago
- This repository was created for the NHN ASR hackathon competition.☆11Sep 20, 2023Updated 2 years ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates☆12Mar 13, 2024Updated 2 years ago
- ☆19Nov 27, 2024Updated last year
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆46Oct 3, 2023Updated 2 years ago
- Python package for the management of day-long recordings of children.☆16Apr 22, 2026Updated last month
- Materials for LOT School 2023, "Language Learning: A Data-Driven Approach"☆14Aug 14, 2024Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Jun 12, 2023Updated 2 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆61Oct 7, 2020Updated 5 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆31Oct 3, 2023Updated 2 years ago
- Materials for KA2 (Kakyoin and Aoba 2), 『その問題,やっぱり数理モデルが解決します』 ("That Problem Is Solved by a Mathematical Model After All")☆35Aug 7, 2022Updated 3 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Jun 6, 2021Updated 5 years ago
- Python package for combining diarization system outputs.☆94Oct 12, 2023Updated 2 years ago
- ☆17Mar 26, 2023Updated 3 years ago
- A repository I am building while studying, after seeing 책 읽어주는 딥러닝 ("Deep Learning That Reads Books Aloud") made me want to build one myself.☆10Dec 8, 2022Updated 3 years ago
- Example python scripts to evaluate various ASR methods☆11Dec 22, 2021Updated 4 years ago
- Classify the emotions from variable-length speech segments☆11Mar 29, 2018Updated 8 years ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 7 months ago
- An implementation of the Wav2Letter Speech-to-Text model using PyTorch.☆14Mar 8, 2023Updated 3 years ago
- One prompt. A full AI engineering team. Go lie on the couch. 🧠☆98Apr 18, 2026Updated last month
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Nov 23, 2018Updated 7 years ago
- pytorch implementation of wavenet autoencoder https://arxiv.org/pdf/1704.01279.pdf☆12Jul 25, 2018Updated 7 years ago
- ☆13Jan 10, 2017Updated 9 years ago
- A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack☆10Feb 19, 2018Updated 8 years ago
- CROW: A Self-Supervised Crop Row Navigation Algorithm for Agricultural Fields☆15Jan 20, 2026Updated 4 months ago
- ☆11Mar 12, 2019Updated 7 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- ☆55Oct 17, 2023Updated 2 years ago
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset☆12Dec 11, 2019Updated 6 years ago
- Wav2kws is keyword spotting (KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Google Speech Commands datasets V1 and V2.☆13Jun 11, 2021Updated 4 years ago
- This is a Javascript toolbox to perform online rating studies with auditory material.☆18Nov 18, 2024Updated last year
- A list of papers for child ASR☆54Oct 8, 2024Updated last year
- proof of concept conversation orchestrator with a speech-language model☆20Oct 19, 2024Updated last year
- 2019 PyCon KR tutorial: "Getting Started with Implementing NLP Papers Using Naver Movie Rating Data"☆13Aug 21, 2019Updated 6 years ago