Segment speech sequences based on speaker transitions, using ML and DSP.
☆17Jul 30, 2018Updated 7 years ago
Alternatives and similar repositories for Speaker-recognition
Users that are interested in Speaker-recognition are comparing it to the libraries listed below.
- Neural Turing machine for source separation in Tensorflow☆18Aug 16, 2017Updated 8 years ago
- Speaker diarization is the problem of separating speakers in an audio recording. There could be any number of speakers and final result should stat…☆64Jan 8, 2021Updated 5 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Jun 15, 2019Updated 6 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Dec 16, 2019Updated 6 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- Speaker diarization scripts, based on AaltoASR☆192Jan 3, 2019Updated 7 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- Deep neural models for core NLP tasks☆13Nov 9, 2017Updated 8 years ago
- Code for https://arxiv.org/abs/1712.00254☆16Dec 6, 2017Updated 8 years ago
- Single-channel blind source separation☆48Feb 5, 2018Updated 8 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Nov 8, 2018Updated 7 years ago
- Deep Neural Network for Speaker Count Estimation☆157Sep 5, 2020Updated 5 years ago
- Experiments for paper untitled☆14Jul 25, 2020Updated 5 years ago
- ☆10Jun 24, 2020Updated 5 years ago
- PyTorch speech2text inference script for the NVIDIA OpenSeq2Seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- Remove noise from sound clips by use of supervised training and an ideal ratio mask.☆14Apr 2, 2019Updated 7 years ago
- ☆12Aug 25, 2017Updated 8 years ago
- Visual Relocalization on the COLMAP reconstruction model☆14Nov 21, 2024Updated last year
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- ☆14Sep 21, 2022Updated 3 years ago
- Fast Double Metaphone in C++11☆21Aug 26, 2014Updated 11 years ago
- A module for normalising text.☆10Nov 6, 2019Updated 6 years ago
- Tools for speech processing, keyword spotting☆16Mar 11, 2020Updated 6 years ago
- ☆12Nov 9, 2018Updated 7 years ago
- ☆18Oct 14, 2022Updated 3 years ago
- End-to-end speech recognition using TensorFlow☆48Apr 2, 2018Updated 8 years ago
- AsoSoft Speech Corpus can be used for spoken language processing tasks in Central Kurdish such as speech recognition, speaker recognition…☆10Mar 8, 2022Updated 4 years ago
- This is an implementation of the audio source separation model as well as the evaluation metrics proposed in the paper "Weakly Informed A…☆12Nov 26, 2019Updated 6 years ago
- ☆12Oct 9, 2025Updated 7 months ago
- Using Deep Learning for singing voice separation - Project for the course DT2119 Speech and Speaker Recognition offered by KTH in 2018☆15Jun 16, 2018Updated 7 years ago
- Sequence.js Theme - A minimalist theme for showcasing products☆10Aug 21, 2015Updated 10 years ago
- STT Service based on Kaldi ASR☆15Aug 17, 2018Updated 7 years ago
- An eXample Programming Language☆11Dec 20, 2018Updated 7 years ago
- Speaker Diarization library in Python. Performs VAD, Segmentation, Linear Clustering, Hierarchical Clustering☆15Jul 28, 2017Updated 8 years ago
- A zero-shot relation extractor, easily downloadable from the HuggingFace repo.☆12Aug 13, 2021Updated 4 years ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Jun 2, 2019Updated 6 years ago
- Building an NN-CTC acoustic model with phoneme-based modeling☆15May 14, 2019Updated 6 years ago
- Language and Speech Technology for Central Kurdish Varieties (LREC-COLING 2024)☆11Nov 29, 2024Updated last year