Segment speech sequences based on speaker transitions, using ML and DSP.
☆17Jul 30, 2018Updated 7 years ago
Alternatives and similar repositories for Speaker-recognition
Users that are interested in Speaker-recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Neural Turing machine for source separation in Tensorflow☆18Aug 16, 2017Updated 8 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Jan 8, 2021Updated 5 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Jun 15, 2019Updated 6 years ago
- Docker container for ocropus3 OCR system☆12Aug 19, 2018Updated 7 years ago
- ☆10Aug 5, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Dec 16, 2019Updated 6 years ago
- PyTorch Implementation of Noise2Noise☆11Aug 24, 2018Updated 7 years ago
- Jabalín is an application for generating verbs in Modern Standard Arabic. The application is implemented in python language version 3. Th…☆12Jul 12, 2015Updated 10 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- C++ (OpenCV) implementation of the Unsupervised Feature Learning algorithm of Adam Coates and Andrew Ng for Scene Text Detection and Reco…☆14Jun 25, 2015Updated 10 years ago
- Speaker diarization scripts, based on AaltoASR☆191Jan 3, 2019Updated 7 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- Arabic roots list resource☆12Aug 24, 2018Updated 7 years ago
- Code for https://arxiv.org/abs/1712.00254☆16Dec 6, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Text-Dependent Speaker Recognition System with Machine Learning Techniques☆10Dec 31, 2017Updated 8 years ago
- Single-channel blind source separation☆48Feb 5, 2018Updated 8 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Nov 8, 2018Updated 7 years ago
- Deep Neural Network for Speaker Count Estimation☆157Sep 5, 2020Updated 5 years ago
- Based EAST implements "Self-organized Text Detection with Minimal Post-processing via Border Learning"☆16Nov 7, 2018Updated 7 years ago
- Experiments for paper untitlted☆14Jul 25, 2020Updated 5 years ago
- Arabic Text Detection in Images☆15Apr 5, 2018Updated 7 years ago
- ☆10Jun 24, 2020Updated 5 years ago
- Archiver & backup program with fault tolerant compression☆28Apr 12, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- ☆22Dec 6, 2018Updated 7 years ago
- Remove noise from sound clips by use of supervised training and an ideal ratio mask.☆14Apr 2, 2019Updated 6 years ago
- ☆12Aug 25, 2017Updated 8 years ago
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- ☆14Sep 21, 2022Updated 3 years ago
- Fast Double Metaphone in C++11☆21Aug 26, 2014Updated 11 years ago
- Perform exploration, navigation and coverage path planning covering a room with UV energy with the Turtlebot3☆15Jul 31, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- General Navigation Models based on GNM, ViNT, NoMaD as a pytorch repo for quick and easy deployment☆14Nov 18, 2024Updated last year
- Tools for speech processing, keyword spotting☆17Mar 11, 2020Updated 6 years ago
- End-to-end speech recognition using TensorFlow☆49Apr 2, 2018Updated 7 years ago
- ☆12Nov 9, 2018Updated 7 years ago
- ☆18Oct 14, 2022Updated 3 years ago
- ☆10Oct 9, 2025Updated 5 months ago
- AsoSoft Speech Corpus can be used for spoken language processing tasks in Central Kurdish such as speech recognition, speaker recognition…☆10Mar 8, 2022Updated 4 years ago