Segment speech sequences based on speaker transitions, using ML and DSP.
☆17Jul 30, 2018Updated 7 years ago
Alternatives and similar repositories for Speaker-recognition
Users that are interested in Speaker-recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Neural Turing machine for source separation in Tensorflow☆18Aug 16, 2017Updated 8 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Jan 8, 2021Updated 5 years ago
- Docker container for ocropus3 OCR system☆12Aug 19, 2018Updated 7 years ago
- Code for the Kaggle crowdflower competition☆13Dec 16, 2016Updated 9 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Dec 16, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- C++ (OpenCV) implementation of the Unsupervised Feature Learning algorithm of Adam Coates and Andrew Ng for Scene Text Detection and Reco…☆14Jun 25, 2015Updated 10 years ago
- Speaker diarization scripts, based on AaltoASR☆192Jan 3, 2019Updated 7 years ago
- Python scripts and other resources for tesing DetectNet on Nvidia DIGITS☆14Oct 10, 2017Updated 8 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- Arabic roots list resource☆12Aug 24, 2018Updated 7 years ago
- Code for https://arxiv.org/abs/1712.00254☆16Dec 6, 2017Updated 8 years ago
- Input is a scanned or photographed image of handwritten text and output will be characters stored in image format.☆12Sep 21, 2016Updated 9 years ago
- Text-Dependent Speaker Recognition System with Machine Learning Techniques☆10Dec 31, 2017Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Single-channel blind source separation☆48Feb 5, 2018Updated 8 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Nov 8, 2018Updated 7 years ago
- Deep Neural Network for Speaker Count Estimation☆157Sep 5, 2020Updated 5 years ago
- Based EAST implements "Self-organized Text Detection with Minimal Post-processing via Border Learning"☆16Nov 7, 2018Updated 7 years ago
- Experiments for paper untitlted☆14Jul 25, 2020Updated 5 years ago
- Arabic Text Detection in Images☆15Apr 5, 2018Updated 8 years ago
- node.js client for nsq☆24Jan 9, 2017Updated 9 years ago
- Batch Guetzli compressions in a manageable fashion☆16Sep 8, 2019Updated 6 years ago
- ☆10Jun 24, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Archiver & backup program with fault tolerant compression☆28Apr 12, 2024Updated 2 years ago
- ☆17Aug 21, 2018Updated 7 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- ☆22Dec 6, 2018Updated 7 years ago
- Remove noise from sound clips by use of supervised training and an ideal ratio mask.☆14Apr 2, 2019Updated 7 years ago
- Tensorflow re-implementation of the recognition part the paper "Deep TextSpotter: An End-to-End Trainable Scene Text Localization and Rec…☆24Aug 31, 2018Updated 7 years ago
- ☆12Aug 25, 2017Updated 8 years ago
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Sep 21, 2022Updated 3 years ago
- Fast Double Metaphone in C++11☆21Aug 26, 2014Updated 11 years ago
- A module for normalising text.☆10Nov 6, 2019Updated 6 years ago
- Perform exploration, navigation and coverage path planning covering a room with UV energy with the Turtlebot3☆15Jul 31, 2022Updated 3 years ago
- General Navigation Models based on GNM, ViNT, NoMaD as a pytorch repo for quick and easy deployment☆15Nov 18, 2024Updated last year
- Tools for speech processing, keyword spotting☆16Mar 11, 2020Updated 6 years ago
- Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as wel…☆24Jan 30, 2021Updated 5 years ago