Segment speech sequences based on speaker transitions, using ML and DSP.
☆17Jul 30, 2018Updated 7 years ago
Alternatives and similar repositories for Speaker-recognition
Users that are interested in Speaker-recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Neural Turing machine for source separation in Tensorflow☆18Aug 16, 2017Updated 8 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Jan 8, 2021Updated 5 years ago
- Python 3 compatible softphone with support for audio streaming.☆14Apr 18, 2024Updated 2 years ago
- Docker container for ocropus3 OCR system☆12Aug 19, 2018Updated 7 years ago
- ☆10Aug 5, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for the Kaggle crowdflower competition☆13Dec 16, 2016Updated 9 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Dec 16, 2019Updated 6 years ago
- A CNN + Sequence to Sequence model for detecting handwriting on air☆11May 25, 2017Updated 9 years ago
- a boilerplate removal algorithm☆12Mar 22, 2016Updated 10 years ago
- Sample app to help creating zip file to be trained for Einstein Vision Object Detection☆13Jun 11, 2018Updated 8 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 8 years ago
- Create dynamic web scraper in Objective-C or Ruby!☆24Mar 28, 2015Updated 11 years ago
- Retrieve simplified versions of webpages, powered by Mozilla's Readability.js☆15Oct 14, 2018Updated 7 years ago
- WIP. A directed graph editor with React, Redux and D3.js☆11Oct 3, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Drop-in component for adding additional keyboard keys to both iPad/iPhone keyboards.☆167Apr 18, 2017Updated 9 years ago
- Code for Font Classification Networks☆13Sep 11, 2017Updated 8 years ago
- Speaker diarization scripts, based on AaltoASR☆191Jan 3, 2019Updated 7 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- Deep neural models for core NLP tasks☆13Nov 9, 2017Updated 8 years ago
- Code for https://arxiv.org/abs/1712.00254☆17Dec 6, 2017Updated 8 years ago
- Input is a scanned or photographed image of handwritten text and output will be characters stored in image format.☆12Sep 21, 2016Updated 9 years ago
- Text-Dependent Speaker Recognition System with Machine Learning Techniques☆10Dec 31, 2017Updated 8 years ago
- Single-channel blind source separation☆48Feb 5, 2018Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Flask Skeleton project with Sass and Coffeescropt☆13Feb 16, 2016Updated 10 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Nov 8, 2018Updated 7 years ago
- Extends jQuery UI Draggable to add Multi Element Drag and Live functionality☆30Dec 13, 2016Updated 9 years ago
- Deep Neural Network for Speaker Count Estimation☆157Sep 5, 2020Updated 5 years ago
- Based EAST implements "Self-organized Text Detection with Minimal Post-processing via Border Learning"☆16Nov 7, 2018Updated 7 years ago
- ☆18Oct 16, 2013Updated 12 years ago
- Arabic Text Detection in Images☆15Apr 5, 2018Updated 8 years ago
- Parsing PDF files with PDFium☆12Nov 7, 2024Updated last year
- node.js client for nsq☆24Jan 9, 2017Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Batch Guetzli compressions in a manageable fashion☆16Sep 8, 2019Updated 6 years ago
- ☆10Jun 24, 2020Updated 5 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- Swift PDF objects. Doing my part to help us stay out of the headache that is Core Foundation.☆15Mar 2, 2017Updated 9 years ago
- PDF Reader in JavaScript☆16Oct 2, 2012Updated 13 years ago
- ☆22Dec 6, 2018Updated 7 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago