An Android app that listens to conversations and determines who was speaking at any point in the conversation - a task known as speech diarization.
☆14Apr 12, 2021Updated 5 years ago
Alternatives and similar repositories for android-speech-diarization
Users that are interested in android-speech-diarization are comparing it to the libraries listed below.
- This is the home directory of the speaker diarization module being developed for Heterogeneous News data in RedHen Labs as a GSoC Project☆10Sep 11, 2015Updated 10 years ago
- Top level code to transcribe English audio/video files into text/subtitles☆21Jun 12, 2018Updated 7 years ago
- Demo WebApp using Kaldi DNN engine to convert speech to text☆11Jun 12, 2016Updated 9 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- ☆19May 16, 2015Updated 10 years ago
- SWIG bindings for Kaldi I/O, built with Conda☆15Dec 15, 2024Updated last year
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- Deep Learning for Speech Recognition based on Theano☆15Jul 28, 2017Updated 8 years ago
- This repository lets you use Kaldi to train an i-vector extractor and extract i-vectors through a Python interface.☆11Nov 27, 2017Updated 8 years ago
- DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training☆20May 23, 2019Updated 6 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Oct 3, 2023Updated 2 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- Web server to connect Kaldi speech recognizers to real-time web clients☆17Jul 9, 2014Updated 11 years ago
- ☆15Jan 24, 2017Updated 9 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 5 years ago
- Java API for the online speech recognition services provided by phon.ioc.ee☆18Jun 4, 2021Updated 4 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- Speaker Identification using GMM and Speech Recognition using HMMs☆38Apr 7, 2014Updated 12 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Nov 8, 2018Updated 7 years ago
- A tool that assists with creating labels for NNSVS training data.☆10Apr 5, 2023Updated 3 years ago
- INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generation☆21Mar 29, 2019Updated 7 years ago
- ASR library☆14Dec 3, 2018Updated 7 years ago
- Audio source separation using CASA approaches in Python.☆11Apr 2, 2015Updated 11 years ago
- A series of Playful demos that take advantage of WebRTC, Web Audio API and other new HTML5/JS technologies.☆24Jun 9, 2014Updated 11 years ago
- MATLAB functions for training and evaluating HMMs and GMMs.☆22Jan 6, 2010Updated 16 years ago
- Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…☆18Oct 27, 2020Updated 5 years ago
- voxel editing tool with OpenVDB☆13Dec 31, 2018Updated 7 years ago
- A JavaScript library for creating and rendering (with THREE.js) voxel objects.☆18Oct 19, 2014Updated 11 years ago
- Educational tutorials for speech and language processing classes☆12Jan 8, 2019Updated 7 years ago
- Text normalization scripts from IRISA lab☆14Jun 1, 2018Updated 7 years ago
- ☆14Aug 9, 2018Updated 7 years ago
- This is now the official location of the Kaldi project.☆13Jun 10, 2019Updated 6 years ago
- Phonetic and phonological vocoding platform☆17Nov 23, 2016Updated 9 years ago