Web server to connect Kaldi speech recognizers to real-time web clients
☆17Jul 9, 2014Updated 11 years ago
Alternatives and similar repositories for kaldi-web
Users that are interested in kaldi-web are comparing it to the libraries listed below
Sorting:
- Demo WebApp using Kaldi DNN engine to convert speech to text☆11Jun 12, 2016Updated 9 years ago
- Speech recognition using webrtc for FirefoxOS☆59Feb 10, 2014Updated 12 years ago
- An Android app that listens to conversations and determines who was speaking at any point in the conversation - a task known as speech di…☆14Apr 12, 2021Updated 4 years ago
- Convolutional Neural Network for multitrack mix leveling☆18Jun 25, 2018Updated 7 years ago
- A Python package for audio annotation and classifier training. Developed in collaboration with the WGBH Foundation and the American Archi…☆17Jun 2, 2018Updated 7 years ago
- Ruby speech recognition with Pocketsphinx☆13May 14, 2015Updated 10 years ago
- Java API for the online speech recognition services provided by phon.ioc.ee☆18Jun 4, 2021Updated 4 years ago
- Experiment in automatic insertion of timed transcript corrections☆21Oct 31, 2017Updated 8 years ago
- C++ Implementation of the Information Bottleneck System☆22Jan 9, 2019Updated 7 years ago
- NWJS os x desktop based application that given a video/audio file returns a transcription using IBM Watson Speech to text API☆41Jan 9, 2017Updated 9 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- Neural network model of the analog LA-2A dynamic range compressor☆23May 2, 2022Updated 3 years ago
- Top level code to transcribe English audio/video files into text/subtitles☆21Jun 12, 2018Updated 7 years ago
- A simple toolkit for speaker segmentation and identification☆31Jun 15, 2013Updated 12 years ago
- Zounds is a dataflow library for building directed acyclic graphs that transform audio. It uses the featureflow library to define the pro…☆24Dec 8, 2022Updated 3 years ago
- A simplistic web app for annotating emotions in human speech video recordings.☆28Oct 13, 2014Updated 11 years ago
- A simple javascript interface to poppler library☆38Aug 23, 2025Updated 6 months ago
- Plug-in allowing you to control the polar pattern of your OC818 microphone in up to five frequency bands. Developed by Simon, Thomas, IEM…☆36Updated this week
- Participate in the 4th U.S. National Action Plan for Open Government☆13Jun 8, 2018Updated 7 years ago
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- derivative of the klatt 3.04 synthesizer☆40Dec 27, 2015Updated 10 years ago
- Analyze Emails☆11Dec 8, 2022Updated 3 years ago
- C++ Program to detect Clipping and other overload based nonlinear distortions in Wav Files☆35Feb 4, 2022Updated 4 years ago
- A library of compressor building blocks, compressors and some general utilities.☆33May 17, 2023Updated 2 years ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne…☆217Mar 1, 2020Updated 6 years ago
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- ☆38Feb 1, 2017Updated 9 years ago
- Source code repository for the SMC paper "Musical Tempo and Key Estimation using Convolutional Neural Networks with Directional Filters".☆34Mar 24, 2023Updated 2 years ago
- Predicting breast cancer at 97.51% accuracy with Naive Bayes Classifier for learning purposes.☆13May 1, 2010Updated 15 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Nano Pi A64 firmware ( u-boot, kernel 3.10.104 / kernel 3.10.105 )☆10Jun 13, 2019Updated 6 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- Implements (some of) the Flickr API using jQuery.☆45Oct 4, 2018Updated 7 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- PyGun: Procedural Generation of Anechoic Gunshot Sounds☆14Oct 8, 2016Updated 9 years ago
- Github mirror of MediaWiki extension Wikispeech - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Develo…☆12Updated this week
- A lightweight packet-level OMNeT++ simulator designed to simulate large FatTree data center networks.☆11Nov 19, 2013Updated 12 years ago