Automatic speech recognition using neural networks
☆18Nov 21, 2020Updated 5 years ago
Alternatives and similar repositories for asr
Users that are interested in asr are comparing it to the libraries listed below
Sorting:
- Simple TensorFlow implementation of skip-thought vectors☆11Dec 8, 2022Updated 3 years ago
- ☆15Apr 20, 2018Updated 7 years ago
- Support tools for punctuation and boundary detection for ASR output.☆55Dec 8, 2022Updated 3 years ago
- Download and preperation tool for free speech corpora.☆16Apr 28, 2019Updated 6 years ago
- ABX and kaldi experiments on speech corpora made easy☆33Oct 7, 2024Updated last year
- ☆18May 15, 2021Updated 4 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Dec 8, 2022Updated 3 years ago
- Conversational AI Benchmark.☆68Jun 12, 2023Updated 2 years ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆29Nov 16, 2018Updated 7 years ago
- End-to-end speech recognition using RNN Transducers in Tensorflow 2.0☆249Jul 15, 2025Updated 7 months ago
- Python measurement platform for the NanoElectronics group☆10Mar 4, 2021Updated 5 years ago
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆23Feb 17, 2022Updated 4 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- ☆34Jul 16, 2019Updated 6 years ago
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM.☆37May 3, 2024Updated last year
- ☆13Updated this week
- rabitq rust implementation☆10Feb 4, 2026Updated last month
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Listen to the weather using Sonic Pi and data from Mathematica☆11Dec 6, 2018Updated 7 years ago
- Github mirror of MediaWiki extension Wikispeech - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Develo…☆12Feb 27, 2026Updated last week
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- ☆10Jul 24, 2019Updated 6 years ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999☆10Dec 15, 2016Updated 9 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- Python wrapper for kaldi's arpa2fst☆38Aug 27, 2025Updated 6 months ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- A Tree-LSTM-based dependency tree sentiment labeler☆15May 9, 2019Updated 6 years ago
- PyGun: Procedural Generation of Anechoic Gunshot Sounds☆14Oct 8, 2016Updated 9 years ago
- TSDG: An efficient index graph for graph-based nearest neighbor search☆10Jul 14, 2022Updated 3 years ago
- Utils and data sets for audio and PyTorch☆86Dec 30, 2021Updated 4 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- ☆37Nov 22, 2025Updated 3 months ago
- The codebase for Data-driven general-purpose voice activity detection.☆93Aug 3, 2023Updated 2 years ago