Automatic speech recognition using neural networks
☆18 Nov 21, 2020 Updated 5 years ago
Alternatives and similar repositories for asr
Users who are interested in asr are comparing it to the libraries listed below.
- ☆15 Apr 20, 2018 Updated 7 years ago
- TensorFlow 2-based implementation of ContextNet, an improved convolutional RNN-transducer-based architecture for end-to-end speech recogni… ☆18 Oct 19, 2020 Updated 5 years ago
- ABX and Kaldi experiments on speech corpora made easy ☆33 Oct 7, 2024 Updated last year
- Support tools for punctuation and boundary detection for ASR output. ☆55 Dec 8, 2022 Updated 3 years ago
- ICCV 2019 Learning to Drive Competition Submission ☆10 Oct 21, 2019 Updated 6 years ago
- Kaldi code for doing DNN with TensorFlow ☆13 Feb 8, 2016 Updated 10 years ago
- The quantization of CNN/LSTM ☆11 Mar 26, 2017 Updated 9 years ago
- Speech Recognition Scoring Toolkit ☆13 Sep 30, 2015 Updated 10 years ago
- Reimplementation of NLP Style Transfer from Non-parallel Text with Adversarial Alignment (https://arxiv.org/abs/1705.09655) ☆14 Apr 28, 2021 Updated 4 years ago
- CTC decoder implementation in pure Python. Also supports language model decoding using KenLM. ☆37 May 3, 2024 Updated last year
- SWIG bindings for Kaldi I/O, built with Conda ☆15 Dec 15, 2024 Updated last year
- PyTorch speech2text inference script for the NVIDIA OpenSeq2Seq wav2letter model variant ☆10 Aug 12, 2019 Updated 6 years ago
- A toy-like Text-to-Speech system for Chinese/Mandarin synthesis, inspired by Tacotron, FastSpeech2, and RefineGAN. ☆15 May 25, 2022 Updated 3 years ago
- A Stress Annotated Dataset for Recognizing Everyday Stressors in SMS-like Conversational Systems ☆14 Apr 22, 2021 Updated 4 years ago
- ☆19 Aug 27, 2018 Updated 7 years ago
- ☆19 Mar 15, 2024 Updated 2 years ago
- Using Deviare to Cheat on Games: Intercepting Direct3D COM objects and making walls invisible ☆13 Jul 1, 2013 Updated 12 years ago
- A proofreading tool using Google's N-gram corpus. ☆12 Sep 2, 2022 Updated 3 years ago
- A collection of recent speaker recognition papers and their implementations. ☆90 Sep 26, 2019 Updated 6 years ago
- ☆18 May 15, 2021 Updated 4 years ago
- word2vec with a context based on sentences. ☆15 Jan 30, 2017 Updated 9 years ago
- Lightweight CNN for Robust Voice Activity Detection ☆20 Jun 30, 2023 Updated 2 years ago
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation ☆23 Feb 17, 2022 Updated 4 years ago
- Hacky implementation of ppjoin by Chuan Xiao et al. ☆19 Aug 24, 2014 Updated 11 years ago
- CUDA-Warp RNN-Transducer ☆216 Feb 22, 2023 Updated 3 years ago
- ☆10 Sep 29, 2017 Updated 8 years ago
- Utils and data sets for audio and PyTorch ☆86 Dec 30, 2021 Updated 4 years ago
- Some tools for JSGF grammar expansion. Generate sentences from a JSGF Grammar. I originally wrote this over the course of a week, so I se… ☆17 Oct 6, 2025 Updated 5 months ago
- Conversational AI Benchmark. ☆68 Jun 12, 2023 Updated 2 years ago
- An unsupervised Chinese word segmentation tool. ☆13 May 13, 2017 Updated 8 years ago
- Chinese processing ☆36 Jan 29, 2014 Updated 12 years ago
- The codebase for Data-driven general-purpose voice activity detection. ☆93 Aug 3, 2023 Updated 2 years ago
- Mutual Modality Learning code ☆15 Mar 1, 2021 Updated 5 years ago
- C# library for decoding K2 transducer models, used in speech recognition (ASR) ☆13 Aug 20, 2025 Updated 7 months ago
- Spell checker using Brill and Moore's noisy channel error model ☆12 Jan 9, 2019 Updated 7 years ago
- Makes it easier to add CORS support to Tornado apps. ☆37 Oct 17, 2019 Updated 6 years ago
- Author implementation of "Contextualized Word Representations for Reading Comprehension" (Salant et al. 2017) ☆11 Jun 14, 2018 Updated 7 years ago
- ☆26 Apr 21, 2021 Updated 4 years ago
- Poetry and rhyme, 诗古, 诗, 诗歌 ☆10 Mar 7, 2023 Updated 3 years ago