The History of Speech Recognition to the Year 2030
☆13Aug 14, 2021Updated 4 years ago
Alternatives and similar repositories for future_speech
Users that are interested in future_speech are comparing it to the libraries listed below
Sorting:
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Speech Recognition Scoring Toolkit☆13Sep 30, 2015Updated 10 years ago
- An Introduction to Weighted Automata in Machine Learning☆64Sep 3, 2022Updated 3 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆24Oct 19, 2023Updated 2 years ago
- Some design patterns implements in C++.☆10Aug 14, 2024Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Jul 31, 2023Updated 2 years ago
- Recurrent Neural Aligner☆51Apr 14, 2020Updated 5 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Jun 6, 2021Updated 4 years ago
- 56 language, 1 model Multilingual ASR☆24Jul 25, 2021Updated 4 years ago
- 语音识别 语音前端处理 语音合成 语音转换等等语音技术的资料汇总☆22Nov 8, 2019Updated 6 years ago
- Awesome Automatic Speech Recognition (ASR) paper collection☆22Sep 4, 2020Updated 5 years ago
- Unofficial implementation of "Non-Autoregressive Machine Translation with Latent Alignments" https://arxiv.org/abs/2004.07437☆23Jun 14, 2020Updated 5 years ago
- ☆61Jan 31, 2023Updated 3 years ago
- Implementation of the Links Online Clustering algorithm: https://arxiv.org/abs/1801.10123☆30Oct 9, 2021Updated 4 years ago
- ☆28Oct 7, 2025Updated 4 months ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Jan 5, 2026Updated last month
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆63Oct 15, 2019Updated 6 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Mar 14, 2023Updated 2 years ago
- Solving the inverse kinematics problem of a 3 Link Planar Manipulator using neural networks.☆10Jul 19, 2020Updated 5 years ago
- ☆10Jun 21, 2022Updated 3 years ago
- Memory efficient transducer loss computation☆69Jun 10, 2022Updated 3 years ago
- ☆67Sep 13, 2022Updated 3 years ago
- Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.☆34Mar 22, 2021Updated 4 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Feb 16, 2023Updated 3 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆36Apr 11, 2023Updated 2 years ago
- ☆33Aug 19, 2019Updated 6 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated 2 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆150Aug 25, 2023Updated 2 years ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 4 years ago
- ☆41Oct 16, 2025Updated 4 months ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Oct 27, 2025Updated 4 months ago
- Another reverse proxy that provides authentication with OpenID Connect☆10Jul 10, 2023Updated 2 years ago
- ☆12Nov 8, 2023Updated 2 years ago
- Startup equity calculator☆12Dec 11, 2019Updated 6 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- ☆10Aug 3, 2020Updated 5 years ago
- A Kong plugin that allows access to an upstream url through a forward proxy (eg. squid).☆11Apr 30, 2018Updated 7 years ago
- ☆10Sep 30, 2022Updated 3 years ago