awni/future_speech

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/awni/future_speech)

awni / future_speech

The History of Speech Recognition to the Year 2030

☆13

Alternatives and similar repositories for future_speech

Users that are interested in future_speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
idiap / zff_vad
View on GitHub
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆23Oct 19, 2023Updated 2 years ago
awni / automata_ml
View on GitHub
An Introduction to Weighted Automata in Machine Learning
☆64Sep 3, 2022Updated 3 years ago
chinshr / sctk
View on GitHub
Speech Recognition Scoring Toolkit
☆13Sep 30, 2015Updated 10 years ago
keonlee9420 / Comprehensive-Tacotron2
View on GitHub
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…
☆49Jul 31, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
TEAMuP-dev / audacitorch
View on GitHub
PyTorch wrappers for using your model in audacity!
☆181Aug 13, 2023Updated 2 years ago
snir-david / CPP-Design-Patterns
View on GitHub
Some design patterns implements in C++.
☆10Aug 14, 2024Updated last year
voidful / wav2vec2-xlsr-multilingual-56
View on GitHub
56 language, 1 model Multilingual ASR
☆24Jul 25, 2021Updated 4 years ago
pzelasko / kaldialign
View on GitHub
Python wrappers for Kaldi Levenshtein's distance and alignment code.
☆70Jun 15, 2026Updated last month
iamxiaoyubei / Voice-Tech-Study
View on GitHub
语音识别语音前端处理语音合成语音转换等等语音技术的资料汇总
☆23Nov 8, 2019Updated 6 years ago
EGO4D / audio-visual
View on GitHub
☆69Sep 13, 2022Updated 3 years ago
shangeth / wavencoder
View on GitHub
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…
☆92Jun 6, 2021Updated 5 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
1ytic / warp-rna
View on GitHub
Recurrent Neural Aligner
☆51Apr 14, 2020Updated 6 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
wenet-e2e / WeTextProcessing.deprecated
View on GitHub
☆61Jan 31, 2023Updated 3 years ago
csukuangfj / kaldi-hmm-gmm
View on GitHub
☆28Apr 24, 2026Updated 2 months ago
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
fakufaku / auxiva-ipa
View on GitHub
Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.
☆36Mar 22, 2021Updated 5 years ago
dogancan / expected-edit-distance
View on GitHub
Expected edit distance implementation using OpenFst tools
☆11May 13, 2015Updated 11 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
QEDan / links_clustering
View on GitHub
Implementation of the Links Online Clustering algorithm: https://arxiv.org/abs/1801.10123
☆30May 13, 2026Updated 2 months ago
skakouros / s3prl_attentive_correlation
View on GitHub
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13Nov 18, 2022Updated 3 years ago
UniversalDependencies / UD_English-PUD
View on GitHub
Parallel Universal Dependencies.
☆14Jul 10, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
vvestman / pytorch-ivectors
View on GitHub
GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…
☆63Oct 15, 2019Updated 6 years ago
nomonosound / fast-align-audio
View on GitHub
A fast python library for aligning similar audio snippets passed in as NumPy arrays
☆50Oct 27, 2025Updated 8 months ago
csteinmetz1 / pyloudnorm-eval
View on GitHub
Evaluation of a number of loudness meter implementations
☆13Aug 28, 2021Updated 4 years ago
dayanavivolab / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10Feb 29, 2024Updated 2 years ago
CMsmartvoice / Unet-TTS
View on GitHub
One-shot TTS with Improved Unseen Speaker and Style Transfer
☆37Mar 2, 2022Updated 4 years ago
hankcs / iparser
View on GitHub
Yet another dependency parser, integrated with tokenizer, tagger and visualization tool.
☆11Mar 18, 2018Updated 8 years ago
csukuangfj / optimized_transducer
View on GitHub
Memory efficient transducer loss computation
☆70Jun 10, 2022Updated 4 years ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
m3yrin / nar-latent-alignment
View on GitHub
Unofficial implementation of "Non-Autoregressive Machine Translation with Latent Alignments" https://arxiv.org/abs/2004.07437
☆23Jun 14, 2020Updated 6 years ago
FantSun / Speechflow
View on GitHub
Speechflow for emotion recognition related information decomposition
☆10Jul 27, 2021Updated 4 years ago
souvikg544 / TTS_Data_Maker
View on GitHub
Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…
☆28Mar 14, 2023Updated 3 years ago
felixSchober / ABSA-Transformer
View on GitHub
This is the repository for my NLP master thesis with the title Transfer and Multitask Learning for Aspect-Based Sentiment Analysis Using …
☆12Mar 24, 2023Updated 3 years ago
seungheondoh / speech-to-music
View on GitHub
Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]
☆17Aug 16, 2023Updated 2 years ago
apple / visatronic-demo
View on GitHub
Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis
☆15May 28, 2025Updated last year
106368015AlvinYang / Taiwanese-Food-101
View on GitHub
☆11Aug 3, 2020Updated 5 years ago