vadimkantorov/inferspeech

vadimkantorov / inferspeech

PyTorch speech2text inference script for the NVIDIA OpenSeq2Seq wav2letter model variant

☆10

Alternatives and similar repositories for inferspeech

Users that are interested in inferspeech are comparing it to the libraries listed below.

BUTSpeechFIT / ASR-hybrid-decoding
☆17 · Nov 25, 2019 · Updated 6 years ago
charlesliucn / LanMIT
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22 · Jul 12, 2019 · Updated 7 years ago
desh2608 / kaldi-noise-vectors
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13 · Feb 13, 2021 · Updated 5 years ago
SpeechColab / PySpeechColab
A library of speech gadgets.
☆15 · Oct 15, 2022 · Updated 3 years ago
sarangzambare / hey-siri
This repository is for wake-word detection in speech using recurrent neural networks
☆18 · Feb 25, 2019 · Updated 7 years ago
uhh-lt / kaldi-model-server
Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone
☆35 · Feb 18, 2022 · Updated 4 years ago
uiuc-sst / asr24
24-hour Automatic Speech Recognition
☆27 · Jun 4, 2021 · Updated 5 years ago
gpu-poor / gramvaani_hindi_asr
This repo contains the baseline model recipes and a pre-trained model for the GramVaani Hindi ASR challenge
☆16 · Mar 26, 2022 · Updated 4 years ago
domcross / german-stt-evaluation
Evaluation of STT models for the German language
☆16 · Jan 22, 2022 · Updated 4 years ago
gooofy / kaldi-adapt-lm
Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model
☆33 · Jan 26, 2020 · Updated 6 years ago
artbataev / end2end
Losses and decoders for end-to-end ASR and OCR
☆34 · Oct 30, 2020 · Updated 5 years ago
lunixbochs / feeds
Transcribe audio feeds into a public web UI
☆45 · Aug 31, 2022 · Updated 3 years ago
cadia-lvl / punctuation-prediction
Support tools for punctuation and boundary detection for ASR output.
☆55 · Dec 8, 2022 · Updated 3 years ago
idiap / inv-tn
A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)
☆21 · Sep 27, 2017 · Updated 8 years ago
sarahjuan / iban
☆14 · Jun 12, 2015 · Updated 11 years ago
pilot7747 / VoxDIY
This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.
☆16 · Jul 22, 2021 · Updated 5 years ago
averkij / Word-to-Number-Russian
A project for converting numbers written out in words in Russian.
☆11 · Apr 5, 2022 · Updated 4 years ago
cnlinxi / speech_emotion
Detect emotion from audio
☆14 · Nov 20, 2018 · Updated 7 years ago
dense-analysis / vim-speech
Vim Speech Recognition Experiments
☆20 · May 30, 2025 · Updated last year
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆32 · Apr 8, 2022 · Updated 4 years ago
JarbasAl / kaldi_spotter
Wake word spotting with Kaldi
☆19 · Dec 3, 2020 · Updated 5 years ago
yc9701 / pansori-tedxkr-corpus
Korean ASR Corpus generated from TEDx talks
☆27 · Jan 11, 2019 · Updated 7 years ago
xushengyuan / FastSing2
An improved version of the Fastsinging singing voice synthesis system.
☆21 · Nov 3, 2020 · Updated 5 years ago
Idlak / Living-Audio-Dataset
A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …
☆43 · Aug 3, 2022 · Updated 3 years ago
lwang114 / UnsupTTS
☆37 · Mar 26, 2024 · Updated 2 years ago
m-wiesner / nnet_pytorch
Kaldi-style neural network training in PyTorch for use in place of nnet3 in Kaldi.
☆26 · Jul 25, 2024 · Updated 2 years ago
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
☆16 · Apr 4, 2022 · Updated 4 years ago
Gbuomprisco / zoneless-angular
☆11 · May 7, 2023 · Updated 3 years ago
varunon9 / sentence-type-classifier
Classify English sentences into assertive, negative, interrogative, imperative and exclamatory based on grammar.
☆20 · Oct 2, 2020 · Updated 5 years ago
deborausujono / pcfgparser
Python implementation of the CYK algorithm for PCFG parsing
☆16 · May 12, 2014 · Updated 12 years ago
kevin29a / angular-janus
Angular components for using the videoroom plugin from Janus Media Server
☆12 · Dec 15, 2020 · Updated 5 years ago
stas6626 / IDRnd
ID R&D Voice Antispoofing Challenge Solution
☆11 · Jul 27, 2019 · Updated 7 years ago
diff7 / tts-king
A repository for trainable multi-speaker TTS
☆14 · Nov 28, 2021 · Updated 4 years ago
speechio / asr-noises
A handy dataset of noises for ASR
☆22 · May 29, 2019 · Updated 7 years ago
kaituoxu / X-Punctuator
A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou…
☆63 · May 13, 2020 · Updated 6 years ago
keenresearch / KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
☆29 · Jun 30, 2026 · Updated last month
alphacep / tn2-wg
Tacotron2 + Waveglow Russian
☆43 · Jan 11, 2020 · Updated 6 years ago
ikorolev72 / ffmpeg-animation-samples
Several PHP scripts with code for image animation. The sources were written for the site http://ffmpeg.unixpin.com
☆12 · Nov 26, 2018 · Updated 7 years ago
freerussianasr / recipes
☆16 · May 7, 2018 · Updated 8 years ago