alphacep / vosk-text

☆8

Alternatives and similar repositories for vosk-text

Users who are interested in vosk-text are comparing it to the libraries listed below.

ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
☆14 Updated last year
JSALT2022CodeSwitchingASR / generating-code-switched-audio
☆12 Updated 5 months ago
naver / multilingual-distilwhisper
This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.
☆27 Updated last year
egorsmkv / asr-corpus-creator
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27 Updated last year
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16 Updated 3 years ago
Zhongxu-Wang / ArtSpeech
ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations
☆18 Updated 2 months ago
skhu101 / Bayesian_TDNN
This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…
☆9 Updated 3 years ago
MiniXC / phones
A collection of utilities for handling IPA phones.
☆25 Updated last year
pkufool / simple-wer
A simple command line tool to calculate WER for ASR.
☆14 Updated 9 months ago
utter-project / mHuBERT-147-scripts
Collection of scripts from mHuBERT-147.
☆29 Updated 7 months ago
alumae / streaming-punctuator
☆17 Updated 2 years ago
omogr / omogre
Russian accentuator and IPA transcriber
☆13 Updated 10 months ago
D-Keqi / LS-Transducer-SST
☆11 Updated last year
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
☆11 Updated 3 years ago
yuhangear / wenet-android
☆12 Updated 3 years ago
just-ai / speechflow
☆27 Updated last month
Hannes1 / react-native-wenet
Wenet speech to text for react native
☆10 Updated 2 years ago
shivammehta25 / BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
☆26 Updated last year
alumae / torch-xvectors-wav
☆22 Updated 4 years ago
Nathan-Roll1 / PSST
Prosodic Speech Segmentation with Transformers
☆25 Updated last year
speechio / asr-noises
A handy dataset of noises for ASR
☆21 Updated 6 years ago
kgnlp / allophant
A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.
☆25 Updated 4 months ago
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17 Updated last year
mzboito / IWSLT2022_Tamasheq_data
Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…
☆18 Updated 2 years ago
hlt-mt / Speech-MASSIVE
Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…
☆22 Updated 10 months ago
poleval / 2021-punctuation-restoration
PolEval 2021 Task 1
☆15 Updated 3 years ago
tiro-is / tiro-speech-core
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15 Updated 2 years ago
leohuang2013 / pyannote-audio_overlapped-speech-detection_cpp
C++ version of pyannote audio overlapped speech detection pipeline
☆13 Updated last year
csalt-research / accented-codebooks-asr
☆18 Updated 10 months ago
mcf330 / efts2code
source code of EfficientTTS 2
☆16 Updated last year