cadia-lvl / ss_asr

A semi-supervised sequence-to-sequence ASR

☆10

Alternatives and similar repositories for ss_asr

Users who are interested in ss_asr are comparing it to the repositories listed below

BUTSpeechFIT / OOV-recovery-in-hybrid-ASR-system
☆9Updated 5 years ago
JSALT2022CodeSwitchingASR / generating-code-switched-audio
☆12Updated 5 months ago
wangfangyuan / SChunk-Encoder
SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR
☆9Updated 2 years ago
leduckhai / MultiMed-ST
MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation
☆13Updated 3 months ago
skhu101 / Bayesian_TDNN
This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…
☆9Updated 3 years ago
speechio / asr-noises
A handy dataset of noises for ASR
☆21Updated 6 years ago
bshall / dusted
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆17Updated 9 months ago
desh2608 / kaldi-noise-vectors
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Updated 4 years ago
D-Keqi / LS-Transducer-SST
☆11Updated last year
ictnlp / MonoAttn-Transducer
Code for ICML25 Paper "Overcoming Non-monotonicity in Transducer-based Streaming Generation"
☆11Updated last month
cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆14Updated 7 months ago
tiro-is / tiro-speech-core
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Updated 2 years ago
leohuang2013 / pyannote-audio_overlapped-speech-detection_cpp
C++ version of pyannote audio overlapped speech detection pipeline
☆13Updated last year
nethermanpro / ComSL
☆11Updated last year
VKW2021 / kaldi-baseline
Kaldi CNN-TDNNF baseline
☆13Updated 3 years ago
pkufool / simple-wer
A simple command line tool to calculate WER for ASR.
☆14Updated 9 months ago
xiaoxue1117 / speech-mamba-public
☆11Updated 7 months ago
xuchennlp / S2T
The project for speech translation
☆11Updated last year
KrishnaDN / BERTphone
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17Updated 4 years ago
hmohebbi / disentangling_representations
☆12Updated 9 months ago
shinshoji01 / MacST-project-page
This is the project page of our paper "MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion".
☆11Updated 4 months ago
iamanigeeit / present
☆13Updated 10 months ago
Open-Speech-EkStep / data-acquisition-pipeline
☆17Updated 4 years ago
BUTSpeechFIT / ASR-hybrid-decoding
☆16Updated 5 years ago
yuhangear / wenet-android
☆12Updated 3 years ago
KrishnaDN / E2E_ASR_Confidence_Estimation
Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"
☆16Updated 4 years ago
reppy4620 / x-vits
☆13Updated 8 months ago
csalt-research / accented-codebooks-asr
☆18Updated 10 months ago
tuanio / nextformer
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆11Updated 2 years ago
gpu-poor / gramvaani_hindi_asr
This repo contains the baseline model recipes and pre-trained model for the GramVaani Hindi ASR challenge
☆15Updated 3 years ago