sadhusamik / speech_recognition_toolsLinks

☆8

Alternatives and similar repositories for speech_recognition_tools

Users that are interested in speech_recognition_tools are comparing it to the libraries listed below

Sorting:

aispeech-lab / TinyWASE
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11Updated 4 years ago
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
☆12Updated 4 years ago
X-LANCE / BER
Balanced Error Rate for Speaker Diarization
☆32Updated 2 years ago
desh2608 / pytorch-tdnn
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
☆40Updated 4 years ago
qiujiali / lattice-rescore
☆16Updated 3 years ago
talhanai / kaldi-diar-latte
steps to perform text-based speaker diarization with kaldi toolkit
☆11Updated 6 years ago
jyhan03 / icassp22-dataset
Dataset simulation for DPCCN.
☆16Updated 2 years ago
robflynnyh / long-context-asr
Code for the paper: How Much Context Does My Attention-Based ASR System Need?
☆10Updated 2 months ago
nwpuaslp / ASC_baseline
☆20Updated 4 years ago
idiap / icassp-oov-recognition
Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"
☆17Updated 3 years ago
csukuangfj / kaldi-hmm-gmm
☆25Updated 8 months ago
desh2608 / kaldi-noise-vectors
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Updated 4 years ago
KrishnaDN / E2E_ASR_Confidence_Estimation
Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"
☆16Updated 4 years ago
skhu101 / Bayesian_TDNN
This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…
☆9Updated 3 years ago
wangfangyuan / SChunk-Encoder
SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR
☆9Updated 2 years ago
leto19 / WhiSQA
Whisper Speech Quality Assessment (WhiSQA)
☆10Updated 7 months ago
csalt-research / accented-codebooks-asr
☆18Updated 10 months ago
fgnt / mms_msg
Multipurpose Multi Speaker Mixture Signal Generator
☆44Updated 5 months ago
Aariciah / allophoible
An extension of PHOIBLE that includes features for allophones.
☆10Updated 2 years ago
Lhx94As / E2E-language-diarization
Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>
☆19Updated 3 years ago
BUTSpeechFIT / ASR-hybrid-decoding
☆16Updated 5 years ago
JSALT2022CodeSwitchingASR / generating-code-switched-audio
☆12Updated 5 months ago
BUTSpeechFIT / OOV-recovery-in-hybrid-ASR-system
☆9Updated 5 years ago
sadhusamik / fdlp_spectrogram
☆14Updated 2 years ago
yucongzh / online_speaker_diarization
☆14Updated 3 years ago
hmohebbi / disentangling_representations
☆12Updated 9 months ago
dhimasryan / TMHINT-QI-VoiceMOS2023
☆17Updated last year
mechanicalsea / sugar
Efficient Speech Processing Tookit for Automatic Speaker Recognition
☆17Updated 2 years ago
Miamoto / Conformer-NTM
☆15Updated last year
chimechallenge / chime-utils
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
☆23Updated 4 months ago