desh2608/pytorch-tdnn

PyPI-installable TDNN and TDNN-F layers for PyTorch-based acoustic model training

☆41

Alternatives and similar repositories for pytorch-tdnn

Users that are interested in pytorch-tdnn are comparing it to the libraries listed below.

desh2608 / kaldi-noise-vectors
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13 · Feb 13, 2021 · Updated 5 years ago
robflynnyh / long-context-asr
Code for the paper: How Much Context Does My Attention-Based ASR System Need?
☆11 · Jul 3, 2026 · Updated 3 weeks ago
JazminVidal / gop-pykaldi
Goodness of Pronunciation algorithm using PyKaldi
☆18 · Jun 12, 2022 · Updated 4 years ago
asteroid-team / pytorch-pit
Permutation invariant training in PyTorch
☆13 · Oct 2, 2020 · Updated 5 years ago
MarceloSancinetti / epa-gop-pykaldi
☆25 · Jun 14, 2022 · Updated 4 years ago
alumae / torch-xvectors-wav
☆22 · Jun 30, 2021 · Updated 5 years ago
qiujiali / lattice-rescore
☆16 · Jun 13, 2022 · Updated 4 years ago
RF5 / transfusion-asr
Transcribing Speech with Multinomial Diffusion, training code and models.
☆80 · Sep 27, 2023 · Updated 2 years ago
Popgun-Labs / SincNetConv
A PyTorch 1.0 implementation of the convolutions described in SincNet
☆33 · Jan 30, 2019 · Updated 7 years ago
shanguanma / Aligners
HMM, CTC, RNN-Transducer, forward-backward algorithm
☆20 · Sep 5, 2023 · Updated 2 years ago
JazminVidal / gop-ft
Transfer learning approach to pronunciation scoring
☆12 · Jan 17, 2024 · Updated 2 years ago
csikasote / BembaSpeech
This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…
☆41 · Jul 31, 2025 · Updated 11 months ago
calclavia / tal-asrd
Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations
☆39 · Jun 12, 2023 · Updated 3 years ago
luomingshuang / k2-speechbrain
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16 · Jun 17, 2022 · Updated 4 years ago
SpeechColab / PySpeechColab
A library of speech gadgets.
☆15 · Oct 15, 2022 · Updated 3 years ago
gtn-org / gtn
Automatic differentiation with weighted finite-state transducers.
☆127 · Apr 12, 2022 · Updated 4 years ago
vocaliodmiku / wav2vec2mdd-Text
☆19 · Jun 28, 2022 · Updated 4 years ago
thuhcsi / NeuFA
Neural network-based forced alignment with bidirectional attention mechanism
☆78 · Jan 17, 2025 · Updated last year
seungheondoh / hi_kia
wake-up word emotion recognition [APSIPA 2022]
☆17 · Nov 11, 2022 · Updated 3 years ago
shreyas253 / SylNet
SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech
☆27 · May 25, 2023 · Updated 3 years ago
NickRuiz / power-asr
Phonetically-Oriented Word Error Rate
☆36 · May 4, 2019 · Updated 7 years ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12 · Jun 5, 2020 · Updated 6 years ago
k2-fsa / multi_quantization
☆46 · Nov 2, 2023 · Updated 2 years ago
LeBenchmark / Interspeech2021
This repository describes our reproducible framework for assessing self-supervised representation learning from speech
☆52 · Oct 8, 2021 · Updated 4 years ago
alumae / streaming-punctuator
☆17 · Apr 14, 2023 · Updated 3 years ago
JazminVidal / gop-dnn-epadb
Goodness of Pronunciation using Kaldi on Epa-DB database
☆35 · Jan 17, 2024 · Updated 2 years ago
k2-fsa / fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆149 · Aug 25, 2023 · Updated 2 years ago
BUTSpeechFIT / AMI-diarization-setup
☆54 · Oct 17, 2023 · Updated 2 years ago
1ytic / warp-rna
Recurrent Neural Aligner
☆51 · Apr 14, 2020 · Updated 6 years ago
asteroid-team / Libri_VAD
Script to generate VAD dataset used in Asteroid recipe
☆21 · Sep 30, 2021 · Updated 4 years ago
cvqluu / Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and …
☆149 · Jan 6, 2020 · Updated 6 years ago
pzelasko / kaldialign
Python wrappers for Kaldi Levenshtein's distance and alignment code.
☆70 · Jun 15, 2026 · Updated last month
csukuangfj / optimized_transducer
Memory efficient transducer loss computation
☆70 · Jun 10, 2022 · Updated 4 years ago
gpu-poor / gramvaani_hindi_asr
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16 · Mar 26, 2022 · Updated 4 years ago
Chung-I / youtube-asr-crawler
☆10 · Sep 19, 2022 · Updated 3 years ago
MasonPhonLab / MAPS
Mason-Alberta Phonetic Segmenter
☆15 · Feb 24, 2026 · Updated 5 months ago
igormq / ctcdecode-pytorch
Python implementation of CTC beam search decoder + agnostic LM scorer
☆20 · Dec 16, 2020 · Updated 5 years ago
xinjli / ucla-phonetic-corpus
Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION
☆46 · May 12, 2023 · Updated 3 years ago
xinjli / asr2k
asr2k
☆51 · Jun 2, 2024 · Updated 2 years ago