du-ud/kaldi-cslt

du-ud / kaldi-cslt

☆15

Alternatives and similar repositories for kaldi-cslt

Users who are interested in kaldi-cslt are comparing it to the libraries listed below.

hieule88 / SpeechSeparation
Using SepFormer
☆10 · Feb 2, 2023 · Updated 3 years ago
xinjli / asr2k
asr2k
☆51 · Jun 2, 2024 · Updated 2 years ago
dair-iitd / FloDial
☆12 · May 18, 2022 · Updated 4 years ago
foamliu / Listen-Attend-Spell-v2
PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
☆39 · Jul 25, 2019 · Updated 6 years ago
parambharat / whisper-finetuning
Repository containing code to fine-tune the Whisper ASR model
☆23 · Dec 16, 2022 · Updated 3 years ago
shaverlee / easyconnect
An automatic control program for Sangfor EasyConnect
☆13 · Nov 2, 2020 · Updated 5 years ago
dhasenfratz / TDNN-Matlab2Cpp
The code takes a time-delay neural network (TDNN) trained in Matlab and converts it to a C++ class.
☆14 · Jan 8, 2017 · Updated 9 years ago
Xuanfang1121 / CRASpell_pytorch
☆16 · Jun 18, 2022 · Updated 4 years ago
LeonWlw / asr_blockformer
E2E ASR system
☆14 · Oct 20, 2022 · Updated 3 years ago
qinyuenlp / wav2vec_finetune
ASR: fine-tune wav2vec 2.0 with transformers
☆21 · Sep 13, 2021 · Updated 4 years ago
KrishnaDN / E2E_ASR_Confidence_Estimation
Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"
☆16 · May 9, 2021 · Updated 5 years ago
YUCHEN005 / RATS-Channel-A-Speech-Data
This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…
☆16 · Oct 22, 2022 · Updated 3 years ago
pengyizhou / xjuthesis-master-template
☆23 · Mar 26, 2022 · Updated 4 years ago
DataXujing / ASR-paper
ASR tutorial: https://dataxujing.github.io/ASR-paper/
☆26 · Jul 1, 2024 · Updated 2 years ago
ddehun / DEnsity
Official repository for "DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation (ACL2023 Findings)"
☆11 · May 23, 2023 · Updated 3 years ago
maltanar / spmv-vector-cache
A Vector Caching Scheme for Streaming FPGA SpMV Accelerators
☆10 · Sep 7, 2015 · Updated 10 years ago
voidful / asr-trainer
one script for xls-r/xlsr/whisper fine-tuning
☆42 · Jun 29, 2023 · Updated 3 years ago
Sea-Snell / MLLibCpp
A machine learning library capable of training various deep neural networks (RNNs, LSTMs, DBNs, etc.) on a GPU. It makes use of auto-di…
☆10 · Aug 28, 2018 · Updated 7 years ago
krylm / whisper-event-tuning
Final training script from the Hugging Face Whisper fine-tuning event, for getting the best results from a fine-tuned model.
☆12 · Dec 24, 2022 · Updated 3 years ago
telecombcn-dl / 2018-dlsl
UPC Deep Learning for Speech and Language 2018
☆17 · Feb 26, 2018 · Updated 8 years ago
rawbeen248 / audio_classification_finetuning
This project focuses on the classification of animal sounds using deep learning. The core idea is to utilize audio processing techniques …
☆10 · Dec 3, 2024 · Updated last year
thuhcsi / NeuFA
Neural network-based forced alignment with bidirectional attention mechanism
☆78 · Jan 17, 2025 · Updated last year
sarulab-speech / whisper-asr-finetune
☆32 · Dec 4, 2022 · Updated 3 years ago
pkufool / open-commands
☆20 · Apr 2, 2025 · Updated last year
xiangxyq / kaldi_rt_decoder
using microphone
☆16 · Sep 2, 2021 · Updated 4 years ago
sea212 / Implementation-of-an-artificial-neural-network-on-a-zynq7045-fpga-using-sdsoc
This repository contains an SDSoC project which includes an implementation of a 3-layered artificial neural network (test phase only). It c…
☆12 · Oct 7, 2016 · Updated 9 years ago
eaymerich / Sparse2015
Implementation of COO, CSR, CSC, SSS and TJDS sparse matrix formats.
☆11 · Jul 15, 2015 · Updated 11 years ago
zhiyou720 / chinese_bert_ner
An NER model based on PyTorch and BERT that performs sentence segmentation and punctuation prediction
☆18 · Jul 25, 2024 · Updated last year
tuanio / nextformer
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10 · Dec 15, 2022 · Updated 3 years ago
tarun-bisht / wav2vec2-asr
wav2vec2 asr with transformers
☆16 · Oct 26, 2021 · Updated 4 years ago
nervjack2 / Speech2Unit
☆13 · Sep 25, 2024 · Updated last year
fun-audio-llm / fun-audio-llm.github.io
FunAudioLLM homepage
☆17 · Dec 11, 2024 · Updated last year
AGENDD / RWKV-ASR
This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …
☆54 · Dec 23, 2024 · Updated last year
wayne0926 / countdown
A life countdown tool written long ago, published separately because it could not run inside the blog
☆11 · Jun 9, 2022 · Updated 4 years ago
maltanar / fpga-booleanring-bfs
Hybrid BFS on Xilinx Zynq
☆18 · Jun 9, 2015 · Updated 11 years ago
xuchennlp / S2T
The project for speech translation
☆12 · Sep 28, 2023 · Updated 2 years ago
chutaklee / CantoASR
Fine-tuning Wav2Vec2.0 on Common Voice (zh-HK)
☆16 · May 8, 2022 · Updated 4 years ago
NKU-HLT / KNN-CTC
[ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels
☆42 · Mar 20, 2024 · Updated 2 years ago
caskorg / cask
Generate and tune custom architectures for sparse linear algebra
☆15 · Jan 20, 2018 · Updated 8 years ago