mayank-git-hub/ETE-Speech-Recognition

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mayank-git-hub/ETE-Speech-Recognition)

mayank-git-hub / ETE-Speech-Recognition

Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch

☆26

Alternatives and similar repositories for ETE-Speech-Recognition

Users that are interested in ETE-Speech-Recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wq2012 / SpeakerRecognitionCourseChinese
View on GitHub
☆17Oct 31, 2022Updated 3 years ago
LeeYongHyeok / DCM_vgg_transformer
View on GitHub
Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…
☆14Jul 2, 2020Updated 6 years ago
lightning830 / E2E-audio-speech-recognition
View on GitHub
Conformer encoder + Transformer decoder with Hybrid CTC/attention
☆12Nov 11, 2021Updated 4 years ago
mdangschat / ctc-asr
View on GitHub
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
☆123Apr 15, 2020Updated 6 years ago
jinsongpan / ASR_Course_Homework
View on GitHub
分享在深蓝学院《语音识别：从入门到精通》第一期课程学习过程中完成的课后作业，供参考。
☆21Sep 13, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
vectominist / End-to-end-ASR-Pytorch-DLHLP
View on GitHub
Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)
☆17Nov 22, 2020Updated 5 years ago
Anny8910 / Decision-Tree-Classification-on-Diabetes-Dataset
View on GitHub
It shows how to build and optimize Decision Tree Classifier of "Diabetes dataset" using Python Scikit-learn package.
☆16Aug 21, 2020Updated 5 years ago
biyoml / End-to-End-Mandarin-ASR
View on GitHub
End-to-end speech recognition on AISHELL dataset.
☆34Nov 9, 2021Updated 4 years ago
LCF2764 / autoKWS2021_1st_solution
View on GitHub
Auto-KWS 2021 Challenge 1st place solution.
☆11Jul 20, 2021Updated 5 years ago
hirokisince1998 / jasj-bibtex
View on GitHub
日本音響学会誌用BibTeXスタイルファイル
☆11Jan 24, 2022Updated 4 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
diaoenmao / Speech-Emotion-Recognition-with-Dual-Sequence-LSTM-Architecture
View on GitHub
[ICASSP 2020] Speech Emotion Recognition with Dual-Sequence LSTM Architecture
☆12Jan 17, 2025Updated last year
tbornt / phoneme_ctc
View on GitHub
Bidirectional dynamic RNN + CTC for phoneme recognition
☆47Jun 24, 2020Updated 6 years ago
hchung12 / espnet-asr
View on GitHub
☆37Dec 23, 2020Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
skaligotla / drik-panchanga
View on GitHub
Observational Indian lunisolar calendar using the Swiss ephemeris (Hindu Drik Panchanga).
☆12Oct 3, 2015Updated 10 years ago
janson9192 / autokws2021
View on GitHub
☆13Mar 25, 2021Updated 5 years ago
jtkim-kaist / end-point-detection
View on GitHub
☆10Sep 19, 2018Updated 7 years ago
zycv / Speaker-Recognition-Based-on-Deep-Learning-An-Overview
View on GitHub
This repo is to list the references papers of 《Speaker Recognition Based on Deep Learning: An Overview》
☆41Jun 26, 2021Updated 5 years ago
JoungheeKim / K-wav2vec
View on GitHub
☆87Dec 21, 2022Updated 3 years ago
yuweiwan / ASR-HMM-DNN
View on GitHub
speech recognition based on deep neural network/hidden markov model
☆10Jun 3, 2020Updated 6 years ago
paulc00 / dtree_bias_var
View on GitHub
Plot bias, variance and overall accuracy for a boosted ID3 decision tree on the SPECT Heart dataset.
☆13Jun 20, 2019Updated 7 years ago
bagustris / SER_ICSigSys2019
View on GitHub
Repository of code for Speech emotion recognition using voiced speech and attention model, submitted to ICSigSys 2019
☆13Jan 6, 2020Updated 6 years ago
SRPOL-AUI / storir
View on GitHub
☆44Mar 13, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
BitFloyd / Shot_Segmentation
View on GitHub
Project to segment video stream into separate shots
☆13Oct 30, 2018Updated 7 years ago
daanzu / kaldi_ag_training
View on GitHub
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…
☆21Jan 24, 2022Updated 4 years ago
1ytic / open_stt_e2e
View on GitHub
PyTorch end-to-end speech recognition
☆50Dec 30, 2020Updated 5 years ago
ZhaoZeyu1995 / BenNevis
View on GitHub
A Diffrentiable WFST-based End-to-End Automatic Speech Recognition toollkit with flexible topology support
☆12Feb 15, 2026Updated 5 months ago
triplet02 / KoNPron
View on GitHub
Convert Numerical Representations to Korean Pronunciation
☆14Apr 20, 2020Updated 6 years ago
zyascend / End-to-End-Speech-Recognition-Learning
View on GitHub
ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别
☆12Oct 25, 2020Updated 5 years ago
PiSchool / spoken-language-id
View on GitHub
Spoken Language Identification from Short Utterances
☆13Jul 6, 2022Updated 4 years ago
rs-dl / TSAN
View on GitHub
A Two Stage Adaptation Network (TSAN) for remote sensing images classification under single-source-mixed-multiple-target domain adaptatio…
☆16Jan 11, 2023Updated 3 years ago
JazminVidal / gop-pykaldi
View on GitHub
Goodness of Pronunciation algorithm using PyKaldi
☆19Jun 12, 2022Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
bootphon / sustained-phonation-features
View on GitHub
Python package for the extraction of speech features for sustained phonation
☆12Aug 10, 2020Updated 5 years ago
dobby-seo / Wav2Keyword
View on GitHub
Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.
☆110Jan 11, 2023Updated 3 years ago
naka-lab / HDP-GP-HSMM
View on GitHub
☆11Apr 23, 2024Updated 2 years ago
robflynnyh / long-context-asr
View on GitHub
Code for the paper: How Much Context Does My Attention-Based ASR System Need?
☆11Jul 3, 2026Updated 3 weeks ago
pooya-mohammadi / audio-classification-pytorch
View on GitHub
In this project, several approaches for training/finetuning an audio gender recognition is provided. The code can simply be used for any …
☆43Jan 11, 2025Updated last year
elianap / divexplorer
View on GitHub
☆11May 5, 2022Updated 4 years ago
audioku / meta-transfer-learning
View on GitHub
Implementation of meta-transfer-learning for ASR and LM (ACL 2020)
☆52Jul 30, 2020Updated 5 years ago