zzpDapeng/Transformer-Transducer

A streamable speech recognition model with transformer encoders and RNN-T loss

☆11

Alternatives and similar repositories for Transformer-Transducer

Users who are interested in Transformer-Transducer are comparing it to the libraries listed below.

eastonYi / end-to-end_asr_pytorch
Implementations of CTC, Speech-Transformer, and CIF for end-to-end speech recognition with PyTorch
☆23 · Jul 28, 2020 · Updated 5 years ago
upskyy / Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆114 · Feb 27, 2022 · Updated 4 years ago
lightning830 / E2E-audio-speech-recognition
Conformer encoder + Transformer decoder with Hybrid CTC/attention
☆12 · Nov 11, 2021 · Updated 4 years ago
sooftware / openspeech
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
☆35 · Oct 18, 2021 · Updated 4 years ago
luckynote / openpose
OpenPose: A Real-Time Multi-Person Keypoint Detection And Multi-Threading C++ Library
☆11 · Jul 13, 2017 · Updated 9 years ago
diaoenmao / Speech-Emotion-Recognition-with-Dual-Sequence-LSTM-Architecture
[ICASSP 2020] Speech Emotion Recognition with Dual-Sequence LSTM Architecture
☆12 · Jan 17, 2025 · Updated last year
Dystopiaz / wake-up-android
Voice wake-up
☆13 · Dec 12, 2018 · Updated 7 years ago
dobby-seo / kosr
Korean speech recognition based on Transformer
☆31 · Feb 19, 2021 · Updated 5 years ago
xiaoquanjie / netlib2
Cross-platform network library using the epoll and IOCP models
☆11 · May 13, 2018 · Updated 8 years ago
ZhecanJamesWang / GLAT_SGG
Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"
☆11 · Dec 16, 2020 · Updated 5 years ago
kosaurang / google-research
Google Research
☆10 · Apr 20, 2022 · Updated 4 years ago
XinJiang1994 / HFmaml
☆10 · Dec 17, 2020 · Updated 5 years ago
rajivpoddar / mmse-port
MMSE STSA speech enhancement
☆15 · Aug 24, 2015 · Updated 10 years ago
Zeng-Jia / CMKD-MINDS
Official PyTorch implementation of Cross Modality Knowledge Distillation between A-mode Ultrasound and Surface Electromyography.
☆14 · May 23, 2023 · Updated 3 years ago
tai2 / wave_reader_and_writer
Reading and writing the Windows WAVE file format.
☆17 · Jan 24, 2022 · Updated 4 years ago
upskyy / ContextNet
PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…
☆38 · Feb 27, 2022 · Updated 4 years ago
tongjinle123 / speech-transformer-pytorch_lightning
ASR project with PyTorch Lightning
☆20 · Mar 21, 2025 · Updated last year
msalhab96 / MultiSpeech
PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer"
☆21 · Jun 23, 2022 · Updated 4 years ago
rajatkoner08 / rtn
This is a code repository for Relation Transformer Network
☆13 · Nov 30, 2021 · Updated 4 years ago
azzaouiyazid / Adaptive-Comb-Filtering-Algorithm-for-Harmonic-Signal-Enhancement
An adaptive comb filtering algorithm for the enhancement of harmonic signals in the presence of additive white noise. The algorithm impro…
☆14 · Jan 10, 2023 · Updated 3 years ago
ZhaoZeyu1995 / BenNevis
A Differentiable WFST-based End-to-End Automatic Speech Recognition toolkit with flexible topology support
☆12 · Feb 15, 2026 · Updated 5 months ago
IIGROUP / PUM
[CVPR 2021] PyTorch implementation for Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation
☆19 · May 7, 2021 · Updated 5 years ago
xiaochus / DeepModelDeploy
Deploy deep learning models on different hardware and frameworks. (TensorRT/ONNX/MNN/RKNN)
☆13 · Jan 2, 2022 · Updated 4 years ago
LibertFan / MAN
Mask Attention Networks: Rethinking and Strengthen Transformer in NAACL2021
☆14 · Jun 3, 2021 · Updated 5 years ago
giaminhgist / 3D-DAM
A reproducible 3D convolutional neural network with dual attention module (3D-DAM) for Alzheimer's disease classification
☆18 · Nov 5, 2024 · Updated last year
bootphon / sustained-phonation-features
Python package for the extraction of speech features for sustained phonation
☆12 · Aug 10, 2020 · Updated 5 years ago
buptmiao / threadpool
A simple thread pool implemented in C++ with task-queue support; concrete tasks inherit from taskbase.
☆12 · Apr 15, 2015 · Updated 11 years ago
robflynnyh / long-context-asr
Code for the paper: How Much Context Does My Attention-Based ASR System Need?
☆11 · Jul 3, 2026 · Updated 3 weeks ago
elianap / divexplorer
☆11 · May 5, 2022 · Updated 4 years ago
anushka23g / Parkinson-Disease-Classification
A Machine Learning Approach for the Diagnosis of Parkinson's Disease via Speech Analysis
☆21 · Dec 27, 2020 · Updated 5 years ago
datemoon / tf-code-acoustics
A code library for training acoustic models
☆27 · May 20, 2020 · Updated 6 years ago
richouzo / hate-speech-detection-survey
Trained Neural Networks (LSTM, HybridCNN/LSTM, PyramidCNN, Transformers, etc.) & comparison for the task of Hate Speech Detection on the …
☆21 · Dec 14, 2021 · Updated 4 years ago
fd873630 / RNN-Transducer
RNN-Transducer for Korean
☆45 · Oct 31, 2020 · Updated 5 years ago
robgon-art / BIG.art
Using Machine Learning to Create High-Res Fine Art
☆13 · Sep 24, 2023 · Updated 2 years ago
bene-ges / nemo_compatible
Useful things that work with the NVIDIA NeMo library
☆14 · Jan 20, 2024 · Updated 2 years ago
aispeech-lab / DiffuseNoiseGeneration
☆25 · Nov 23, 2021 · Updated 4 years ago
7thTool / XSocket
A simple, cross-platform, scalable socket implementation in modern C++
☆21 · Apr 22, 2021 · Updated 5 years ago
ChriswooTalent / Yolo_on_Caffe
YOLO (including YOLOv1, YOLOv2, YOLOv3) running on Caffe for Windows. Anyone who is not familiar with Linux can use this project to learn Caffe …
☆18 · Jun 15, 2018 · Updated 8 years ago
amruthraghav / FINA4350
Group project work for Text Analytics and Natural Language Processing in Finance and Fintech
☆12 · Dec 1, 2020 · Updated 5 years ago