fchest/Speech-Transformer-multi-GPUs

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fchest/Speech-Transformer-multi-GPUs)

fchest / Speech-Transformer-multi-GPUs

A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code is followed by kaituo xu's work.

☆10

Alternatives and similar repositories for Speech-Transformer-multi-GPUs

Users that are interested in Speech-Transformer-multi-GPUs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
zengchang233 / Speaker_Verification_Tencent
View on GitHub
Deep Discriminative Embeddings for Duration Robust Speaker Verification
☆19Dec 16, 2019Updated 6 years ago
Xiaoxx18 / DeepFilterNet
View on GitHub
☆17May 18, 2024Updated 2 years ago
korokes / MCLS
View on GitHub
Assist Non-native Viewers: Multimodal Crosslingual Summarization for How2 Videos
☆10Sep 2, 2024Updated last year
matln / voxceleb_triplet-loss
View on GitHub
A Pytorch implementation of triplet loss on VoxCeleb1
☆12Oct 16, 2019Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Xiaoxx18 / FireRedASR-LLM
View on GitHub
小红书asr模型的训练代码
☆16Jan 13, 2026Updated 6 months ago
yuguochencuc / CinCGAN-SE
View on GitHub
Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement
☆10Jan 24, 2022Updated 4 years ago
toni-heittola / dcase2020_task1_baseline
View on GitHub
DCASE2020 Challenge Task 1 baseline system
☆25Jun 22, 2020Updated 6 years ago
npuichigo / ttsflow
View on GitHub
tensorflow speech synthesis c++ inference for voicenet
☆16Mar 29, 2019Updated 7 years ago
chaufanglin / Normal2Whisper
View on GitHub
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆14Oct 31, 2024Updated last year
fchest / CSENet
View on GitHub
Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022)
☆14Jun 23, 2022Updated 4 years ago
hhhaaahhhaa / ASR-TTA
View on GitHub
☆16Nov 4, 2025Updated 8 months ago
TakHemlata / RawBoost-antispoofing
View on GitHub
This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Spea…
☆78Sep 24, 2023Updated 2 years ago
wenet-e2e / WeTextProcessing.deprecated
View on GitHub
☆61Jan 31, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ICLR-DAP / Deep-Audio-Prior
View on GitHub
Anonymous ICLR Submission
☆14Sep 25, 2019Updated 6 years ago
ehabets / ANF-Generator
View on GitHub
Generating non-stationary multi-sensor signals under a spatial coherence constraint (MATLAB)
☆50Sep 25, 2024Updated last year
Mddct / simple-tts
View on GitHub
（WIP）long form speech generatoins
☆30Apr 2, 2025Updated last year
zengchang233 / MTGAN
View on GitHub
MTGAN: Speaker Verification through Multitasking Triplet Generative Adversarial Networks
☆19Feb 29, 2020Updated 6 years ago
ConferencingSpeech / ConferencingSpeech2021
View on GitHub
Conferencing Speech Challenge
☆95Apr 6, 2021Updated 5 years ago
kjw11 / Speaker-Aware-CTC
View on GitHub
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆22May 26, 2025Updated last year
cageyoko / CTC-Attention-Mispronunciation
View on GitHub
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
☆64Apr 29, 2021Updated 5 years ago
patyork / AutomaticSpeechChunker
View on GitHub
From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…
☆17May 15, 2015Updated 11 years ago
Mddct / transformer-vocos
View on GitHub
☆35Sep 6, 2025Updated 10 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
by2101 / OpenASR
View on GitHub
A pytorch based end2end speech recognition system.
☆115Jan 16, 2021Updated 5 years ago
gemengtju / L-SpEx
View on GitHub
☆39Feb 23, 2022Updated 4 years ago
tan90xx / distillw2n
View on GitHub
🤫A Lightweight One-Shot Whisper to Normal Voice Conversion Model Using Distillation of Self-Supervised Features
☆26Dec 10, 2025Updated 7 months ago
nickjw0205 / Improving-ASR-with-LLM-Description
View on GitHub
☆20Sep 2, 2024Updated last year
hshi-speech / Dereverberation-toolkit-for-REVERB-challenge
View on GitHub
Deep Learning Based Monaural Speech Dereverberation Models: Hope We Can Get Better Performance of Dereverberation
☆20Mar 16, 2022Updated 4 years ago
sarangzambare / hey-siri
View on GitHub
This repository is for wake-word detection in speech using recurrent neural networks
☆18Feb 25, 2019Updated 7 years ago
wangkenpu / Conv-TasNet-PyTorch
View on GitHub
A PyTorch implementation of Conv-TasNet
☆46Nov 25, 2019Updated 6 years ago
zengchang233 / asv_neural_network
View on GitHub
neural network and loss for asv implemented by PyTorch. (Triplet loss, LMCL, Angular Loss, Softmax)
☆21Oct 23, 2019Updated 6 years ago
dalinvip / PyTorch_Chinese_word_segmentation
View on GitHub
Chinese word segmentation with the neural seq2seq model implement in pytorch
☆10Dec 13, 2017Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Andong-Li-speech / DARCN
View on GitHub
The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"
☆80Dec 8, 2022Updated 3 years ago
bliunlpr / Robust_e2e_gan
View on GitHub
PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"
☆19Jul 19, 2019Updated 7 years ago
JarbasAl / kaldi_spotter
View on GitHub
wake word spotting with kaldi
☆19Dec 3, 2020Updated 5 years ago
JohnsonLee1999 / 2021TJUThesisLatexTemplate
View on GitHub
2021届天津大学最新毕设latex模板。
☆13May 25, 2021Updated 5 years ago
BUTSpeechFIT / ASR-hybrid-decoding
View on GitHub
☆17Nov 25, 2019Updated 6 years ago
xingchensong / Speech-Transformer-plus-2DAttention
View on GitHub
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆12May 7, 2019Updated 7 years ago
GangmingZhao / pytorch-boat
View on GitHub
This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer
☆17Mar 29, 2022Updated 4 years ago