A PyTorch implementation of Speech Transformer with multi-GPU support, an end-to-end ASR system using a Transformer network for Mandarin Chinese. This code follows Kaituo Xu's work.
☆10 · Dec 25, 2019 · Updated 6 years ago
Alternatives and similar repositories for Speech-Transformer-multi-GPUs
Users that are interested in Speech-Transformer-multi-GPUs are comparing it to the libraries listed below.
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Aug 18, 2023 · Updated 2 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification ☆19 · Dec 16, 2019 · Updated 6 years ago
- ☆15 · May 18, 2024 · Updated last year
- A PyTorch implementation of triplet loss on VoxCeleb1 ☆12 · Oct 16, 2019 · Updated 6 years ago
- Assist Non-native Viewers: Multimodal Crosslingual Summarization for How2 Videos ☆10 · Sep 2, 2024 · Updated last year
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement ☆10 · Jan 24, 2022 · Updated 4 years ago
- TensorFlow speech synthesis C++ inference for voicenet ☆16 · Mar 29, 2019 · Updated 6 years ago
- DCASE2020 Challenge Task 1 baseline system ☆25 · Jun 22, 2020 · Updated 5 years ago
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022) ☆14 · Jun 23, 2022 · Updated 3 years ago
- This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Spea… ☆73 · Sep 24, 2023 · Updated 2 years ago
- ☆61 · Jan 31, 2023 · Updated 3 years ago
- Anonymous ICLR Submission ☆14 · Sep 25, 2019 · Updated 6 years ago
- Generating non-stationary multi-sensor signals under a spatial coherence constraint (MATLAB) ☆50 · Sep 25, 2024 · Updated last year
- (WIP) Long-form speech generation ☆31 · Apr 2, 2025 · Updated 11 months ago
- Conferencing Speech Challenge ☆95 · Apr 6, 2021 · Updated 4 years ago
- MTGAN: Speaker Verification through Multitasking Triplet Generative Adversarial Networks ☆19 · Feb 29, 2020 · Updated 6 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition. ☆22 · May 26, 2025 · Updated 9 months ago
- ☆37 · Feb 23, 2022 · Updated 4 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa… ☆17 · May 15, 2015 · Updated 10 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques ☆64 · Apr 29, 2021 · Updated 4 years ago
- ☆36 · Sep 6, 2025 · Updated 6 months ago
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation" ☆13 · Oct 31, 2024 · Updated last year
- ☆20 · Sep 2, 2024 · Updated last year
- A PyTorch-based end-to-end speech recognition system. ☆114 · Jan 16, 2021 · Updated 5 years ago
- This repository is for wake-word detection in speech using recurrent neural networks ☆17 · Feb 25, 2019 · Updated 7 years ago
- Deep Learning Based Monaural Speech Dereverberation Models: Hope We Can Get Better Performance of Dereverberation ☆20 · Mar 16, 2022 · Updated 4 years ago
- Neural networks and losses for ASV implemented in PyTorch (Triplet loss, LMCL, Angular Loss, Softmax) ☆21 · Oct 23, 2019 · Updated 6 years ago
- A PyTorch implementation of Conv-TasNet ☆46 · Nov 25, 2019 · Updated 6 years ago
- Chinese word segmentation with a neural seq2seq model implemented in PyTorch ☆10 · Dec 13, 2017 · Updated 8 years ago
- The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement" ☆80 · Dec 8, 2022 · Updated 3 years ago
- Wake word spotting with Kaldi ☆19 · Dec 3, 2020 · Updated 5 years ago
- PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition" ☆19 · Jul 19, 2019 · Updated 6 years ago
- ☆17 · Nov 25, 2019 · Updated 6 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese. ☆12 · May 7, 2019 · Updated 6 years ago
- Latest LaTeX template for the Tianjin University class of 2021 graduation thesis. ☆13 · May 25, 2021 · Updated 4 years ago
- Acoustic echo cancellation (AEC) is a main algorithm in the pipeline of acoustic devices with KWS or ASR. FNLMS is used. ☆19 · Apr 22, 2019 · Updated 6 years ago
- Implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain ☆48 · Nov 4, 2020 · Updated 5 years ago
- This repository contains some material of speech enhancement and dereverberation. On the one hand, I summarize this work for my further u… ☆47 · Jul 6, 2020 · Updated 5 years ago
- ☆105 · Sep 2, 2021 · Updated 4 years ago