A PyTorch implementation of Speech Transformer with multi-GPU support, an end-to-end ASR system using a Transformer network, for Mandarin Chinese. This code follows Kaituo Xu's work.
☆10Dec 25, 2019Updated 6 years ago
Alternatives and similar repositories for Speech-Transformer-multi-GPUs
Users who are interested in Speech-Transformer-multi-GPUs are comparing it to the repositories listed below
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- tensorflow speech synthesis c++ inference for voicenet☆16Mar 29, 2019Updated 6 years ago
- Anonymous ICLR Submission☆14Sep 25, 2019Updated 6 years ago
- ☆17Nov 25, 2019Updated 6 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…☆17May 15, 2015Updated 10 years ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Feb 25, 2019Updated 7 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Code to reproduce the experiments in the paper "Fast and stable blind source separation with rank-1 updates" presented at ICASSP 2020.☆21Apr 14, 2020Updated 5 years ago
- Detect emotion from audio☆13Nov 20, 2018Updated 7 years ago
- ☆20Sep 2, 2024Updated last year
- Acoustic echo cancelation (AEC) is a core algorithm in the pipeline of acoustic devices with KWS or ASR. FNLMS is used.☆19Apr 22, 2019Updated 6 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21May 26, 2025Updated 9 months ago
- Script to perform statistical significance test between ASR hypotheses.☆22Aug 13, 2017Updated 8 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Dec 16, 2019Updated 6 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25May 6, 2019Updated 6 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆64Apr 29, 2021Updated 4 years ago
- 24-hour Automatic Speech Recognition☆27Jun 4, 2021Updated 4 years ago
- ☆61Jan 31, 2023Updated 3 years ago
- (WIP) long-form speech generation☆31Apr 2, 2025Updated 11 months ago
- A pytorch based end2end speech recognition system.☆114Jan 16, 2021Updated 5 years ago
- Conferencing Speech Challenge☆95Apr 6, 2021Updated 4 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Mar 16, 2023Updated 2 years ago
- An AI sidekick that helps you control your computer.☆34Oct 15, 2021Updated 4 years ago
- ☆36Sep 6, 2025Updated 5 months ago
- FreeSWITCH is a Software Defined Telecom Stack enabling the digital transformation from proprietary telecom switches to a versatile softw…☆31Jul 20, 2022Updated 3 years ago
- ☆32Nov 24, 2024Updated last year
- Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.☆34Mar 22, 2021Updated 4 years ago
- Barista is an open-source framework for concurrent speech processing.☆36Mar 19, 2014Updated 11 years ago
- ☆30Jun 12, 2025Updated 8 months ago
- ☆30Dec 25, 2023Updated 2 years ago
- microphone array speech generator (MASG) in room acoustic☆39Jan 2, 2020Updated 6 years ago
- Text frontend for ESPnet tts recipes☆34Jun 1, 2021Updated 4 years ago
- ☆10Jun 24, 2020Updated 5 years ago
- The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"☆80Dec 8, 2022Updated 3 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.☆38Mar 12, 2024Updated last year
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Dec 8, 2019Updated 6 years ago
- DCASE2020 Challenge Task 1 baseline system☆25Jun 22, 2020Updated 5 years ago