Two development sets for the SEAME corpus
☆42 · Dec 5, 2019 · Updated 6 years ago
Alternatives and similar repositories for SEAME-dev-set
Users that are interested in SEAME-dev-set are comparing it to the libraries listed below.
- Implementation of meta-transfer-learning for ASR and LM (ACL 2020) ☆52 · Jul 30, 2020 · Updated 5 years ago
- A curated list of research papers and resources on code-switching ☆333 · Jan 31, 2026 · Updated last month
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech … ☆28 · Nov 7, 2025 · Updated 4 months ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition. ☆262 · Oct 11, 2019 · Updated 6 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version) ☆17 · Apr 2, 2018 · Updated 7 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques ☆64 · Apr 29, 2021 · Updated 4 years ago
- Pronunciation-assisted Subword Modeling ☆31 · May 30, 2019 · Updated 6 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss. ☆59 · Sep 6, 2023 · Updated 2 years ago
- Steps to perform text-based speaker diarization with the Kaldi toolkit ☆12 · Nov 2, 2018 · Updated 7 years ago
- ☆10 · Sep 19, 2022 · Updated 3 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages. ☆21 · Jul 26, 2021 · Updated 4 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems ☆19 · Mar 7, 2021 · Updated 5 years ago
- Memory efficient transducer loss computation ☆70 · Jun 10, 2022 · Updated 3 years ago
- A SPMI Lab toolkit for language models. ☆11 · Apr 12, 2017 · Updated 8 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Aug 18, 2023 · Updated 2 years ago
- ☆13 · Mar 25, 2021 · Updated 5 years ago
- A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code… ☆10 · Dec 25, 2019 · Updated 6 years ago
- ☆25 · Jun 14, 2022 · Updated 3 years ago
- [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS ☆64 · Nov 18, 2024 · Updated last year
- ☆15 · May 8, 2021 · Updated 4 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting ☆23 · Jun 16, 2022 · Updated 3 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese. ☆12 · May 7, 2019 · Updated 6 years ago
- Util code, issues, discussions ☆29 · Aug 31, 2018 · Updated 7 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files ☆57 · Sep 1, 2025 · Updated 6 months ago
- magicspeech competition recipe ☆18 · Jun 29, 2020 · Updated 5 years ago
- ☆67 · Mar 25, 2022 · Updated 4 years ago
- [NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition ☆264 · Feb 12, 2023 · Updated 3 years ago
- ☆20 · Nov 22, 2020 · Updated 5 years ago
- ☆16 · Apr 24, 2025 · Updated 11 months ago
- The official source code of UniAudio ☆95 · Mar 29, 2024 · Updated last year
- A PyTorch Implementation of Transducer Model for End-to-End Speech Recognition ☆239 · May 12, 2020 · Updated 5 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features ☆25 · Aug 2, 2024 · Updated last year
- Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo ☆19 · Oct 8, 2020 · Updated 5 years ago
- ☆61 · Jan 31, 2023 · Updated 3 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition ☆15 · Sep 4, 2019 · Updated 6 years ago
- End-to-end spoken language identification out of the box. ☆48 · Dec 13, 2020 · Updated 5 years ago
- Mispronunciation detection code for jingju singing voice ☆20 · Sep 5, 2018 · Updated 7 years ago
- ☆135 · Sep 23, 2020 · Updated 5 years ago
- ☆18 · Apr 12, 2017 · Updated 8 years ago