Neural end-to-end Speech Translation Toolkit
☆307Jun 28, 2022Updated 3 years ago
Alternatives and similar repositories for neurst
Users that are interested in neurst are comparing it to the libraries listed below
Sorting:
- End-to-end Speech Translation☆35Apr 12, 2021Updated 4 years ago
- Tracking the progress in end-to-end speech translation☆261Oct 25, 2023Updated 2 years ago
- ☆179Nov 10, 2021Updated 4 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆220Aug 26, 2022Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi☆344Dec 25, 2020Updated 5 years ago
- Code for paper "Vocabulary Learning via Optimal Transport for Neural Machine Translation"☆442Feb 2, 2022Updated 4 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- A library for speech data augmentation in time-domain☆683Aug 30, 2021Updated 4 years ago
- An adaptation of Fairseq to (End-to-end) speech translation.☆22Jun 1, 2022Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- ☆76Mar 18, 2022Updated 3 years ago
- ☆89Mar 5, 2021Updated 5 years ago
- ☆276Jan 15, 2021Updated 5 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆345May 15, 2024Updated last year
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)☆396Sep 14, 2021Updated 4 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆116Dec 22, 2021Updated 4 years ago
- Chinese text normalization for speech processing☆721Mar 18, 2023Updated 2 years ago
- Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".☆36Oct 25, 2023Updated 2 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆361Dec 24, 2021Updated 4 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- Multilingual speech translation☆41Apr 15, 2021Updated 4 years ago
- LightSeq: A High Performance Library for Sequence Processing and Generation☆3,303May 16, 2023Updated 2 years ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆292Aug 5, 2021Updated 4 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆379Jul 21, 2022Updated 3 years ago
- ☆27Aug 31, 2022Updated 3 years ago
- Tools for handling multimodal data in machine learning projects.☆1,114Feb 26, 2026Updated last week
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 7 months ago
- it's a train acoustics model code lib☆27May 20, 2020Updated 5 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆565Apr 2, 2023Updated 2 years ago
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 11 months ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆368Oct 12, 2021Updated 4 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year