Neural end-to-end Speech Translation Toolkit
☆307 · Jun 28, 2022 · Updated 3 years ago
Alternatives and similar repositories for neurst
Users that are interested in neurst are comparing it to the libraries listed below.
- End-to-end Speech Translation ☆35 · Apr 12, 2021 · Updated 4 years ago
- Tracking the progress in end-to-end speech translation ☆261 · Oct 25, 2023 · Updated 2 years ago
- An adaptation of Fairseq to (End-to-end) speech translation. ☆22 · Jun 1, 2022 · Updated 3 years ago
- ☆179 · Nov 10, 2021 · Updated 4 years ago
- Code for paper "Vocabulary Learning via Optimal Transport for Neural Machine Translation" ☆442 · Feb 2, 2022 · Updated 4 years ago
- Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation". ☆36 · Oct 25, 2023 · Updated 2 years ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus ☆221 · Aug 26, 2022 · Updated 3 years ago
- This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021) ☆19 · May 1, 2022 · Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 · Apr 13, 2022 · Updated 3 years ago
- ☆27 · Aug 31, 2022 · Updated 3 years ago
- A repository containing the code for speech translation papers. ☆21 · Mar 11, 2022 · Updated 4 years ago
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi ☆352 · Dec 25, 2020 · Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- LightSeq: A High Performance Library for Sequence Processing and Generation ☆3,302 · May 16, 2023 · Updated 2 years ago
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed) ☆395 · Sep 14, 2021 · Updated 4 years ago
- ☆277 · Jan 15, 2021 · Updated 5 years ago
- ☆89 · Mar 5, 2021 · Updated 5 years ago
- ☆167 · Dec 24, 2021 · Updated 4 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture" ☆116 · Dec 22, 2021 · Updated 4 years ago
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021 ☆48 · Feb 21, 2022 · Updated 4 years ago
- End-to-end ASR/LM implementation with PyTorch ☆594 · Aug 30, 2021 · Updated 4 years ago
- code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022) ☆65 · May 25, 2022 · Updated 3 years ago
- CUDA-Warp RNN-Transducer ☆216 · Feb 22, 2023 · Updated 3 years ago
- ☆26 · Apr 21, 2021 · Updated 4 years ago
- ☆55 · Jan 13, 2023 · Updated 3 years ago
- A library for speech data augmentation in time-domain ☆684 · Aug 30, 2021 · Updated 4 years ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆346 · May 15, 2024 · Updated last year
- ☆76 · Mar 18, 2022 · Updated 4 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation ☆566 · Apr 2, 2023 · Updated 2 years ago
- Implementations of CTC, Speech-Transformer and CIF for end-to-end speech recognition with PyTorch ☆23 · Jul 28, 2020 · Updated 5 years ago
- Chinese text normalization for speech processing ☆722 · Mar 18, 2023 · Updated 3 years ago
- ☆16 · Jun 13, 2022 · Updated 3 years ago
- Multilingual speech translation ☆41 · Apr 15, 2021 · Updated 4 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation ☆43 · Feb 9, 2023 · Updated 3 years ago
- (WIP) long-form speech generation ☆31 · Apr 2, 2025 · Updated 11 months ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition ☆379 · Jul 21, 2022 · Updated 3 years ago
- A pytorch_lightning reimplementation of the Transducer module from ESPnet. ☆78 · Mar 11, 2021 · Updated 5 years ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset ☆362 · Dec 24, 2021 · Updated 4 years ago
- Working online speech recognition based on RNN Transducer. (Trained model release available in release) ☆292 · Aug 5, 2021 · Updated 4 years ago