SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
☆43Feb 9, 2023Updated 3 years ago
Alternatives and similar repositories for SHAS
Users that are interested in SHAS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Systems submitted to IWSLT 2021 by the MT-UPC group.☆14Feb 23, 2023Updated 3 years ago
- ☆13Aug 23, 2024Updated last year
- Repository containing the open source code of works published at the FBK MT unit.☆60Mar 13, 2026Updated last week
- ☆35Sep 1, 2022Updated 3 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆26Dec 12, 2024Updated last year
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆24Oct 19, 2023Updated 2 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Oct 12, 2022Updated 3 years ago
- A library for data streaming and augmentation☆21May 5, 2025Updated 10 months ago
- ☆15Nov 11, 2024Updated last year
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Measuring the Mixing of Contextual Information in the Transformer☆34May 27, 2023Updated 2 years ago
- This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)☆16Dec 1, 2022Updated 3 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆174Jun 9, 2023Updated 2 years ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆97Nov 20, 2024Updated last year
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆27Aug 11, 2024Updated last year
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated 11 months ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆22May 24, 2023Updated 2 years ago
- Neural end-to-end Speech Translation Toolkit☆307Jun 28, 2022Updated 3 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 7 months ago
- Spoken Language Translation System☆20Jul 26, 2021Updated 4 years ago
- The Fisher and CALLHOME Spanish–English Speech Translation Corpus☆41Feb 10, 2022Updated 4 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Nov 30, 2022Updated 3 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- ☆76Oct 25, 2021Updated 4 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- End-to-end Speech Translation☆35Apr 12, 2021Updated 4 years ago
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)☆32May 14, 2024Updated last year
- ☆21Mar 7, 2025Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Mar 6, 2023Updated 3 years ago
- A database of clean and noisy speech for audio research☆10Jan 26, 2018Updated 8 years ago
- A neural language modeling toolkit built on PyTorch☆19Mar 17, 2023Updated 3 years ago
- AI based singing voice synthesis database generator☆13Aug 12, 2022Updated 3 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- ☆16May 15, 2019Updated 6 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago