SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
☆41 · Feb 9, 2023 · Updated 3 years ago
Alternatives and similar repositories for SHAS
Users that are interested in SHAS are comparing it to the libraries listed below
- Systems submitted to IWSLT 2021 by the MT-UPC group. ☆14 · Feb 23, 2023 · Updated 3 years ago
- ☆13 · Aug 23, 2024 · Updated last year
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in … ☆11 · Mar 29, 2021 · Updated 4 years ago
- BurrMill core ☆22 · Nov 2, 2021 · Updated 4 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- ☆15 · Nov 11, 2024 · Updated last year
- ☆35 · Sep 1, 2022 · Updated 3 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation ☆26 · Dec 12, 2024 · Updated last year
- Repository containing the open source code of works published at the FBK MT unit. ☆60 · Jan 16, 2026 · Updated last month
- ☆12 · Jun 10, 2021 · Updated 4 years ago
- Crawling and creating a German language model resource ☆18 · Aug 23, 2022 · Updated 3 years ago
- wake-up word emotion recognition [APSIPA 2022] ☆17 · Nov 11, 2022 · Updated 3 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF. ☆24 · Aug 1, 2025 · Updated 7 months ago
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB) ☆31 · May 14, 2024 · Updated last year
- A neural language modeling toolkit built on PyTorch ☆19 · Mar 17, 2023 · Updated 2 years ago
- Sisyphus recipes for ASR ☆19 · Updated this week
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering ☆24 · Oct 19, 2023 · Updated 2 years ago
- ☆16 · Jun 13, 2022 · Updated 3 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z… ☆32 · Apr 8, 2022 · Updated 3 years ago
- Baseline convolutional ASR system in PyTorch ☆21 · Nov 16, 2023 · Updated 2 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" … ☆10 · Nov 6, 2024 · Updated last year
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR ☆11 · Oct 21, 2022 · Updated 3 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis ☆23 · Dec 31, 2021 · Updated 4 years ago
- Hybrid speech synthesiser ☆28 · Feb 18, 2019 · Updated 7 years ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions" ☆38 · Oct 28, 2025 · Updated 4 months ago
- ☆21 · Sep 24, 2018 · Updated 7 years ago
- ONNX deployment of the CREPE pitch tracker ☆26 · Oct 27, 2022 · Updated 3 years ago
- ☆54 · Jul 16, 2025 · Updated 7 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection ☆28 · Jul 6, 2022 · Updated 3 years ago
- ☆10 · Dec 22, 2023 · Updated 2 years ago
- The project for speech translation ☆12 · Sep 28, 2023 · Updated 2 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and… ☆11 · Jun 28, 2021 · Updated 4 years ago
- ☆15 · Jul 14, 2020 · Updated 5 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset ☆12 · Sep 29, 2025 · Updated 5 months ago
- Unsupervised speech activity detection system. ☆11 · Jul 2, 2018 · Updated 7 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages ☆174 · Jun 9, 2023 · Updated 2 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding" ☆11 · Apr 10, 2025 · Updated 10 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch ☆10 · Sep 30, 2024 · Updated last year
- text to speech ☆10 · Mar 19, 2024 · Updated last year