☆76Oct 25, 2021Updated 4 years ago
Alternatives and similar repositories for sew
Users that are interested in sew are comparing it to the libraries listed below
Sorting:
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- Official code for Wav2Seq☆97Jul 19, 2022Updated 3 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆468Jul 13, 2023Updated 2 years ago
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆23Aug 14, 2025Updated 6 months ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆26Dec 4, 2023Updated 2 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80May 20, 2023Updated 2 years ago
- ☆37Nov 22, 2025Updated 3 months ago
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Jun 13, 2021Updated 4 years ago
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆479Apr 5, 2024Updated last year
- ☆16Jun 13, 2022Updated 3 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Mar 21, 2021Updated 4 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆74Sep 26, 2022Updated 3 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆50May 19, 2021Updated 4 years ago
- Blitzing Fast CTC Beam Search Decoder☆186Oct 27, 2025Updated 4 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- Segment an audio file and obtain utterance alignments. (Python package)☆345May 15, 2024Updated last year
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆23Aug 16, 2021Updated 4 years ago
- Temporary anonymous version☆22Mar 20, 2024Updated last year
- ☆12Feb 9, 2021Updated 5 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆185Dec 6, 2024Updated last year
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Mar 7, 2023Updated 2 years ago
- ☆16Sep 12, 2019Updated 6 years ago
- Deepspeech ASR Model for the Catalan Language☆17Feb 15, 2021Updated 5 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Jun 2, 2023Updated 2 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- ☆17Apr 28, 2021Updated 4 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆41Feb 9, 2023Updated 3 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆211May 30, 2025Updated 9 months ago
- Library for Textless Spoken Language Processing☆555Aug 29, 2023Updated 2 years ago