☆76Oct 25, 2021Updated 4 years ago
Alternatives and similar repositories for sew
Users that are interested in sew are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆26Dec 4, 2023Updated 2 years ago
- Official code for Wav2Seq☆97Jul 19, 2022Updated 3 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆469Jul 13, 2023Updated 2 years ago
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆479Apr 5, 2024Updated last year
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆74Sep 26, 2022Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80May 20, 2023Updated 2 years ago
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆23Aug 14, 2025Updated 7 months ago
- ☆16Jun 13, 2022Updated 3 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- Blitzing Fast CTC Beam Search Decoder☆186Oct 27, 2025Updated 4 months ago
- Segment an audio file and obtain utterance alignments. (Python package)☆346May 15, 2024Updated last year
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆50May 19, 2021Updated 4 years ago
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- ☆37Updated this week
- Library for Textless Spoken Language Processing☆555Aug 29, 2023Updated 2 years ago
- A library for speech data augmentation in time-domain☆684Aug 30, 2021Updated 4 years ago
- Deepspeech ASR Model for the Catalan Language☆17Feb 15, 2021Updated 5 years ago
- ☆13Sep 12, 2024Updated last year
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆23Aug 16, 2021Updated 4 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆186Dec 6, 2024Updated last year
- Multistream CNN for Robust Acoustic Modeling☆40Jun 17, 2021Updated 4 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆211May 30, 2025Updated 9 months ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- ☆12Feb 9, 2021Updated 5 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,538Mar 12, 2026Updated last week
- Temporary anonymous version☆22Mar 20, 2024Updated 2 years ago
- Wav2Vec 2.0 catalan training scripts and models☆12Jun 18, 2021Updated 4 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆141Aug 3, 2023Updated 2 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆566Apr 2, 2023Updated 2 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Oct 3, 2023Updated 2 years ago
- ☆37Jun 30, 2022Updated 3 years ago