A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆53Dec 6, 2022Updated 3 years ago
Alternatives and similar repositories for MiniASR
Users that are interested in MiniASR are comparing it to the libraries listed below
Sorting:
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆96Nov 20, 2024Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆21Aug 9, 2023Updated 2 years ago
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 4 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated this week
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.☆17Aug 8, 2021Updated 4 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆50May 19, 2021Updated 4 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80May 20, 2023Updated 2 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated 2 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆74Sep 26, 2022Updated 3 years ago
- ☆25Jun 14, 2022Updated 3 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 7 months ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆55Sep 1, 2025Updated 6 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- ☆11Aug 11, 2023Updated 2 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- ☆32Jan 6, 2022Updated 4 years ago
- A PyTorch implementation of the universal neural vocoder☆67Nov 6, 2020Updated 5 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆40Aug 29, 2024Updated last year
- Pronunciation-assisted Subword Modeling☆31May 30, 2019Updated 6 years ago
- Making Espnet easier to use☆54Apr 9, 2021Updated 4 years ago
- ☆13Sep 25, 2024Updated last year
- Official code for Wav2Seq☆97Jul 19, 2022Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Oct 13, 2024Updated last year
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Oct 9, 2020Updated 5 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 8 months ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago