jonatasgrosman / asrecognition
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
☆51 · Mar 6, 2023 · Updated 2 years ago
Alternatives and similar repositories for asrecognition
Users who are interested in asrecognition are comparing it to the libraries listed below:
- wake-up word emotion recognition [APSIPA 2022] ☆17 · Nov 11, 2022 · Updated 3 years ago
- ☆55 · Jan 13, 2023 · Updated 3 years ago
- HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools ☆469 · Sep 20, 2023 · Updated 2 years ago
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi… ☆63 · Dec 26, 2025 · Updated last month
- Tacotron2 with BERT examples ☆10 · Jul 8, 2019 · Updated 6 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation ☆12 · Jan 27, 2023 · Updated 3 years ago
- A library of speech gadgets. ☆14 · Oct 15, 2022 · Updated 3 years ago
- Repo for the FB AI Speech team. ☆25 · Aug 24, 2021 · Updated 4 years ago
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020 ☆22 · Oct 29, 2020 · Updated 5 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆75 · Oct 11, 2021 · Updated 4 years ago
- PyTorch reimplementation of REALM and ORQA ☆22 · Feb 3, 2022 · Updated 4 years ago
- A Rust crate offering similar functionality to the Python transformers package using Candle. ☆14 · Nov 19, 2024 · Updated last year
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024) ☆12 · Sep 2, 2024 · Updated last year
- ☆10 · Sep 19, 2022 · Updated 3 years ago
- ☆11 · Nov 28, 2025 · Updated 2 months ago
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq. ☆25 · Oct 3, 2022 · Updated 3 years ago
- Nvidia GPU Fan Controller for linux ☆15 · May 27, 2024 · Updated last year
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus ☆219 · Aug 26, 2022 · Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆106 · Mar 25, 2023 · Updated 2 years ago
- ☆204 · Feb 22, 2022 · Updated 3 years ago
- A tool for assignment to a slice in TensorFlow ☆20 · Jul 23, 2021 · Updated 4 years ago
- High level Rust bindings for libsamplerate. ☆18 · Sep 15, 2023 · Updated 2 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022) ☆22 · Dec 5, 2022 · Updated 3 years ago
- End-to-end Speech Translation ☆35 · Apr 12, 2021 · Updated 4 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode ☆111 · Aug 31, 2022 · Updated 3 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise. ☆72 · Aug 22, 2017 · Updated 8 years ago
- Model Fusion Based Prosody Prediction ☆17 · Mar 18, 2018 · Updated 7 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image … ☆88 · Jun 18, 2024 · Updated last year
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech ☆51 · Oct 8, 2021 · Updated 4 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT ☆74 · Sep 26, 2022 · Updated 3 years ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration… ☆328 · Sep 24, 2022 · Updated 3 years ago
- Ultrafast GAN based Vocoder for Text to Speech ☆50 · Jul 16, 2022 · Updated 3 years ago
- An open-source implementation of sequence-to-sequence based speech processing engine ☆39 · Jan 11, 2023 · Updated 3 years ago
- ONNX export and inference for SAM3. ☆48 · Dec 17, 2025 · Updated last month
- speech-aligner is a tool that produces phoneme-level time-alignment annotations from human speech and its corresponding text. ☆15 · Dec 19, 2018 · Updated 7 years ago
- Keras implement of Lazy optimizer ☆21 · Nov 24, 2019 · Updated 6 years ago
- ☆18 · Aug 9, 2018 · Updated 7 years ago
- Creating fixed-length vectors to describe RL/GA policies ☆20 · Oct 23, 2021 · Updated 4 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR. ☆19 · Nov 23, 2021 · Updated 4 years ago