ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
☆51Mar 6, 2023Updated 3 years ago
Alternatives and similar repositories for asrecognition
Users that are interested in asrecognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Rust crate offering similar functionality to the Python transformers package using Candle.☆15Nov 19, 2024Updated last year
- HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools☆470Sep 20, 2023Updated 2 years ago
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020☆22Oct 29, 2020Updated 5 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- PyTorch reimplementation of REALM and ORQA☆22Feb 3, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆55Jan 13, 2023Updated 3 years ago
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆66Dec 26, 2025Updated 5 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- ☆206Feb 22, 2022Updated 4 years ago
- GUI for albumentations library☆11Sep 13, 2019Updated 6 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- A library of speech gadgets.☆15Oct 15, 2022Updated 3 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Aug 31, 2022Updated 3 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…☆14May 15, 2022Updated 4 years ago
- PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper☆12Mar 4, 2022Updated 4 years ago
- Oscillator-based speech syllabification algorithm☆11Sep 27, 2019Updated 6 years ago
- Non-Autoregressive Predictive Coding☆51Nov 3, 2020Updated 5 years ago
- Repo for the FB AI Speech team.☆26Aug 24, 2021Updated 4 years ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆27Dec 4, 2023Updated 2 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆379Feb 4, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.☆25Oct 3, 2022Updated 3 years ago
- ☆11Jun 11, 2026Updated last week
- The codebase for Data-driven general-purpose voice activity detection.☆93Aug 3, 2023Updated 2 years ago
- ☆18Jun 5, 2026Updated 2 weeks ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆222Aug 26, 2022Updated 3 years ago
- Converts JSON data to HTML table with collapsible details view for nested objects.☆14May 1, 2021Updated 5 years ago
- Word Error Rate Estimation☆16Aug 25, 2020Updated 5 years ago
- Wav2Vec 2.0 catalan training scripts and models☆12Jun 18, 2021Updated 5 years ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers☆49Mar 20, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆24Feb 16, 2024Updated 2 years ago
- Fast Russian Text normalization for TTS using only RegEx.☆30Updated this week
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆187Dec 6, 2024Updated last year
- High level Rust bindings for libsamplerate.☆18Sep 15, 2023Updated 2 years ago
- Deep learning model to classify relationship state in romantic couples from images and video☆15Jul 1, 2019Updated 6 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆347May 15, 2024Updated 2 years ago
- End-to-end Speech Translation☆35Apr 12, 2021Updated 5 years ago