ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
☆50Mar 6, 2023Updated 3 years ago
Alternatives and similar repositories for asrecognition
Users that are interested in asrecognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Rust crate offering similar functionality to the Python transformers package using Candle.☆14Nov 19, 2024Updated last year
- HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools☆470Sep 20, 2023Updated 2 years ago
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020☆22Oct 29, 2020Updated 5 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- PyTorch reimplementation of REALM and ORQA☆22Feb 3, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆55Jan 13, 2023Updated 3 years ago
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆64Dec 26, 2025Updated 3 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- GUI for albumentations library☆11Sep 13, 2019Updated 6 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Aug 31, 2022Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆106Mar 25, 2023Updated 3 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…☆14May 15, 2022Updated 3 years ago
- PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper☆12Mar 4, 2022Updated 4 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆88Jun 18, 2024Updated last year
- Oscillator-based speech syllabification algorithm☆11Sep 27, 2019Updated 6 years ago
- Non-Autoregressive Predictive Coding☆51Nov 3, 2020Updated 5 years ago
- ☆17Mar 1, 2024Updated 2 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆26Dec 4, 2023Updated 2 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆378Feb 4, 2024Updated 2 years ago
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.☆25Oct 3, 2022Updated 3 years ago
- ☆11Mar 4, 2026Updated 3 weeks ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆221Aug 26, 2022Updated 3 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆93Aug 3, 2023Updated 2 years ago
- Converts JSON data to HTML table with collapsible details view for nested objects.☆14May 1, 2021Updated 4 years ago
- Word Error Rate Estimation☆16Aug 25, 2020Updated 5 years ago
- Wav2Vec 2.0 catalan training scripts and models☆12Jun 18, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers☆49Mar 20, 2023Updated 3 years ago
- ☆24Feb 16, 2024Updated 2 years ago
- Normalize Text in Russian☆28Nov 7, 2023Updated 2 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆187Dec 6, 2024Updated last year
- High level Rust bindings for libsamplerate.☆18Sep 15, 2023Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆346May 15, 2024Updated last year
- End-to-end Speech Translation☆35Apr 12, 2021Updated 4 years ago