jonatasgrosman/asrecognition

ASRecognition: just an easy-to-use library for Automatic Speech Recognition.

☆51

Alternatives and similar repositories for asrecognition

Users that are interested in asrecognition are comparing it to the libraries listed below.

gabrielmbmb / candle-holder
A Rust crate offering similar functionality to the Python transformers package using Candle.
☆15 · Nov 19, 2024 · Updated last year
jonatasgrosman / huggingsound
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
☆470 · Sep 20, 2023 · Updated 2 years ago
kan-bayashi / Taco2withBERT
Tacotron2 with BERT examples
☆10 · Jul 8, 2019 · Updated 7 years ago
cvqluu / dropclass_speaker
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
☆22 · Oct 29, 2020 · Updated 5 years ago
qqaatw / pytorch-realm-orqa
PyTorch reimplementation of REALM and ORQA
☆22 · Feb 3, 2022 · Updated 4 years ago
MiniXC / LightningFastSpeech2
☆55 · Jan 13, 2023 · Updated 3 years ago
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆74 · Oct 11, 2021 · Updated 4 years ago
jonatasgrosman / wav2vec2-sprint
☆206 · Feb 22, 2022 · Updated 4 years ago
Dipet / albumentations_gui
GUI for albumentations library
☆11 · Sep 13, 2019 · Updated 6 years ago
seungheondoh / hi_kia
wake-up word emotion recognition [APSIPA 2022]
☆17 · Nov 11, 2022 · Updated 3 years ago
SpeechColab / PySpeechColab
A library of speech gadgets.
☆15 · Oct 15, 2022 · Updated 3 years ago
patrickvonplaten / Wav2Vec2_PyCTCDecode
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
☆110 · Aug 31, 2022 · Updated 3 years ago
Guitaricet / my_pefty_llama
Minimal implementation of multiple PEFT methods for LLaMA fine-tuning
☆13 · May 7, 2023 · Updated 3 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
☆11 · Nov 5, 2021 · Updated 4 years ago
revsic / torch-retriever-vc
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12 · Jan 27, 2023 · Updated 3 years ago
nikhil-vartak / json-to-html-converter
Converts JSON data to HTML table with collapsible details view for nested objects.
☆14 · May 1, 2021 · Updated 5 years ago
applicaai / pyramidions
This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…
☆14 · May 15, 2022 · Updated 4 years ago
nonverbalspeech38k / nonverspeech38k
The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…
☆68 · Dec 26, 2025 · Updated 6 months ago
msalhab96 / Listen-Attend-and-Spell
PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper
☆12 · Mar 4, 2022 · Updated 4 years ago
speech-utcluj / thetaOscillator-syllable-segmentation
Oscillator-based speech syllabification algorithm
☆11 · Sep 27, 2019 · Updated 6 years ago
Alexander-H-Liu / NPC
Non-Autoregressive Predictive Coding
☆51 · Nov 3, 2020 · Updated 5 years ago
guyyariv / AudioToken
[InterSpeech 2023] The official PyTorch implementation of: "AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Imag…
☆89 · May 18, 2026 · Updated 2 months ago
facebookresearch / fbai-speech
Repo for the FB AI Speech team.
☆26 · Aug 24, 2021 · Updated 4 years ago
jasonppy / word-discovery
Word Discovery in Visually Grounded, Self-Supervised Speech Models
☆27 · Dec 4, 2023 · Updated 2 years ago
Chung-I / youtube-asr-crawler
☆10 · Sep 19, 2022 · Updated 3 years ago
oliverguhr / wav2vec2-live
Live speech recognition using Facebook's wav2vec 2.0 model.
☆378 · Feb 4, 2024 · Updated 2 years ago
George0828Zhang / simulst
PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.
☆25 · Oct 3, 2022 · Updated 3 years ago
google-research-datasets / cvss
CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus
☆220 · Aug 26, 2022 · Updated 3 years ago
RicherMans / Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
☆93 · Aug 3, 2023 · Updated 2 years ago
qcri / e-wer
Word Error Rate Estimation
☆16 · Aug 25, 2020 · Updated 5 years ago
aixplain / NoRefER
☆18 · Jun 5, 2026 · Updated last month
csukuangfj / icefall
☆11 · Updated this week
ccoreilly / wav2vec2-catala
Wav2Vec 2.0 Catalan training scripts and models
☆12 · Jun 18, 2021 · Updated 5 years ago
alvenirai / punctfix
☆24 · Feb 16, 2024 · Updated 2 years ago
thevasudevgupta / bigbird
Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers
☆49 · Mar 20, 2023 · Updated 3 years ago
harvard-edge / multilingual_kws
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
☆190 · Dec 6, 2024 · Updated last year
lumaku / ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
☆348 · May 15, 2024 · Updated 2 years ago
dqqcasia / st
End-to-end Speech Translation
☆35 · Apr 12, 2021 · Updated 5 years ago
philschmid / transformers-pytorch-text-classification
☆14 · Jan 24, 2022 · Updated 4 years ago