Vaibhavs10 / how-to-asrLinks
☆18Updated 3 years ago
Alternatives and similar repositories for how-to-asr
Users that are interested in how-to-asr are comparing it to the libraries listed below
Sorting:
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- docker for HF wav2vec2-sprint☆13Updated 4 years ago
- Text to Speech for Indic languages☆51Updated 3 years ago
- Dataset Release for Intent Classification from Speech☆47Updated 6 months ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.☆13Updated 4 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Updated 2 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Updated 4 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆87Updated 2 years ago
- A python package for whisper normalizer☆64Updated 3 weeks ago
- Researchers who published code, models (in some cases), and demo apps (in few cases) along with their SOTA paper☆12Updated last year
- ☆47Updated 5 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 4 years ago
- scipts for working with open.bible data☆25Updated 3 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Updated 3 years ago
- Minimal starting point for rapid prototyping interactive Human-AI tools☆33Updated 3 years ago
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Updated 3 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 3 years ago
- Arabic Phonetic Dictionary Generator Tool for Automatic Speech Recognition Applications☆12Updated 3 years ago
- ☆32Updated last year
- ☆33Updated 6 years ago
- My classnotes, experiments, reproducible notebooks from fast.ai Deep Learning Class (v2)☆36Updated 7 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 3 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 3 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated 2 years ago
- (Silver medal - 60th place - Top 3%) Repository for the "Tweet Sentiment Extraction" Kaggle competition.☆10Updated 5 years ago
- ☆99Updated 2 years ago
- Neural Search System on Arxiv AI/ML Papers☆54Updated 4 years ago
- Model-Logger is a Python library for storing model's profile and rapid inter model comparison.☆61Updated 2 years ago
- [DEPRECATED] Audio Module for fastai v2☆65Updated 2 years ago