Vaibhavs10 / how-to-asr
☆18 · Updated 3 years ago
Alternatives and similar repositories for how-to-asr
Users that are interested in how-to-asr are comparing it to the libraries listed below
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs… ☆32 · Updated 4 years ago
- Text to Speech for Indic languages ☆52 · Updated 3 years ago
- docker for HF wav2vec2-sprint ☆13 · Updated 4 years ago
- ☆33 · Updated 6 years ago
- Researchers who published code, models (in some cases), and demo apps (in a few cases) along with their SOTA paper ☆12 · Updated 2 years ago
- Dataset Release for Intent Classification from Speech ☆48 · Updated 10 months ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 · Updated 3 years ago
- ☆99 · Updated 2 years ago
- [DEPRECATED] Audio Module for fastai v2 ☆66 · Updated 2 years ago
- A python package for whisper normalizer ☆71 · Updated 3 months ago
- Scripts for working with open.bible data ☆26 · Updated 3 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models ☆33 · Updated 3 years ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset. ☆13 · Updated 4 years ago
- ☆33 · Updated last year
- ☆47 · Updated 5 years ago
- Metaflow tutorials for ODSC West 2021 ☆64 · Updated 4 years ago
- Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages ☆11 · Updated 3 years ago
- Demo bear classifier with fastai and Voila ☆39 · Updated 2 years ago
- Open Source Speech Inferencing Library for Indic Languages ☆13 · Updated 3 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline ☆32 · Updated 2 years ago
- Dataset of sentences from Hindi stories tagged with different emotion tags ☆11 · Updated 6 years ago
- Build fast gradio demos of fastai learners ☆35 · Updated 4 years ago
- ☆41 · Updated 3 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models. ☆87 · Updated 3 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models ☆30 · Updated 4 years ago
- ☆24 · Updated 5 years ago
- MultiOCR, an interface that connects multiple open-source OCR and various Cloud OCR. ☆31 · Updated 2 years ago
- HF's ML for Audio study group ☆199 · Updated 2 years ago
- My Dream is that each one of these code snippets will become a blog post. So let's take this dream one snippet at a time :) ☆35 · Updated 5 years ago
- Minimal starting point for rapid prototyping interactive Human-AI tools ☆33 · Updated 3 years ago