sanchit-gandhi / codesnippets
★10 · Updated last year
Alternatives and similar repositories for codesnippets
Users that are interested in codesnippets are comparing it to the libraries listed below
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ★37 · Updated 2 years ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ★149 · Updated last year
- Speaker Diarization with Transformers ★69 · Updated 3 months ago
- Final training script from the Hugging Face Whisper fine-tuning event, to get the best results on a fine-tuned model. ★12 · Updated 2 years ago
- ★20 · Updated 2 years ago
- MAFAND-MT ★59 · Updated last year
- Open TTS models, built for streaming on the edge ★43 · Updated 6 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ★27 · Updated last year
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode ★111 · Updated 3 years ago
- Consists of the largest (10K) human-annotated code-switched semantic parsing dataset & 170K generated utterances using the CST5 augmentati… ★41 · Updated 2 years ago
- Extending the laughbot project to an encoder-based transformer model fine-tuned on the same dataset for humor classification ★10 · Updated 2 years ago
- Text to Speech for Indic languages ★51 · Updated 3 years ago
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️ ★35 · Updated 3 years ago
- Using short models to classify long texts ★21 · Updated 2 years ago
- check your data, before you wreck your model ★16 · Updated 3 years ago
- Speaker diarization service ★24 · Updated 3 months ago
- ★158 · Updated 2 years ago
- Open Source Speech Inferencing Library for Indic Languages ★13 · Updated 3 years ago
- Repository containing an experimentation platform for training and running inference on wav2vec2 models. ★87 · Updated 3 years ago
- ★62 · Updated last year
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs… ★32 · Updated 4 years ago
- QLoRA with Enhanced Multi GPU Support ★37 · Updated 2 years ago
- A list of scripts/notebooks I'd like to keep handy ★18 · Updated last year
- Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn. ★18 · Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ★13 · Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ★33 · Updated 2 weeks ago
- ★131 · Updated last week
- ★47 · Updated 2 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ★16 · Updated 6 years ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages ★77 · Updated 3 years ago