sanchit-gandhi / codesnippets
☆10Updated last year
Alternatives and similar repositories for codesnippets
Users who are interested in codesnippets are comparing it to the libraries listed below
- Using short models to classify long texts☆21Updated 2 years ago
- Consists of the largest (10K) human-annotated code-switched semantic parsing dataset & 170K generated utterances using the CST5 augmentati…☆41Updated 3 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 3 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆38Updated 2 years ago
- QLoRA with Enhanced Multi GPU Support☆37Updated 2 years ago
- MAFAND-MT☆60Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 4 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- ☆21Updated 2 years ago
- Open Source Speech Inferencing Library for Indic Languages☆13Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆28Updated last year
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆15Updated 2 years ago
- Text to Speech for Indic languages☆52Updated 3 years ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆80Updated 3 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 3 years ago
- ☆157Updated 2 years ago
- Speaker Diarization with Transformers☆69Updated 7 months ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆150Updated 2 years ago
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Updated 2 years ago
- Tools for managing datasets for governance and training.☆87Updated 2 weeks ago
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆32Updated 4 years ago
- ☆33Updated 2 years ago
- Open TTS models, built for streaming on the edge☆45Updated 10 months ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆30Updated 4 years ago
- Experiments with generating open-source language model assistants☆97Updated 2 years ago
- A library for squeakily cleaning and filtering language datasets.☆49Updated 2 years ago
- BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots☆116Updated 9 months ago
- QLoRA for Masked Language Modeling☆22Updated 2 years ago
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️☆35Updated 3 years ago
- A tiny BERT for low-resource monolingual models☆31Updated last month