harveenchadha / bolLinks
Open Source Speech Inferencing Libary for Indic Languages
β12Updated 3 years ago
Alternatives and similar repositories for bol
Users that are interested in bol are comparing it to the libraries listed below
Sorting:
- Text to Speech for Indic languagesβ51Updated 3 years ago
- Repository for fine-tuning Transformers π€ based seq2seq speech models in JAX/Flax.β36Updated 2 years ago
- Dataset Release for Intent Classification from Speechβ47Updated 4 months ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languagesβ13Updated 2 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipelineβ32Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Updated last year
- Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languagesβ9Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.β22Updated 2 years ago
- β16Updated 4 months ago
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentatiβ¦β39Updated 2 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]β12Updated 5 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawlerβ24Updated 4 years ago
- A python package for whisper normalizerβ63Updated last month
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning Pβ¦β34Updated last year
- Code for AccentDB.β22Updated 4 years ago
- π― Speech Recognition Challenge by Speech Lab - IIT Madrasβ11Updated 4 years ago
- β11Updated 3 years ago
- Dataset of sentences from Hindi stories tagged with different emotion tagsβ11Updated 5 years ago
- β76Updated 3 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER modelsβ33Updated 3 years ago
- Shoonya - Platform to Annotate and label data at scale.β56Updated 10 months ago
- a repository containing the details of natural language inference dataset in Hindiβ11Updated 4 years ago
- Researchers who published code, models (in some cases), and demo apps (in few cases) along with their SOTA paperβ12Updated last year
- A library for data streaming and augmentationβ20Updated 2 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text toβ¦β45Updated 4 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR modelsβ31Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any languageβ10Updated 4 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"β11Updated 5 years ago
- c++ mosestokenizerβ18Updated last year
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/β¦β27Updated last year