anton-l / wav2vec-toolkit
A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models
☆31Updated 3 years ago
Related projects
Alternatives and complementary repositories for wav2vec-toolkit
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆110Updated 2 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆126Updated 3 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- ☆74Updated 3 years ago
- Complementary code for our paper Automatic punctuation restoration with BERT models☆48Updated last year
- docker for HF wav2vec2-sprint☆12Updated 3 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆31Updated 3 years ago
- ☆40Updated 2 years ago
- Text to Speech for Indic languages☆48Updated 2 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Updated 2 years ago
- Dataset of sentences from Hindi stories tagged with different emotion tags☆10Updated 4 years ago
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…☆14Updated 2 years ago
- Script to train a German n-gram Language Model on articles of Wikipedia☆13Updated 6 years ago
- NTREX -- News Test References for MT Evaluation☆75Updated 5 months ago
- ☆41Updated last year
- This repo contains a set of neural transducers, e.g. sequence-to-sequence models, focusing on character-level tasks.☆72Updated last year
- Repository containing an experimentation platform for training and running inference on wav2vec2 models.☆85Updated 2 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆48Updated 2 months ago
- Zero-shot Transfer Learning from English to Arabic☆29Updated 2 years ago
- ☆15Updated 5 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆31Updated 2 years ago
- Repository with illustrations for cft-contest-2018☆12Updated 6 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆61Updated 3 years ago
- ☆40Updated 2 years ago
- A package for fine-tuning Transformers with TPUs, written in TensorFlow 2.0+☆37Updated 3 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- ☆56Updated last year
- ☆12Updated 3 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆75Updated 2 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆71Updated 3 years ago