A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models
☆30Apr 21, 2021Updated 5 years ago
Alternatives and similar repositories for wav2vec-toolkit
Users that are interested in wav2vec-toolkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- ☆10Mar 29, 2021Updated 5 years ago
- JAX implementation of VQGAN☆91Jul 9, 2022Updated 3 years ago
- Fastai community entry to 2020 Reproducibility Challenge☆17Oct 20, 2022Updated 3 years ago
- ☆10Jun 23, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- Common Voice Generator using Speech Synthesizer☆13Jul 28, 2021Updated 4 years ago
- 56 language, 1 model Multilingual ASR☆24Jul 25, 2021Updated 4 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Mar 21, 2021Updated 5 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80May 20, 2023Updated 2 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Aug 31, 2022Updated 3 years ago
- docker for HF wav2vec2-sprint☆13Mar 26, 2021Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆66Feb 26, 2024Updated 2 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Jul 27, 2021Updated 4 years ago
- ☆10Feb 2, 2024Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- Fast Image Integrity Checker: Scan for corrupted images using Nvidia DALI☆22Jun 20, 2021Updated 4 years ago
- ☆12Mar 17, 2026Updated last month
- JMLR Cover Letter Template☆10Dec 15, 2021Updated 4 years ago
- 모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.☆11Mar 2, 2022Updated 4 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆274Apr 2, 2022Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Apr 15, 2020Updated 6 years ago
- [DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.☆138Feb 20, 2024Updated 2 years ago
- A repo for code based language models☆18Feb 10, 2021Updated 5 years ago
- The official repo of our research work "Interactive Editing for Text Summarization".☆23Jun 3, 2023Updated 2 years ago
- Training HuggingFace models using fastai☆11Jul 22, 2021Updated 4 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 4 years ago
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…☆14May 19, 2020Updated 5 years ago
- ☆13Aug 7, 2021Updated 4 years ago
- speech to text with self-supervised learning based on wav2vec 2.0 framework☆380Nov 22, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆205Feb 22, 2022Updated 4 years ago
- Persian GPT2☆42May 28, 2021Updated 4 years ago
- 🔭 interactively explore `onnx` networks in your CLI.☆26Jun 7, 2024Updated last year
- ☆44Aug 2, 2021Updated 4 years ago
- [2022.05.16 ~ 2022.06.10] 🌤️미세먼지 없는 맑은 사진 📷 - 부스트캠프 AI Tech 3기 최종 프로젝트☆14Jun 11, 2022Updated 3 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆469Jul 13, 2023Updated 2 years ago
- Port of BaseFlight (with MultiWii 2.3 features) for STM32F4DISCOVERY board + GY-86 (mpu6050 + hmc5883 + ms5611) sensors board☆15Feb 3, 2014Updated 12 years ago