anton-l/wav2vec-toolkit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/anton-l/wav2vec-toolkit)

anton-l / wav2vec-toolkit

A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models

☆30

Alternatives and similar repositories for wav2vec-toolkit

Users that are interested in wav2vec-toolkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

m3hrdadfi / zabanshenas
View on GitHub
Zabanshenas is a solution for identifying the most likely language of a piece of written text. Demo (👇 )
☆19Aug 2, 2021Updated 4 years ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
maxidl / wav2vec2
View on GitHub
☆10Mar 29, 2021Updated 5 years ago
patil-suraj / vqgan-jax
View on GitHub
JAX implementation of VQGAN
☆91Jul 9, 2022Updated 4 years ago
HLasse / multidiagnosis-speech
View on GitHub
☆10Jun 23, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
krylm / whisper-event-tuning
View on GitHub
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
☆12Dec 24, 2022Updated 3 years ago
Prem-kumar27 / Fast-KTSpeechCrawler
View on GitHub
Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler
☆23Mar 21, 2021Updated 5 years ago
voidful / wav2vec2-xlsr-multilingual-56
View on GitHub
56 language, 1 model Multilingual ASR
☆25Jul 25, 2021Updated 5 years ago
asappresearch / wav2seq
View on GitHub
Official code for Wav2Seq
☆97Jul 19, 2022Updated 4 years ago
bagustris / ssl-ser
View on GitHub
Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"
☆10Mar 15, 2023Updated 3 years ago
Edresson / Wav2Vec-Wrapper
View on GitHub
An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.
☆80May 20, 2023Updated 3 years ago
robvanvolt / DALLE-tools
View on GitHub
DALLE-tools provided useful dataset utilities to improve you workflow with WebDatasets.
☆14Mar 9, 2022Updated 4 years ago
jqueguiner / wav2vec2-sprint
View on GitHub
docker for HF wav2vec2-sprint
☆13Mar 26, 2021Updated 5 years ago
karndeb / Arxiv-Neural-Search
View on GitHub
Neural Search System on Arxiv AI/ML Papers
☆54Aug 4, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
gchhablani / multilingual-vqa
View on GitHub
Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.
☆33Jul 27, 2021Updated 5 years ago
farisalasmary / wav2vec2-kenlm
View on GitHub
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆74Oct 11, 2021Updated 4 years ago
zhegan27 / LXMERT-AdvTrain
View on GitHub
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…
☆21Oct 20, 2020Updated 5 years ago
oliverguhr / wav2vec2-live
View on GitHub
A live speech recognition using Facebooks wav2vec 2.0 model.
☆379Feb 4, 2024Updated 2 years ago
orevaahia / magnet-tokenization
View on GitHub
☆11Mar 17, 2026Updated 4 months ago
sanchit-gandhi / seq2seq-speech
View on GitHub
Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.
☆39Feb 23, 2023Updated 3 years ago
DonkeyShot21 / uis-rnn-sml
View on GitHub
A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)
☆61Apr 15, 2020Updated 6 years ago
m3hrdadfi / soxan
View on GitHub
Wav2Vec for speech recognition, classification, and audio classification
☆276Apr 2, 2022Updated 4 years ago
ko-nlp / moducorpus-sanitizer
View on GitHub
모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.
☆11Mar 2, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
mansourehk / ShEMO
View on GitHub
Sharif Emotional Speech Database
☆38Jan 9, 2021Updated 5 years ago
georgian-io / Knowledge-Distillation-Toolkit
View on GitHub
[DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.
☆138Feb 20, 2024Updated 2 years ago
DorBernsohn / CodeLM
View on GitHub
A repo for code based language models
☆18Feb 10, 2021Updated 5 years ago
qdrant / quaterion-models
View on GitHub
The collection of bulding blocks building fine-tunable metric learning models
☆35Jul 6, 2026Updated 3 weeks ago
parvathysarat / gpt2-text-generation
View on GitHub
Fine-tuning GPT-2 on articles followed by text generation
☆23Apr 19, 2022Updated 4 years ago
microsoft / Interactive-Summarization
View on GitHub
The official repo of our research work "Interactive Editing for Text Summarization".
☆23Jun 3, 2023Updated 3 years ago
aikindergarten / fasthugs
View on GitHub
Training HuggingFace models using fastai
☆11Jul 22, 2021Updated 5 years ago
mailong25 / self-supervised-speech-recognition
View on GitHub
speech to text with self-supervised learning based on wav2vec 2.0 framework
☆380Nov 22, 2021Updated 4 years ago
google-research-datasets / lareqa
View on GitHub
LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…
☆14May 19, 2020Updated 6 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
jonatasgrosman / wav2vec2-sprint
View on GitHub
☆206Feb 22, 2022Updated 4 years ago
alxmamaev / ultimate_tts
View on GitHub
☆13Aug 7, 2021Updated 4 years ago
rawbeen248 / audio_classification_finetuning
View on GitHub
This project focuses on the classification of animal sounds using deep learning. The core idea is to utilize audio processing techniques …
☆10Dec 3, 2024Updated last year
hooshvare / parsgpt
View on GitHub
Persian GPT2
☆42May 28, 2021Updated 5 years ago
tmabraham / fastai_tpu
View on GitHub
TPU support for the fastai library
☆14Apr 15, 2021Updated 5 years ago
Speech-Lab-IITM / CCC-wav2vec-2.0
View on GitHub
Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…
☆23Mar 18, 2024Updated 2 years ago
yuxiang-wu / gen-debiased-nli
View on GitHub
☆20May 12, 2022Updated 4 years ago