☆15Mar 25, 2024Updated last year
Alternatives and similar repositories for W2V2-BERT-ASR-Training
Users who are interested in W2V2-BERT-ASR-Training are comparing it to the libraries listed below
- Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)☆13Nov 14, 2024Updated last year
- ☆17May 5, 2024Updated last year
- This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.☆22Jan 22, 2024Updated 2 years ago
- A TTS model that makes a speaker speak new languages☆76Jun 18, 2024Updated last year
- ☆17Jul 22, 2024Updated last year
- ☆15Jul 4, 2024Updated last year
- ☆19Mar 22, 2024Updated last year
- ☆22Jun 24, 2024Updated last year
- Wav2vec 2.0 Self-Supervised Pretraining☆59Feb 6, 2025Updated last year
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- ☆25Mar 6, 2024Updated 2 years ago
- 🩺🎧 Fix all your podcast, video or live stream audio! 🎧🩺☆27Jun 3, 2024Updated last year
- Tacotron2 for Korean (taKotron2)☆34Apr 8, 2022Updated 3 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36May 1, 2024Updated last year
- Implementation of Google's USM speech model in Pytorch☆35Feb 7, 2026Updated last month
- Detecting and correcting dysfluencies/stuttering/stammering in audio files☆10Apr 23, 2023Updated 2 years ago
- Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny☆15Oct 30, 2025Updated 4 months ago
- Unicode Standard tokenization routines and orthography profile segmentation☆39Feb 20, 2025Updated last year
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆38Nov 30, 2023Updated 2 years ago
- Text-to-Speech tutorial at SLTU 2016☆35May 10, 2016Updated 9 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- ☆10Oct 20, 2022Updated 3 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Jan 6, 2024Updated 2 years ago
- 🥑 IntelliJ plugin to optimize Vector Drawable 🥑☆11Apr 7, 2019Updated 6 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- "ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"☆29Dec 15, 2025Updated 2 months ago
- Modern, fast and ergonomic C++ HTTP/1.1, HTTP/2 and WebSocket server library for Linux, perfect for microservices.☆34Updated this week
- A beginner-friendly interface to finetune & run inference on open TTS models 🗣️☆28Feb 4, 2026Updated last month
- KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fas…☆23Updated this week
- Official PyTorch implementation of "Paralinguistics-Aware Speech-Empowered LLMs for Natural Conversation" (NeurIPS 2024)☆94Dec 3, 2024Updated last year
- ☆40Jan 14, 2022Updated 4 years ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆42Aug 11, 2023Updated 2 years ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated last year
- "Artificial General Intelligence For All (AGIFA)" Project☆12Feb 25, 2024Updated 2 years ago
- A lovely structopt library for C++! Parse command line arguments by defining a struct! ❤️☆11Apr 24, 2023Updated 2 years ago
- uyghur text resource crawled from website☆12Dec 25, 2015Updated 10 years ago
- Colab notebooks for d2l-book☆11Dec 5, 2019Updated 6 years ago
- Official implementation of INTERSPEECH 2022 Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals☆16Sep 19, 2025Updated 5 months ago
- ☆26Nov 3, 2025Updated 4 months ago