Wav2vec resources and models for Brazilian Portuguese
☆37Jul 15, 2022Updated 3 years ago
Alternatives and similar repositories for wav2vec4bp
Users that are interested in wav2vec4bp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago
- ☆12Feb 9, 2021Updated 5 years ago
- 🗣️🇧🇷 Bases de áudio transcrito em Português Brasileiro☆75Jul 22, 2025Updated 8 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80May 20, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆62Apr 11, 2023Updated 2 years ago
- A Hackable speech recognition library.☆25Oct 16, 2024Updated last year
- Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech☆22Mar 21, 2022Updated 4 years ago
- This is a legacy repo. Dev occurs now on GitHub.☆11Mar 28, 2021Updated 5 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- A unified model for zero-shot singing voice conversion and synthesis☆22Nov 30, 2022Updated 3 years ago
- Researchers who published code, models (in some cases), and demo apps (in few cases) along with their SOTA paper☆12Oct 19, 2023Updated 2 years ago
- ☕🇧🇷 Scripts para o Kaldi em Português Brasileiro☆58May 26, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 10 months ago
- Itzuli® Machine Translation Engine API libraries☆11Feb 2, 2026Updated last month
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 3 years ago
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Jun 30, 2023Updated 2 years ago
- ☆10Mar 29, 2021Updated 5 years ago
- Supervoice diffusion enhance☆28Jul 15, 2024Updated last year
- 56 language, 1 model Multilingual ASR☆24Jul 25, 2021Updated 4 years ago
- ☆26Jun 5, 2024Updated last year
- PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper☆12Mar 4, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- 👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro☆13Jul 18, 2025Updated 8 months ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Jun 23, 2022Updated 3 years ago
- ☆67Aug 16, 2023Updated 2 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆34Jul 31, 2024Updated last year
- Goodness of Pronunciation algorithm using PyKaldi☆18Jun 12, 2022Updated 3 years ago
- This is a plugin for ImageJ2 for multifractal analysis of 2D and 3D images. Cite: MULTIFRAC: An ImageJ plugin for multiscale characteriza…☆12Aug 28, 2020Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- ☆11Nov 5, 2021Updated 4 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- This is the implementation of the paper "VAW-GAN for Singing Voice Conversion withNon-parallel Training Data".☆17Aug 12, 2020Updated 5 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Jun 14, 2021Updated 4 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆109Jan 11, 2023Updated 3 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools☆470Sep 20, 2023Updated 2 years ago