Wav2vec resources and models for Brazilian Portuguese
☆37Jul 15, 2022Updated 3 years ago
Alternatives and similar repositories for wav2vec4bp
Users that are interested in wav2vec4bp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago
- ☆12Feb 9, 2021Updated 5 years ago
- ☆64Apr 11, 2023Updated 3 years ago
- 🗣️🇧🇷 Bases de áudio transcrito em Português Brasileiro☆78Jul 22, 2025Updated 8 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80May 20, 2023Updated 2 years ago
- A Hackable speech recognition library.☆25Oct 16, 2024Updated last year
- Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech☆22Mar 21, 2022Updated 4 years ago
- This is a legacy repo. Dev occurs now on GitHub.☆11Mar 28, 2021Updated 5 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- A unified model for zero-shot singing voice conversion and synthesis☆22Nov 30, 2022Updated 3 years ago
- Researchers who published code, models (in some cases), and demo apps (in few cases) along with their SOTA paper☆12Oct 19, 2023Updated 2 years ago
- ☕🇧🇷 Scripts para o Kaldi em Português Brasileiro☆58May 26, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 11 months ago
- Itzuli® Machine Translation Engine API libraries☆11Feb 2, 2026Updated 2 months ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 3 years ago
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Jun 30, 2023Updated 2 years ago
- ☆10Mar 29, 2021Updated 5 years ago
- Supervoice diffusion enhance☆28Jul 15, 2024Updated last year
- 56 language, 1 model Multilingual ASR☆24Jul 25, 2021Updated 4 years ago
- ☆26Jun 5, 2024Updated last year
- PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper☆12Mar 4, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro☆13Jul 18, 2025Updated 9 months ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Jun 23, 2022Updated 3 years ago
- ☆68Aug 16, 2023Updated 2 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆18Jun 12, 2022Updated 3 years ago
- This is a plugin for ImageJ2 for multifractal analysis of 2D and 3D images. Cite: MULTIFRAC: An ImageJ plugin for multiscale characteriza…☆12Aug 28, 2020Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- Collection of scripts from mHuBERT-147.☆34Nov 19, 2024Updated last year
- ☆11Nov 5, 2021Updated 4 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆37Jul 31, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This is the implementation of the paper "VAW-GAN for Singing Voice Conversion withNon-parallel Training Data".☆17Aug 12, 2020Updated 5 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Jun 14, 2021Updated 4 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆109Jan 11, 2023Updated 3 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools☆471Sep 20, 2023Updated 2 years ago