lucasgris/wav2vec4bp

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lucasgris/wav2vec4bp)

lucasgris / wav2vec4bp

Wav2vec resources and models for Brazilian Portuguese

☆36

Alternatives and similar repositories for wav2vec4bp

Users that are interested in wav2vec4bp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
igormq / speech2text
View on GitHub
☆12Feb 9, 2021Updated 5 years ago
falabrasil / speech-datasets
View on GitHub
🗣️🇧🇷 Bases de áudio transcrito em Português Brasileiro
☆79Jun 15, 2026Updated last month
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
Edresson / Wav2Vec-Wrapper
View on GitHub
An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.
☆80May 20, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
nilc-nlp / CORAA
View on GitHub
☆64Apr 11, 2023Updated 3 years ago
scart97 / thunder-speech
View on GitHub
A Hackable speech recognition library.
☆25Oct 16, 2024Updated last year
sandraavila / vsumm
View on GitHub
This repository contains the data (datasets, video/user summaries, CUS evaluation, and results) from the paper "VSUMM: A mechanism design…
☆15Oct 13, 2024Updated last year
falabrasil / gitlab-resources
View on GitHub
This is a legacy repo. Dev occurs now on GitHub.
☆11Mar 28, 2021Updated 5 years ago
rmarcacini / ser-coraa-pt-br
View on GitHub
Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech
☆22Mar 21, 2022Updated 4 years ago
shinhyeokoh / rwen
View on GitHub
☆14Jun 16, 2023Updated 3 years ago
pkufool / simple-wer
View on GitHub
A simple command line tool to calculate WER for ASR.
☆14Updated this week
bryan051003 / USVG
View on GitHub
A unified model for zero-shot singing voice conversion and synthesis
☆22Nov 30, 2022Updated 3 years ago
sotaque-brasileiro / sotaque-brasileiro
View on GitHub
Uma base de dados para estudo de regionalismos brasileiros através da voz.
☆11May 2, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
taskswithcode / sota_researchers_with_published_code
View on GitHub
Researchers who published code, models (in some cases), and demo apps (in few cases) along with their SOTA paper
☆12Oct 19, 2023Updated 2 years ago
falabrasil / kaldi-br
View on GitHub
☕🇧🇷 Scripts para o Kaldi em Português Brasileiro
☆60May 26, 2022Updated 4 years ago
Vicomtech / itzuli-api-lib
View on GitHub
Itzuli® Machine Translation Engine API libraries
☆11Feb 2, 2026Updated 5 months ago
WangHelin1997 / Aty-TTS
View on GitHub
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11May 14, 2025Updated last year
maxidl / wav2vec2
View on GitHub
☆10Mar 29, 2021Updated 5 years ago
maum-ai / sane-tts
View on GitHub
SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech
☆11Jun 30, 2023Updated 3 years ago
freds0 / kabooks
View on GitHub
KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…
☆13Mar 24, 2023Updated 3 years ago
ex3ndr / supervoice-enhance
View on GitHub
Supervoice diffusion enhance
☆28Jul 15, 2024Updated 2 years ago
voidful / wav2vec2-xlsr-multilingual-56
View on GitHub
56 language, 1 model Multilingual ASR
☆25Jul 25, 2021Updated 5 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
Dapwner / CVAE-Tacotron
View on GitHub
☆26Jun 5, 2024Updated 2 years ago
larocs / PraCegoVer
View on GitHub
#PraCegoVer is a multi-modal dataset containing images associated to Portuguese captions based on posts from Instagram.
☆10Jun 7, 2024Updated 2 years ago
msalhab96 / MultiSpeech
View on GitHub
pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper
☆21Jun 23, 2022Updated 4 years ago
falabrasil / ufpalign
View on GitHub
👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro
☆13Jul 18, 2025Updated last year
msalhab96 / Listen-Attend-and-Spell
View on GitHub
PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper
☆12Mar 4, 2022Updated 4 years ago
sony / diffiner
View on GitHub
☆68Aug 16, 2023Updated 2 years ago
HeliosZhao / Shot-Boundary-Detection
View on GitHub
☆15Aug 3, 2019Updated 6 years ago
ivangtorre / multifrac
View on GitHub
This is a plugin for ImageJ2 for multifractal analysis of 2D and 3D images. Cite: MULTIFRAC: An ImageJ plugin for multiscale characteriza…
☆12Aug 28, 2020Updated 5 years ago
JazminVidal / gop-pykaldi
View on GitHub
Goodness of Pronunciation algorithm using PyKaldi
☆19Jun 12, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
MuyangDu / T5Voice
View on GitHub
T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …
☆28Nov 7, 2025Updated 8 months ago
freds0 / CML-TTS-Dataset
View on GitHub
CML-TTS: A Multilingual Dataset for Speech Synthesis
☆36Jul 31, 2024Updated last year
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
johndpope / Singing-Voice-Conversion-with-conditional-VAW-GAN
View on GitHub
This is the implementation of the paper "VAW-GAN for Singing Voice Conversion withNon-parallel Training Data".
☆17Aug 12, 2020Updated 5 years ago
fabianoluzbr / neural-g2p-portuguese
View on GitHub
Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…
☆19Jun 14, 2021Updated 5 years ago
dobby-seo / Wav2Keyword
View on GitHub
Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.
☆110Jan 11, 2023Updated 3 years ago