rhasspy/wav2mel

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rhasspy/wav2mel)

rhasspy / wav2mel

Transform audio files into mel spectrograms for text-to-speech model training

☆12

Alternatives and similar repositories for wav2mel

Users that are interested in wav2mel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JayAhn0104 / Recommender-System-PyTorch
View on GitHub
Recommendation Model Implementation by using PyTorch
☆10Nov 1, 2022Updated 3 years ago
HudsonHuang / waveglow_vocoder
View on GitHub
A vocoder that can convert audio to Mel-Spectrogram and reverse with WaveGlow, with GPU.
☆16Feb 9, 2025Updated last year
garceling / Traffic-Monitoring-on-RPI
View on GitHub
Setup of RPI4+ Vehicle Detection and Distance Measurement+ GPS + Distracted Driving Detection
☆10Nov 5, 2021Updated 4 years ago
rhasspy / glow-tts-train
View on GitHub
An implementation of GlowTTS designed to work with Gruut
☆12Mar 9, 2022Updated 4 years ago
gulico / Plants_vs_Zombies
View on GitHub
植物大战僵尸纯c++&SDK
☆10Jun 28, 2020Updated 6 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ibaiGorordo / ONNX-TopFormer-Semantic-Segmentation
View on GitHub
Python scripts performing semantic segmentation using the TopFormer model in ONNX.
☆17Apr 14, 2022Updated 4 years ago
rubencg195 / rl-training-quadruped-robot-pybullet-openai-gym
View on GitHub
☆17Oct 27, 2025Updated 8 months ago
ibaiGorordo / depthai-TopFormer-Semantic-Segmentation
View on GitHub
Python scripts performing on devive semantic segmentation using the TopFormer model in depthai.
☆23Apr 16, 2022Updated 4 years ago
jaeyeun97 / MelNet
View on GitHub
A Pytorch Implementation of MelNet
☆26Apr 13, 2020Updated 6 years ago
rhasspy / espeak-phonemizer
View on GitHub
Uses ctypes and libespeak-ng to transform test into IPA phonemes
☆26Sep 20, 2023Updated 2 years ago
lassse / sound-controlled-intergalactic-teddy
View on GitHub
Infinite runner game where you use your voice and sounds to play.
☆26Sep 27, 2024Updated last year
mmorise / no7_singing
View on GitHub
☆14Oct 11, 2024Updated last year
SUC-DriverOld / wav2svp
View on GitHub
wav2svp: Waveform & pitchs to Synthesizer V Project
☆17Jan 9, 2025Updated last year
bayjarvis / llm
View on GitHub
Fine-tuning, DPO, RLHF, RLAIF on LLMs - Qwen3, Zephyr 7B GPTQ with 4-Bit Quantization, Mistral-7B-GPTQ
☆15Jul 5, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yxlllc / vocal-remover
View on GitHub
Vocal Remover using Deep Neural Networks
☆21Dec 31, 2024Updated last year
mkvenkit / simple_audio_pi
View on GitHub
Simple Audio Recognition on the Raspberry Pi using Machine Learning.
☆23Jan 17, 2021Updated 5 years ago
vniehues / homebridge-saphi-tv
View on GitHub
This plug-in provides Homebridge support for Philips TVs running SaphiOS.
☆10Apr 22, 2023Updated 3 years ago
SangwonSUH / realtime_YAMNET
View on GitHub
Simple real-time Sound Event Detector based on YAMNet and pyaudio.
☆23Jan 16, 2020Updated 6 years ago
AhlemGit / Arabic-WordNet-To-SQLite
View on GitHub
This repository is about how to build an SQLite version of the Arabic WordNet database.
☆11Mar 19, 2019Updated 7 years ago
Syuparn / TextGridConverter
View on GitHub
convert .lab files to .TextGrid files, which can be used in Praat
☆14Nov 2, 2018Updated 7 years ago
Hiroshiba / openjtalk-label-getter
View on GitHub
☆10Dec 10, 2021Updated 4 years ago
sdbds / LECO
View on GitHub
Low-rank adaptation for Erasing COncepts from diffusion models.
☆16Oct 18, 2023Updated 2 years ago
salimkanoun / Orthanc_Tools
View on GitHub
DICOM tools built on Orthanc API in Java
☆31Mar 25, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
logicwind / react-native-fold-detection
View on GitHub
react-native library to detect fold device and also provide information related to fold status
☆20Nov 12, 2025Updated 8 months ago
xavierfav / feature-comparison-clustering
View on GitHub
Comparing Audio Features for Unsupervised Sound Classification
☆10Jun 22, 2022Updated 4 years ago
bootphon / abkhazia
View on GitHub
ABX and kaldi experiments on speech corpora made easy
☆34Oct 7, 2024Updated last year
camenduru / audiogen-colab
View on GitHub
☆27Aug 3, 2023Updated 2 years ago
vikhyat / e_natten
View on GitHub
Blazingly fast neighborhood attention
☆15Nov 28, 2023Updated 2 years ago
manymuch / Natural-Noise-Generator
View on GitHub
☆10Aug 3, 2019Updated 6 years ago
nmhaddad / semantic-segmentation
View on GitHub
Off-Road Perception with DeepLabV3+
☆41Jul 22, 2025Updated last year
antonyharfield / tflite-models-audioset-yamnet
View on GitHub
A TFLite-compatible fork of YAMNet from tensorflow/models
☆31Jun 13, 2020Updated 6 years ago
Astel123457 / moresampler2
View on GitHub
moresampler2, written in c, with libllsm2 support
☆17May 30, 2026Updated last month
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
GPUPhobia / vocal-mask
View on GitHub
☆12May 1, 2019Updated 7 years ago
shine5402 / KiraWavTar
View on GitHub
A utility to combine/extract WAV files
☆23May 5, 2026Updated 2 months ago
david-gimeno / tailored-avsr
View on GitHub
Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
☆15Feb 24, 2025Updated last year
pettarin / ipapy
View on GitHub
ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings
☆94May 6, 2024Updated 2 years ago
muhdhuz / audio2spec
View on GitHub
Scripts to convert audio files to spectrograms and back
☆12Nov 23, 2017Updated 8 years ago
NightTrek / moondream-mcp
View on GitHub
☆19Jan 3, 2025Updated last year
zassou65535 / WaveGAN
View on GitHub
WaveGANによる音声生成器
☆13Feb 9, 2024Updated 2 years ago