An ONNX-exportable implementation of iSTFT in torch
☆32 · Feb 19, 2025 · Updated last year
Alternatives and similar repositories for torch-istft-onnx
Users who are interested in torch-istft-onnx compare it to the libraries listed below.
- Export an ONNX graph that performs ISTFT. Designed for TTS models. ☆28 · Apr 23, 2024 · Updated last year
- Export the STFT or ISTFT process in ONNX format. ☆40 · Nov 21, 2025 · Updated 3 months ago
- Train finite-state grapheme-to-phoneme transducers ☆14 · Feb 4, 2025 · Updated last year
- ☆23 · Aug 4, 2025 · Updated 7 months ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation ☆13 · Feb 18, 2026 · Updated 2 weeks ago
- ☆14 · Aug 19, 2024 · Updated last year
- VocalVerse: A powerful vocal evaluation framework powered by the Qwen LLMs ☆38 · Jan 22, 2026 · Updated last month
- Check for memory allocations in a specific thread, to validate your code for real-time purposes ☆13 · Apr 29, 2024 · Updated last year
- ☆25 · Mar 29, 2025 · Updated 11 months ago
- Binaural Spatializer Audio Plugin ☆23 · Jun 25, 2024 · Updated last year
- Megatts2 using HierSpeechpp's vocoder ☆18 · Dec 2, 2024 · Updated last year
- ☆16 · Dec 18, 2023 · Updated 2 years ago
- PyTorch model for contextless phoneme prediction from speech audio ☆32 · Oct 30, 2025 · Updated 4 months ago
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models ☆22 · Jun 18, 2025 · Updated 8 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 · Aug 1, 2025 · Updated 7 months ago
- A cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile… ☆42 · Aug 24, 2024 · Updated last year
- LEMAS‑TTS is a multilingual zero‑shot text‑to‑speech system, supporting 10 languages: Chinese English Spanish Russian French German Ital… ☆91 · Jan 14, 2026 · Updated last month
- Speaker-disentangled speech linguistic content quantizer ☆24 · Mar 19, 2025 · Updated 11 months ago
- ☆19 · May 2, 2024 · Updated last year
- Plugin to do stereo widening with decorrelation ☆47 · May 10, 2025 · Updated 10 months ago
- Framework for differentiable black-box and gray-box audio effects modeling ☆111 · Nov 8, 2025 · Updated 4 months ago
- Phoneme alignment representation compatible with multiple forced aligners ☆22 · Apr 12, 2024 · Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den… ☆109 · Aug 16, 2024 · Updated last year
- A collection of basic effects, available as open-source (MIT) C++ classes. ☆99 · Jan 25, 2026 · Updated last month
- Free, open-source, liberally-licensed, 2-dimensional audio spectrum analyzer. ☆26 · Aug 28, 2025 · Updated 6 months ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions" ☆38 · Oct 28, 2025 · Updated 4 months ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model ☆36 · Apr 29, 2025 · Updated 10 months ago
- ☆28 · Sep 5, 2024 · Updated last year
- High quality text-to-speech based on StyleTTS 2. ☆73 · Feb 25, 2026 · Updated last week
- ☆70 · Jan 25, 2025 · Updated last year
- ☆30 · Dec 21, 2023 · Updated 2 years ago
- A Neural Recorder plug to make the process of cloning external soft/hardware a bit more comfortable ☆31 · Nov 25, 2023 · Updated 2 years ago
- ☆58 · Jun 28, 2024 · Updated last year
- Unofficial PyTorch implementation of "SCNet: Sparse Compression Network for Music Source Separation" ☆61 · Apr 14, 2024 · Updated last year
- Collection of scripts from mHuBERT-147. ☆32 · Nov 19, 2024 · Updated last year
- ☆25 · Aug 2, 2024 · Updated last year
- ☆140 · Sep 8, 2025 · Updated 6 months ago
- Self-contained voice activity detector ☆37 · Dec 16, 2025 · Updated 2 months ago