biendltb / torch-istft-onnxView external linksLinks
An onnx-exportable implementation of iSTFT in torch
☆32Feb 19, 2025Updated 11 months ago
Alternatives and similar repositories for torch-istft-onnx
Users that are interested in torch-istft-onnx are comparing it to the libraries listed below
Sorting:
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Apr 23, 2024Updated last year
- Export the STFT or ISTFT process in ONNX format.☆40Nov 21, 2025Updated 2 months ago
- Train finite-state grapheme-to-phoneme transducers☆13Feb 4, 2025Updated last year
- ☆23Aug 4, 2025Updated 6 months ago
- VocalVerse: A powerful vocal evaluation framework powered by the Qwen LLMs☆37Jan 22, 2026Updated 3 weeks ago
- ☆14Aug 19, 2024Updated last year
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Dec 12, 2024Updated last year
- ☆24Mar 29, 2025Updated 10 months ago
- pytorch model for contexless-phoneme prediction from speech audio☆30Oct 30, 2025Updated 3 months ago
- Binaural Spatializer Audio Plugin☆23Jun 25, 2024Updated last year
- Megatts2 use HierSpeechpp's vocoder☆18Dec 2, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 6 months ago
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models☆22Jun 18, 2025Updated 7 months ago
- ☆16Dec 18, 2023Updated 2 years ago
- LEMAS‑TTS is a multilingual zero‑shot text‑to‑speech system, supporting 10 languages: Chinese English Spanish Russian French German Ital…☆91Jan 14, 2026Updated last month
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆43Aug 24, 2024Updated last year
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated 10 months ago
- ☆19May 2, 2024Updated last year
- Plugin to do stereo widening with decorrelation☆46May 10, 2025Updated 9 months ago
- Framework for differentiable black-box and gray-box audio effects modeling☆108Nov 8, 2025Updated 3 months ago
- Phoneme alignment representation compatible with multiple forced aligners☆22Apr 12, 2024Updated last year
- A collection of basic effects, available as open-source (MIT) C++ classes.☆98Jan 25, 2026Updated 3 weeks ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆109Aug 16, 2024Updated last year
- Free, open-source, liberally-licensed, 2-dimensional audio spectrum analyzer.☆26Aug 28, 2025Updated 5 months ago
- ☆27Sep 5, 2024Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆38Oct 28, 2025Updated 3 months ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆36Apr 29, 2025Updated 9 months ago
- High quality text-to-speech based on StyleTTS 2.☆72Updated this week
- ☆70Jan 25, 2025Updated last year
- ☆31Dec 21, 2023Updated 2 years ago
- A Neural Recorder plug to make the process of cloning external soft/hardware a bit more comfortable☆31Nov 25, 2023Updated 2 years ago
- ☆58Jun 28, 2024Updated last year
- Unofficial PyTorch implementation of "SCNet: Sparse Compression Network for Music Source Separation"☆62Apr 14, 2024Updated last year
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- ☆25Aug 2, 2024Updated last year
- ☆138Sep 8, 2025Updated 5 months ago
- Self-contained voice activity detector☆36Dec 16, 2025Updated 2 months ago
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis☆50Sep 20, 2025Updated 4 months ago