biendltb/torch-istft-onnx

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/biendltb/torch-istft-onnx)

biendltb / torch-istft-onnx

An onnx-exportable implementation of iSTFT in torch

☆35

Alternatives and similar repositories for torch-istft-onnx

Users that are interested in torch-istft-onnx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mush42 / istft-onnx
View on GitHub
Export an ONNX graph that performs ISTFT. Designed for TTS models.
☆28Apr 23, 2024Updated 2 years ago
DakeQQ / STFT-ISTFT-ONNX
View on GitHub
Export the STFT or ISTFT process in ONNX format.
☆47Jun 6, 2026Updated last month
flamed-tts / Flamed-TTS
View on GitHub
This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …
☆57Aug 9, 2025Updated 11 months ago
MuyangDu / T5Voice
View on GitHub
T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …
☆28Nov 7, 2025Updated 8 months ago
rishikksh20 / MiniMax-TTS-pytorch
View on GitHub
Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report
☆47Sep 2, 2025Updated 10 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Andong-Li-speech / BridgeVoC
View on GitHub
This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".
☆67Nov 5, 2025Updated 8 months ago
Ereboas / TacoLM
View on GitHub
☆19May 2, 2024Updated 2 years ago
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
Ircam-Partiels / crepe-vamp-plugin
View on GitHub
The Crepe plugin is an implementation of the CREPE monophonic pitch tracker, based on a deep convolutional neural network operating direc…
☆16Dec 1, 2025Updated 7 months ago
lars76 / fastspeech2-clean
View on GitHub
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18Aug 16, 2024Updated last year
FENRlR / MB-iSTFT-VITS2
View on GitHub
Application of MB-iSTFT-VITS components to vits2_pytorch
☆135Dec 29, 2025Updated 7 months ago
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
innnky / MagVITS
View on GitHub
VITS with phoneme-level prosody modeling based on MaskGIT
☆85Aug 31, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
brummer10 / neuralrecord
View on GitHub
A Neural Recorder plug to make the process of cloning external soft/hardware a bit more comfortable
☆32Nov 25, 2023Updated 2 years ago
supertone-inc / super-monotonic-align
View on GitHub
☆173Sep 19, 2024Updated last year
tadmn / spectrum
View on GitHub
Free GPU accelerated cross-platform audio spectrum analyzer (VST3, CLAP, AUv2)
☆28Updated this week
freds0 / free-svc
View on GitHub
[ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion
☆95Jul 23, 2025Updated last year
lars76 / swift-f0
View on GitHub
Fast and accurate fundamental frequency (F0) detector using convolutional neural networks
☆175Sep 2, 2025Updated 10 months ago
duerig / StyleTTS2
View on GitHub
StyleTTS 2 Optimized Training Fork
☆32Feb 2, 2025Updated last year
Plachtaa / ASTRAL-quantization
View on GitHub
speaker-disentangled speech linguistic content quantizer
☆26Mar 19, 2025Updated last year
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
yl4579 / PitchExtractor
View on GitHub
Deep Neural Pitch Extractor for Voice Conversion and TTS Training
☆152Aug 22, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
naver-ai / RapFlow-TTS
View on GitHub
☆56Jul 16, 2025Updated last year
reppy4620 / vocoders
View on GitHub
My vocoder experiments
☆31Jul 26, 2025Updated last year
b04901014 / vae-gslm
View on GitHub
Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models
☆24Jun 18, 2025Updated last year
b-sigpro / sed-hsmm
View on GitHub
Onset-and-Offset-Aware Sound Event Detection
☆21Feb 10, 2025Updated last year
shivammehta25 / BetterFastSpeech2
View on GitHub
Just another FastSpeech 2 but cleaner code :)
☆29Jun 28, 2024Updated 2 years ago
haoweilou / ParaStyleTTS
View on GitHub
This is the official code for ACM CIKM 2025 Paper: ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive …
☆59Dec 21, 2025Updated 7 months ago
RayYuki / CodecBench
View on GitHub
☆24Nov 16, 2025Updated 8 months ago
neuphonic / neucodec
View on GitHub
A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.
☆161Jun 22, 2026Updated last month
liuhuang31 / HiFTNet-sr
View on GitHub
HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz
☆24Jan 2, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
vtuber-plan / NSF-HiFiGAN
View on GitHub
Vocoder NSF-HiFiGAN (Moved into deepaudio)
☆56Dec 11, 2022Updated 3 years ago
BarcelonaMedia-Audio / idhoa
View on GitHub
Software for Decoding of High Order Ambisonics to Irregular Layouts
☆13Mar 20, 2014Updated 12 years ago
vackva / Orbe
View on GitHub
Binaural Spatializer Audio Plugin
☆25Jun 25, 2024Updated 2 years ago
JeffMcClintock / SE2JUCE
View on GitHub
☆12Jan 28, 2026Updated 6 months ago
xincanfeng / vitsGPT
View on GitHub
☆60Jun 28, 2024Updated 2 years ago
BakerBunker / FreeV
View on GitHub
[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
☆98Jul 4, 2024Updated 2 years ago
HSUNEH / DOSE
View on GitHub
☆19Sep 22, 2025Updated 10 months ago