npuichigo / tarzanView external linksLinks
High-level API for tar-based dataset
☆12Feb 3, 2024Updated 2 years ago
Alternatives and similar repositories for tarzan
Users that are interested in tarzan are comparing it to the libraries listed below
Sorting:
- Blazing fast data loading with HuggingFace Dataset and Ray Data☆16Jan 12, 2024Updated 2 years ago
- Audio streaming transfer demo with google.api.HttpBody and grpc gateway for speech synthesis☆20Jan 28, 2020Updated 6 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆47Mar 25, 2022Updated 3 years ago
- ☆45Oct 24, 2020Updated 5 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆29Aug 13, 2020Updated 5 years ago
- API implementation of Song Source spleeting from Spleeter by Deezer☆13Mar 21, 2020Updated 5 years ago
- using world vocoder to extract features and make data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- ☆64May 3, 2024Updated last year
- Random collections of code examples.☆12Mar 19, 2025Updated 10 months ago
- ICASSP2022 TTS&VC Summary☆14Jun 9, 2022Updated 3 years ago
- A Pytorch implementation of WaveVAE ("Parallel Neural Text-to-Speech")☆126Feb 24, 2024Updated last year
- ☆36Mar 14, 2025Updated 11 months ago
- Rust crate for some audio utilities☆27Mar 8, 2025Updated 11 months ago
- C++ p2300 proposal in Rust☆22Jan 31, 2026Updated 2 weeks ago
- Profiling and Improving the PyTorch Dataloader for high-latency Storage☆20Apr 18, 2023Updated 2 years ago
- ☆21Feb 27, 2024Updated last year
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Oct 14, 2019Updated 6 years ago
- Python wrapper for Sinsy☆53Oct 9, 2023Updated 2 years ago
- WaveRNN-based waveform generator & demo of TensorFlow CuDNN-GRU usage.☆24Aug 19, 2018Updated 7 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- ☆55Aug 11, 2022Updated 3 years ago
- Speech synthesis platform based on tensorflow and sonnet☆60May 16, 2019Updated 6 years ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆167Apr 10, 2024Updated last year
- c++ Kaldi IO lib (static and dynamic).☆25Nov 26, 2018Updated 7 years ago
- [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS☆64Nov 18, 2024Updated last year
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆77Dec 3, 2025Updated 2 months ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆211Sep 19, 2024Updated last year
- ☆68Jul 16, 2023Updated 2 years ago
- A fork of sinsy: HMM/DNN-based singing voice synthesis system☆73Feb 6, 2022Updated 4 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆366Dec 6, 2018Updated 7 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆71Mar 19, 2021Updated 4 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech☆127Jul 16, 2020Updated 5 years ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆41Jan 4, 2026Updated last month
- OpenAI compatible API for TensorRT LLM triton backend☆220Aug 1, 2024Updated last year
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 4 years ago
- This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batc…☆36May 7, 2024Updated last year
- ☆34Jul 16, 2019Updated 6 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Apr 25, 2025Updated 9 months ago