stephengrice / synth-meView external linksLinks
Basic concatenative text-to-speech implementation in Python
☆19Aug 31, 2019Updated 6 years ago
Alternatives and similar repositories for synth-me
Users that are interested in synth-me are comparing it to the libraries listed below
Sorting:
- Python Hindi Concatenative Based TTS using Phoneme Database☆25Feb 2, 2022Updated 4 years ago
- CoaT: Co-Scale Conv-Attentional Image Transformers☆16Apr 20, 2021Updated 4 years ago
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆15Oct 27, 2023Updated 2 years ago
- vq-wav2vec inference☆13Dec 13, 2021Updated 4 years ago
- Frontend system for HMM-based speech synthesis models generated by HTS.☆40Apr 5, 2021Updated 4 years ago
- A Multi_Scale LSTM Model.☆18Mar 31, 2017Updated 8 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Dec 31, 2021Updated 4 years ago
- LogicCircuit is a program that helps build/simulate simple circuits using logic gates. It is meant to teach people the basics of how logi…☆10Jan 22, 2025Updated last year
- working on parallel wavenet☆25Apr 19, 2018Updated 7 years ago
- Algorithmic problems, solutions and a variety of visualizations. Specific examples are provided.☆11Mar 2, 2022Updated 3 years ago
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆42Sep 5, 2025Updated 5 months ago
- Training code and dataset cleasing with Sidon☆76Jan 16, 2026Updated 3 weeks ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Aug 29, 2023Updated 2 years ago
- A minimal and interpretable Brian2 based DYNAP neuromorphic processor simulator for educational purposes.☆12Jun 23, 2022Updated 3 years ago
- real time face swap and one-click video deepfake with only a single image☆11Sep 13, 2024Updated last year
- Python crossplatform library for Mac/linux and widows os.Complete system command, send alert, notifications, set brightness, recording au…☆11Apr 25, 2025Updated 9 months ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆40Dec 30, 2020Updated 5 years ago
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆64Sep 22, 2025Updated 4 months ago
- A hand-gesture recognition system using Doppler effect of ultrasonic.☆11Mar 2, 2019Updated 6 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆84May 23, 2023Updated 2 years ago
- Experiments with Hugging Face 🔬 🤗☆46Aug 20, 2024Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Jan 4, 2023Updated 3 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆43Dec 6, 2022Updated 3 years ago
- Create short vertical videos for TikTok, YouTube Shorts, and Instagram Reels using AI. Fully automated pipeline with traceability. 🚀🎥☆16Feb 8, 2026Updated last week
- Sublime Text 3 plugin for voice coding Python 3☆13Sep 15, 2022Updated 3 years ago
- This repository expects to be a place to find code/resources/examples and more, related to the NTUA lambda flow.☆10Feb 12, 2019Updated 7 years ago
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆37Apr 7, 2022Updated 3 years ago
- Supply chain planning using max flow formulated as mixed integer linear programming☆10May 20, 2020Updated 5 years ago
- BlockCIrculantRNN (LSTM and GRU) using TensorFlow☆14Oct 30, 2018Updated 7 years ago
- An awesome list that curates the best Flet tools, tutorials, blogs and more.☆10Jan 8, 2023Updated 3 years ago
- BioVoice: a multipurpose tool for voice analysis☆11Nov 13, 2020Updated 5 years ago
- RSSI-based OFDM signal classification using a machine learning algorithm.☆12May 15, 2018Updated 7 years ago
- ☆12Aug 15, 2023Updated 2 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 9 months ago
- A machine learning algorithm that estimates the directions of arrival and relative levels of an arbitrary number of sound sources using r…☆12Dec 10, 2022Updated 3 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- Karaokey is a vocal remover that automatically separates the vocals and instruments. A deep learning model based on LSTMs has been traine…☆42Jul 6, 2023Updated 2 years ago
- An 16kHz implementation of HiFi-GAN for soft-vc.☆105Jul 19, 2023Updated 2 years ago