unreal79 / pic2wav
Encode an image to sound (WAV file) and view it as a spectrogram. Optimized Python 3 version.
☆11Updated 2 years ago
Alternatives and similar repositories for pic2wav
Users who are interested in pic2wav are comparing it to the libraries listed below
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.☆11Updated 6 months ago
- Persian Consonant Vowel Combination (PCVC) Speech Dataset☆19Updated 3 months ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the…☆47Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- Software that performs the separation of vocals from music using neural networks (part of my Bachelor's thesis).☆30Updated 4 years ago
- Evaluation of STT models for the German language☆15Updated 3 years ago
- Tools to create your own voice dataset for TTS training☆68Updated 4 years ago
- Example Python scripts to evaluate various ASR methods☆12Updated 3 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆33Updated last year
- PyTorch implementation of the paper MultiSpeech: Multi-Speaker Text to Speech with Transformer☆21Updated 3 years ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Updated 6 years ago
- Hum2Song: Multi-track Polyphonic Music Generation from Voice Melody Transcription with Neural Networks☆136Updated 2 years ago
- Cicada is an open source audio annotation tool that lets you annotate audio files in .wav format and also enables you to look into each a…☆23Updated 2 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated 2 years ago
- Simple PyTorch Denoisers for Waveform Audio☆35Updated 3 months ago
- My guide to creating an Italian TTS with Coqui☆14Updated 3 years ago
- Python implementation of a few speech intelligibility prediction algorithms☆14Updated last year
- Finally, some decent sample sentences☆23Updated last year
- ☆32Updated 3 years ago
- Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregr…☆28Updated last year
- A deep-learning-powered accessibility application which turns PDFs into audio files. Featuring OCR improvement and TTS with inflection!☆24Updated 5 months ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- ☆17Updated 4 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- Python library for audio augmentation☆84Updated 2 years ago
- Wouldn't it be cool to build a speech assistant like Alexa or Siri yourself, without a voice API or a network connection?☆33Updated 7 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆87Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆15Updated 2 years ago