unreal79 / pic2wav
Encode an image to sound (WAV file) and view it as a spectrogram. Optimized Python 3 version.
☆11 · Updated 2 years ago
Alternatives and similar repositories for pic2wav
Users who are interested in pic2wav are comparing it to the libraries listed below
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in the Persian language. ☆10 · Updated 3 months ago
- Streamlit app to visualize and edit TTS datasets ☆14 · Updated 3 years ago
- Python implementation of a few speech intelligibility prediction algorithms ☆13 · Updated last year
- Prepare spectrograms from audio for training a Riffusion model ☆15 · Updated 2 years ago
- Interface for using TTS and vocoder models in the form of a text editor ☆20 · Updated 2 years ago
- This repository contains the source code for the implementation of two deep learning models concerning the audio super resolution task. ☆14 · Updated 2 years ago
- Uses machine learning to denoise audio containing speech ☆34 · Updated 11 months ago
- A free & open tool for transcribing audio interviews with offline ASR support ☆24 · Updated last year
- Example Python scripts to evaluate various ASR methods ☆12 · Updated 3 years ago
- Persian Consonant Vowel Combination (PCVC) Speech Dataset ☆19 · Updated last month
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning … ☆26 · Updated last year
- Simple PyTorch Denoisers for Waveform Audio ☆35 · Updated last month
- Heteronym to Phoneme Parser ☆18 · Updated last year
- Remixing Music with Deep Learning ☆15 · Updated 8 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp… ☆57 · Updated 6 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p… ☆53 · Updated 3 years ago
- A deep-learning powered accessibility application which turns PDFs into audio files. Featuring OCR improvement and TTS with inflection! ☆23 · Updated 3 months ago
- 🔀 Strange combinations converter like Audio <-> Image ☆20 · Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the… ☆47 · Updated 2 years ago
- Generation of musical phrases that receive maximum score according to configurable evaluational criteria. ☆12 · Updated last year
- Mellotron singing synthesizer using CPU ☆13 · Updated 2 years ago
- Cicada is an open source audio annotation tool that lets you annotate audio files in .wav format and also enables you to look into each a… ☆23 · Updated 2 years ago
- My guide to creating an Italian TTS with Coqui ☆14 · Updated 3 years ago
- OpenAI's GPT2 based Music AI Google Colab Notebooks for Music Generation/Composition and Capabilities Evaluation ☆45 · Updated 4 years ago
- Finally, some decent sample sentences ☆23 · Updated last year
- Tools to create your own voice dataset for TTS training ☆65 · Updated 4 years ago
- A GUI to help make a text-to-speech dataset. ☆18 · Updated 2 years ago
- TTS Client for Coqui TTS server ☆13 · Updated 2 years ago
- Music Generative Pretrained Transformer ☆27 · Updated 2 years ago
- This is a PyTorch implementation of the Neural Style Transfer algorithm, modified for audio. ☆81 · Updated 3 years ago