unreal79 / pic2wavLinks
Encode an image to sound (WAV file) and view it as a spectrogram. Optimized Python 3 version.
☆11Updated 2 years ago
Alternatives and similar repositories for pic2wav
Users that are interested in pic2wav are comparing it to the libraries listed below
Sorting:
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.☆13Updated 2 months ago
- My guide to create an italian TTS with Coqui☆14Updated 3 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆33Updated last year
- Simple text to phonemes converter for multiple languages☆20Updated 3 years ago
- Streamlit app to visualize and edit TTS datasets☆15Updated 4 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- Cicada is an open source audio annotation tool that lets you annotate audio files in .wav format and also enables you to look into each a…☆22Updated 3 years ago
- Software that performs the separation of vocals from music using neural networks (part of my Bachelor's thesis).☆30Updated 5 years ago
- Pytorch Implementation of wavegan model to generate audio☆171Updated 5 years ago
- Python implementation of a few speech intelligibility prediction algorithms☆15Updated last year
- Tools to create your own voice dataset for TTS training☆69Updated 5 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- Hum2Song: Multi-track Polyphonic Music Generation from Voice Melody Transcription with Neural Networks☆144Updated 2 years ago
- Finally, some decent sample sentences☆23Updated 2 years ago
- Persian Consonant Vowel Combination (PCVC) Speech Dataset☆19Updated 7 months ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregr…☆28Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- ☆32Updated 3 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated last week
- ☆17Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.☆11Updated 5 years ago
- ☆63Updated 4 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated 2 years ago
- PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper☆12Updated 3 years ago
- This is PyTorch Implementation of Neural Style Transfer Algorithm which is modified for Audios.☆84Updated 3 years ago
- [DEAD/NOT SUPPORTED ANYMORE] This is the only fully working and functioning version of Google Magenta Piano Transformer Colab Notebook.☆25Updated 3 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆19Updated 3 weeks ago