unreal79 / pic2wav
Encode an image to sound (WAV file) and view it as a spectrogram. Optimized Python 3 version.
☆11Updated last year
Related projects ⓘ
Alternatives and complementary repositories for pic2wav
- My guide to create an italian TTS with Coqui☆14Updated 2 years ago
- Symbolic music generation taking inspiration from NLP and human composition process☆17Updated last year
- 🔀 Strange combinations converter like Audio <-> Image☆19Updated last year
- Prepare spectrograms from audio for training a Riffusion model☆13Updated last year
- This is the implementation of the paper "VAW-GAN for Singing Voice Conversion withNon-parallel Training Data".☆16Updated 4 years ago
- Finally, some decent sample sentences☆22Updated 11 months ago
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp☆17Updated last month
- Tools to create your own voice dataset for TTS training☆61Updated 4 years ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 2 years ago
- Mellotron singing synthesizer using CPU☆13Updated last year
- Do you think that AI can write songs for us? The project is just the music generator with the power of AI.☆34Updated 5 years ago
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!☆23Updated last year
- A Gradio setup for Tortoise TTS.☆45Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆46Updated last year
- ☆32Updated 2 years ago
- OpenAI's GPT2 based Music AI Google Colab Notebooks for Music Generation/Composition and Capabilities Evaluation☆43Updated 3 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆24Updated last year
- Simple PyTorch Denoisers for Waveform Audio☆32Updated last month
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆30Updated last year
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 5 years ago
- A python package for high level musical data manipulation and preprocessing, making data ready to be fed to a neural network.☆40Updated 2 years ago
- Remixing Music with Deep Learning☆14Updated 8 years ago
- Conditional lyrics generator -> pre-trained GPT2 model fine-tuned on lyrics with features dataset.☆40Updated 4 years ago
- Hum2Song: Multi-track Polyphonic Music Generation from Voice Melody Transcription with Neural Networks☆114Updated last year
- Implementation of the framework described in the paper Spectrogram Inpainting for Interactive Generation of Instrument Sounds published a…☆37Updated 2 years ago
- Full LAKH MIDI dataset converted to MuseNet MIDI output format (9 instruments + drums)☆17Updated 2 years ago
- [DEAD/NOT SUPPORTED ANYMORE] This is the only fully working and functioning version of Google Magenta Piano Transformer Colab Notebook.☆25Updated 2 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆32Updated 9 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆24Updated last year
- Detect individual instruments activity in an audio file. 🎤🎹🎸🥁☆15Updated 3 years ago