unreal79 / pic2wav
Encode an image to sound (WAV file) and view it as a spectrogram. Optimized Python 3 version.
☆11 · Updated last year
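The idea of encoding an image as sound can be illustrated with a minimal sketch. This is not the repository's code; it assumes a simple additive-synthesis scheme in which each image row drives one sine wave and each column becomes one time slice, so a spectrogram of the result roughly redraws the picture. The function names `image_to_samples` and `write_wav`, the frequency range, and the timing parameters are all hypothetical choices made for illustration.

```python
import math
import struct
import wave


def image_to_samples(pixels, sample_rate=8000, column_seconds=0.05,
                     min_hz=200.0, max_hz=3000.0):
    """Turn a grayscale grid (rows of 0..1 floats, top row first) into samples.

    Assumed scheme: one sine wave per row, one time slice per column.
    """
    rows = len(pixels)
    cols = len(pixels[0])
    per_column = round(sample_rate * column_seconds)
    # Top row maps to max_hz, bottom row to min_hz (linear spacing).
    if rows > 1:
        freqs = [max_hz - r * (max_hz - min_hz) / (rows - 1) for r in range(rows)]
    else:
        freqs = [min_hz]
    samples = []
    for c in range(cols):
        for n in range(per_column):
            t = (c * per_column + n) / sample_rate
            # Pixel brightness sets the amplitude of that row's sine wave.
            value = sum(pixels[r][c] * math.sin(2 * math.pi * freqs[r] * t)
                        for r in range(rows))
            samples.append(value)
    # Normalize to [-1, 1] so the loudest sample fits the 16-bit range.
    peak = max((abs(s) for s in samples), default=0.0)
    if peak > 0:
        samples = [s / peak for s in samples]
    return samples


def write_wav(target, samples, sample_rate=8000):
    """Write mono 16-bit PCM samples (floats in [-1, 1]) to a path or file object."""
    with wave.open(target, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(sample_rate)
        frames = b"".join(struct.pack("<h", int(s * 32767)) for s in samples)
        wav.writeframes(frames)
```

Calling `write_wav("out.wav", image_to_samples(grid))` with a small grid of brightness values should produce a file whose spectrogram shows bright bands where the grid had bright pixels. A real tool would likely load the grid from an image file and use a vectorized library for speed.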
Related projects:
- Streamlit app to visualize and edit TTS datasets ☆14 · Updated 2 years ago
- Prepare spectrograms from audio for training a Riffusion model ☆13 · Updated last year
- ☆23 · Updated this week
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆24 · Updated last year
- Uses machine learning to denoise audio containing speech ☆28 · Updated 2 months ago
- Fork of AudioLDM as a TuneFlow plugin ☆38 · Updated last year
- OpenAI's GPT2-based Music AI Google Colab Notebooks for Music Generation/Composition and Capabilities Evaluation ☆43 · Updated 3 years ago
- Finally, some decent sample sentences ☆21 · Updated 9 months ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp… ☆57 · Updated 5 years ago
- My guide to creating an Italian TTS with Coqui ☆12 · Updated 2 years ago
- Conditional lyrics generator -> pre-trained GPT2 model fine-tuned on lyrics with features dataset. ☆40 · Updated 4 years ago
- Hum2Song: Multi-track Polyphonic Music Generation from Voice Melody Transcription with Neural Networks ☆109 · Updated last year
- A deep-learning powered accessibility application which turns PDFs into audio files. Featuring OCR improvement and TTS with inflection! ☆23 · Updated last year
- Interface for using TTS and vocoder models in the form of a text editor ☆19 · Updated last year
- A project about learning how to synchronize subtitles in movies using machine learning. ☆9 · Updated last year
- [DEAD/NOT SUPPORTED ANYMORE] This is the only fully working and functioning version of Google Magenta Piano Transformer Colab Notebook. ☆23 · Updated 2 years ago
- AI-generated music video with Riffusion and Gradio ☆17 · Updated last year
- List of repositories relevant to VITS. ☆35 · Updated last year
- Tools to create your own voice dataset for TTS training ☆58 · Updated 3 years ago
- A fast MP3 decoder for Python, using minimp3 ☆26 · Updated 2 years ago
- A minimalistic automatic speech recognition Streamlit-based web app powered by OpenAI's Whisper "State of the Art" models ☆65 · Updated last year
- OCTRA is a web-application for the orthographic transcription of audio files. ☆35 · Updated last week
- Simple text to phonemes converter for multiple languages ☆21 · Updated last year
- TTS Client for Coqui TTS server ☆13 · Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the… ☆46 · Updated last year
- Text prompt steered synthetic audio generators ☆44 · Updated 9 months ago
- Do you think that AI can write songs for us? This project is a music generator powered by AI. ☆33 · Updated 5 years ago
- GPT3-based Multi-Instrumental MIDI Music AI Implementation ☆45 · Updated 2 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning … ☆22 · Updated last year
- Simple PyTorch Denoisers for Waveform Audio ☆31 · Updated 4 months ago