Tensorflow implementation of pix2pix for creating music from a voice. Vocals2Song.
☆17Sep 26, 2022Updated 3 years ago
Alternatives and similar repositories for Vocals2Song
Users that are interested in Vocals2Song are comparing it to the libraries listed below
Sorting:
- This is now the official location of the Kaldi project.☆10Aug 22, 2019Updated 6 years ago
- House Layout Generation using GAN. Initial Idea from HouseGAN ++☆16May 5, 2023Updated 2 years ago
- sliding HPSS and two stage HPSS (singing voice enhancement)☆17Oct 9, 2020Updated 5 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆18Jun 12, 2022Updated 3 years ago
- Can Neural Networks reconstruct missing audio data? What about GANs?☆18Nov 6, 2019Updated 6 years ago
- Simple text to phonemes converter for multiple languages☆20Nov 21, 2022Updated 3 years ago
- A collection of papers I am interested in.☆29Apr 3, 2023Updated 2 years ago
- spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io☆51Jun 12, 2025Updated 8 months ago
- ☆25Jun 14, 2022Updated 3 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- Constant-Q harmonic coefficients (CQHCs), a timbre feature designed for music signals.☆28Sep 13, 2025Updated 5 months ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆12Jun 28, 2022Updated 3 years ago
- Fork of the official kaldi.☆22Mar 22, 2022Updated 3 years ago
- Official repository for the paper "Reconstruction of Perceived Images from fMRI Patterns and Semantic Brain Exploration using Instance-Co…☆24May 19, 2022Updated 3 years ago
- Controlled audio inpainting using SD-fine tuned model Riffusion in a ControlNet Architecture☆33May 31, 2023Updated 2 years ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Jul 5, 2019Updated 6 years ago
- Easy setup for mmdetection☆22May 17, 2025Updated 9 months ago
- ☆12Jul 30, 2025Updated 7 months ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Mar 14, 2023Updated 2 years ago
- Speaker diarization with GMM-UBM and MAP Adaptation☆31Sep 13, 2018Updated 7 years ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min…☆10Updated this week
- Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny☆15Oct 30, 2025Updated 4 months ago
- Music GAN - GANSynth preprocessing, ProGAN and DCGAN architecture☆11Jan 26, 2023Updated 3 years ago
- Extract information from XBRL files in the ESEF format☆13Jan 3, 2026Updated 2 months ago
- This project predicts wind turbine failure using numerous sensor data by applying classification based ML models that improves prediction…☆11Mar 20, 2023Updated 2 years ago
- Urban block renewal maximizing outdoor thermal comfort using deep reinforcement learning methods.☆10Mar 3, 2022Updated 4 years ago
- ☆10Jul 29, 2022Updated 3 years ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 10 months ago
- Speech Recognition implementation using Artificial Neural Networks☆10Sep 7, 2015Updated 10 years ago
- (IEEE TCSVT) 3DPortraitGAN: Learning One-Quarter Headshot 3D GANs from a Single-View Portrait Dataset with Diverse Body Poses☆33Jul 9, 2025Updated 8 months ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Sep 26, 2018Updated 7 years ago
- Implementation of the framework described in the paper Spectrogram Inpainting for Interactive Generation of Instrument Sounds published a…☆40Oct 6, 2022Updated 3 years ago
- Updated ROS bindings to pocketsphinx☆38Oct 10, 2017Updated 8 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Jan 17, 2024Updated 2 years ago
- A generative deep learning model based on GAN architecture was implemented to generate synthetic network data (benign and malicious) alik…☆10Oct 23, 2021Updated 4 years ago
- ImageQA is a tool for analyzing digital image quality according to specific attributes such as color, tone transfer, noise or resolution.…☆11Sep 18, 2024Updated last year
- A reddit scraping and analysis bot to visualize linguistic and content trends☆11Oct 5, 2021Updated 4 years ago
- Pytorch Implementation of the Explainable Conditional Adversarial Autoencoder using Saliency Maps and SHAP (J. of Imaging - MDPI)☆12Mar 5, 2025Updated last year
- Simple implementation of TDOA localization algorithm.☆13Oct 12, 2016Updated 9 years ago