csyan5 / AttnGAN-Audio-to-image-geneation
CMPT726 Machine Learning Final Project
☆11 Updated 5 years ago
Related projects
Alternatives and complementary repositories for AttnGAN-Audio-to-image-geneation
- Audio Classification using Image Classification ☆49 Updated 4 years ago
- Experiment with "one-shot learning" techniques to recognize a voice signature ☆24 Updated 4 years ago
- Conditional lyrics generator -> pre-trained GPT2 model fine-tuned on lyrics with features dataset. ☆40 Updated 4 years ago
- ☆29 Updated 6 years ago
- Generate embedding vectors from audio files ☆56 Updated last year
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ☆16 Updated 5 years ago
- OpenAI's GPT2 based Music AI Google Colab Notebooks for Music Generation/Composition and Capabilities Evaluation ☆43 Updated 3 years ago
- Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions" ☆28 Updated 5 years ago
- https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques … ☆26 Updated 7 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications. ☆47 Updated 7 years ago
- Demonstration of gpt-2 model with flask+uwsgi+nginx in web environment containerized in docker for quick deployment. ☆13 Updated last year
- This is a k-means clustering algorithm implementation for audio signals clustering ☆22 Updated 10 months ago
- A neural network for filtering target speaker's voice from audio written in tensorflow ☆21 Updated 6 years ago
- Removing background noise in a sound file ☆62 Updated 5 years ago
- generate lyrics with GPT-2 ☆37 Updated 5 years ago
- Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information ☆27 Updated 7 years ago
- SiSEC MUS 2018 Submission System ☆43 Updated 5 years ago
- Simple text to phonemes converter for multiple languages ☆20 Updated last year
- Learning embeddings for laughter categorization ☆34 Updated 6 years ago
- Trains a convolutional autoencoder on Mel Spectrogram images for a list of songs, then displays the encoded latent features using t-SNE. ☆20 Updated 7 years ago
- Audio Analysis by Conceptor ☆30 Updated 9 years ago
- Experiments with generating GPT-2 fanfiction on specified topics. ☆11 Updated 5 years ago
- Urban sound source tagging from an aggregation of four second noisy audio clips via 1D and 2D CNN (Xception) ☆58 Updated last year
- Automatic Speech Recognition Dataset Generation ☆36 Updated 6 years ago
- Training Wavenet on Sylvia Plath Audio Clips ☆9 Updated 6 years ago
- Vue app for https://github.com/bearpelican/musicautobot ☆15 Updated last year
- Two-stage GANs that generate fingerstyle guitarist images from audio. ☆58 Updated 6 years ago
- This repository main point is to implement Generative Adversarial Networks (GANs) and Style Transfer Methods that can create new audio sa… ☆42 Updated 5 years ago