zabir-nabil / audiopermLinks
A python library for generating different permutations of audible segments from audio files.
☆13Updated 3 years ago
Alternatives and similar repositories for audioperm
Users that are interested in audioperm are comparing it to the libraries listed below
Sorting:
- Emotional Video to Audio Transformation with ANFIS-DeepRNN (Vanilla RNN and LSTM-DeepRNN) [MPE 2020]☆25Updated 5 years ago
- Extension of the `Attention Augmented Convolutional Networks` paper for 1-D convolution operation.☆25Updated 5 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning☆44Updated 4 years ago
- ☆17Updated 2 years ago
- Classifying 10 different categories of Sound using Deep Learning.☆25Updated 7 years ago
- A rugged Qt GUI application for processing webcam frames for ML applications (pose estimation)☆13Updated 2 years ago
- A complete implementation of the Pytorch neural network framework for GAN☆24Updated 3 years ago
- A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication,…☆82Updated 2 weeks ago
- ☆26Updated 6 years ago
- 1D CNN based classifier for Speech Commands Dataset☆9Updated 7 years ago
- Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and tensorflow☆26Updated 6 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆24Updated 4 years ago
- Pytorch Code for S2IGAN☆41Updated 5 years ago
- Collection of research papers on cough classification☆39Updated 5 years ago
- Different methods and techniques for features extraction from audio☆56Updated last year
- Machine Learning Sound Classifier☆137Updated 5 years ago
- Urban sound source tagging from an aggregation of four second noisy audio clips via 1D and 2D CNN (Xception)☆60Updated 2 years ago
- PyTorch implementation of the 1D-Triplet-CNN neural network model described in Fusing MFCC and LPC Features using 1D Triplet CNN for Spea…☆30Updated 5 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- SpeechYOLO Interspeech 2019☆44Updated 2 years ago
- A convolution-free, transformer-only version of the CycleGAN framework☆33Updated 3 years ago
- TensorFlow implementation of "GANSynth: Adversarial Neural Audio Synthesis"☆67Updated 6 years ago
- ☆12Updated 4 years ago
- Contrastive Language-Audio Pretraining☆15Updated 4 years ago
- My Dream is that each one of these code snippets will become a blog post. So let's take this dream one snippet at a time :)☆35Updated 5 years ago
- Comprehensive Python library for speech and voice.☆32Updated 2 years ago
- COLA contrastive pre-training method implemented in PyTorch☆43Updated 4 years ago
- Code accompanying ISMIR'19 paper titled "Learning to Traverse Latent Spaces for Musical Score Inpaintning"☆47Updated 4 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Updated 2 years ago
- The goal of this task is to automatically recognize the emotions and themes conveyed in a music recording using machine learning algorith…☆38Updated 2 years ago