zabir-nabil / audiopermLinks
A python library for generating different permutations of audible segments from audio files.
☆13Updated 3 years ago
Alternatives and similar repositories for audioperm
Users that are interested in audioperm are comparing it to the libraries listed below
Sorting:
- ☆17Updated 2 years ago
 - Code base for WaveTransformer: A novel architecture for automated audio captioning☆44Updated 4 years ago
 - Classifying 10 different categories of Sound using Deep Learning.☆25Updated 7 years ago
 - Pytorch Code for S2IGAN☆41Updated 5 years ago
 - Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)☆29Updated 5 years ago
 - Comprehensive Python library for speech and voice.☆32Updated 2 years ago
 - SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 6 years ago
 - Audio data augmentation examples☆34Updated 7 years ago
 - Unofficial Implementation of MLP-Mixer in TensorFlow☆27Updated 4 years ago
 - Extension of the `Attention Augmented Convolutional Networks` paper for 1-D convolution operation.☆25Updated 6 years ago
 - Best Collection of Articles and code for Audio Classification☆15Updated 6 years ago
 - Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Updated 2 years ago
 - ☆65Updated 7 years ago
 - Contrastive Language-Audio Pretraining☆15Updated 4 years ago
 - Utils and data sets for audio and PyTorch☆86Updated 3 years ago
 - Automated Lip Reading using Deep Reinforcement Learning☆32Updated 7 years ago
 - Implementation of "Face detection in untrained deep neural networks" (Baek et al., Nature Communications, 2021)☆10Updated 4 years ago
 - Urban sound source tagging from an aggregation of four second noisy audio clips via 1D and 2D CNN (Xception)☆60Updated 2 years ago
 - Audio Classification using Image Classification☆48Updated 5 years ago
 - Code accompanying ISMIR'19 paper titled "Learning to Traverse Latent Spaces for Musical Score Inpaintning"☆47Updated 4 years ago
 - ☆26Updated 6 years ago
 - ☆19Updated 5 years ago
 - A rugged Qt GUI application for processing webcam frames for ML applications (pose estimation)☆13Updated 2 years ago
 - Emotional Video to Audio Transformation with ANFIS-DeepRNN (Vanilla RNN and LSTM-DeepRNN) [MPE 2020]☆25Updated 5 years ago
 - Enhancment of Audio Quality (Bit-Depth and Sampling-Rate) using Deep Learning.☆33Updated 5 years ago
 - Feature extractor for DL speech processing.☆66Updated 3 years ago
 - https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques …☆26Updated 8 years ago
 - ☆12Updated 5 years ago
 - Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and tensorflow☆26Updated 6 years ago
 - Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago