zabir-nabil / audioperm
A Python library for generating different permutations of audible segments from audio files.
☆13 Updated 2 years ago
Alternatives and similar repositories for audioperm
Users that are interested in audioperm are comparing it to the libraries listed below
- A rugged Qt GUI application for processing webcam frames for ML applications (pose estimation) ☆13 Updated 2 years ago
- ☆17 Updated 2 years ago
- Extension of the `Attention Augmented Convolutional Networks` paper for 1-D convolution operation. ☆25 Updated 5 years ago
- SCAR-Net, Submission to the Cooking Activity Recognition Challenge, ABC: competition track ☆11 Updated 2 years ago
- Emotional Video to Audio Transformation with ANFIS-DeepRNN (Vanilla RNN and LSTM-DeepRNN) [MPE 2020] ☆26 Updated 5 years ago
- Classification of ECG signals by dot Residual LSTM Network for anomaly detection ☆21 Updated 5 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning ☆43 Updated 4 years ago
- A simple wrapper to localize human joints from images/video frames for multiple subjects. ☆13 Updated 2 years ago
- ☆24 Updated 6 years ago
- Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and TensorFlow ☆26 Updated 6 years ago
- ☆12 Updated 4 years ago
- Image and video processing toolbox ☆10 Updated 4 years ago
- Audio classification is a popular topic; here I implement several models using TensorFlow and Keras. ☆24 Updated 4 years ago
- Contrastive Language-Audio Pretraining ☆15 Updated 4 years ago
- Siamese network for unsupervised speech representation learning ☆11 Updated 6 years ago
- ☆29 Updated 5 years ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations ☆48 Updated 10 months ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox". ☆27 Updated 3 years ago
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel. ☆37 Updated 3 years ago
- Conditioned U-Net for Music Source Separation ☆20 Updated 4 years ago
- PyTorch implementation of the 1D-Triplet-CNN neural network model described in Fusing MFCC and LPC Features using 1D Triplet CNN for Spea… ☆29 Updated 5 years ago
- Audio data augmentation examples ☆34 Updated 7 years ago
- Dynamic Time Warping algorithm for the Physionet Challenge 2016 ☆15 Updated 8 years ago
- Baseline systems for the FSD50K dataset ☆69 Updated 3 years ago
- Control mechanisms to the U-Net architecture for doing multiple source separation instruments ☆52 Updated 5 years ago
- Augmented Audio Data Generator for 1D-Convolutional Neural Networks ☆49 Updated 3 years ago
- Audio processing using deep neural networks. Speaker identification using voice embeddings. ☆13 Updated 2 years ago
- Implementation of the paper "Speech emotion recognition with deep convolutional neural networks" by Dias Issa et al. ☆13 Updated 3 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances… ☆29 Updated 2 months ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model ☆26 Updated 2 years ago