zabir-nabil / audioperm
A python library for generating different permutations of audible segments from audio files.
☆13Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for audioperm
- A rugged Qt GUI application for processing webcam frames for ML applications (pose estimation)☆13Updated last year
- ☆17Updated last year
- Extension of the `Attention Augmented Convolutional Networks` paper for 1-D convolution operation.☆25Updated 5 years ago
- SCAR-Net, Submission to the Cooking Activity Recognition Challenge, ABC: competition track☆11Updated last year
- Urban Sound Classification : striving towards a fair comparison☆16Updated 3 years ago
- Python wrapper for cross platform tesseract OCR engine with multiple languages (e.g. Bangla)☆17Updated last year
- Bangla text to speech, Multilingual (Bangla, English) real-time ([almost] in a GPU) speech synthesis library☆87Updated 3 weeks ago
- A simple wrapper to localize human joints from images/video frames for multiple subjects.☆13Updated last year
- A comprehensive collection of multilingual datasets and large language models, meticulously curated for evaluating and enhancing the perf…☆15Updated 5 months ago
- Classification of ECG signals by dot Residual LSTM Network for anomaly detection☆21Updated 4 years ago
- Emotional Video to Audio Transformation with ANFIS-DeepRNN (Vanilla RNN and LSTM-DeepRNN) [MPE 2020]☆25Updated 4 years ago
- An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras☆42Updated 2 years ago
- Library of TensorFlow layers for audio data processing and data augmentation☆20Updated 2 years ago
- Transformer based Bangla Speech Recognition☆51Updated last year
- ☆17Updated 4 years ago
- Audio processing using deep neural networks. Speaker identification using voice embeddings.☆13Updated last year
- Audio data augmentation examples☆35Updated 6 years ago
- Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and tensorflow☆26Updated 5 years ago
- Implementation of the paper "Speech emotion recognition with deep convolutional neural networks" by Dias Issa Et al.☆12Updated 2 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆23Updated 4 years ago
- Emotion recognition library for PyTorch☆20Updated 3 years ago
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel.☆38Updated 2 years ago
- COLA contrastive pre-training method implemented in PyTorch☆42Updated 3 years ago
- Features from audio: Spectrogram, (Wavelet Transform) Scalogram, (Q Transform) Spectrogram☆17Updated 6 years ago
- Best Collection of Articles and code for Audio Classification☆16Updated 5 years ago
- Bangla news classification and generation☆22Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- my codes for learning attention mechanism☆50Updated 4 years ago
- This project is about performing Speaker diarization for Hindi Language.☆45Updated 3 years ago