ksanjeevan / torchparse
PyTorch Model Parser: Easily define models in .cfg file(s)
☆19Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for torchparse
- Sound Related Deep Learning Tasks boosting repository with pytorch☆86Updated 3 months ago
- Comprehensive Python library for speech and voice.☆33Updated last year
- Control mechanisms to the U-Net architecture for doing multiple source separation instruments☆48Updated 4 years ago
- Utils and data sets for audio and PyTorch☆83Updated 2 years ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated 2 years ago
- Speaker recognition ,Voiceprint recognition☆51Updated 4 years ago
- Auto Segmentation Criterion (ASG) implemented in pytorch☆51Updated 3 years ago
- ASR project with pytorch-lightning☆20Updated 4 years ago
- Spectra extraction tutorials based on torch and torchaudio.☆40Updated last year
- Feature extractor for DL speech processing.☆65Updated 2 years ago
- Sound event detection with depthwise separable and dilated convolutions.☆53Updated 4 years ago
- A PyTorch implementation of " AN EMPIRICAL STUDY OF CONV-TASNET "☆44Updated 4 years ago
- A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation☆137Updated 3 months ago
- Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT s…☆52Updated 9 months ago
- Baseline systems for the FSD50K dataset☆67Updated 3 years ago
- A collection of common functionality to simplify the design, training and evaluation of machine learning models based on pytorch with an …☆71Updated 4 months ago
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- ☆34Updated 5 years ago
- implement Wave-U-Net by pytorch☆56Updated 6 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 4 years ago
- Urban sound source tagging from an aggregation of four second noisy audio clips via 1D and 2D CNN (Xception)☆58Updated last year
- How to run GPU accelerated Signal Processing in TensorFlow☆23Updated 6 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆57Updated 4 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- Simple Speech Keyword Detecting with Depthwise Separable Convolutions | DLology☆41Updated 6 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆29Updated last year