A pytorch implementation of MFCC.
☆33Jun 1, 2022Updated 3 years ago
Alternatives and similar repositories for pytorch-mfcc
Users that are interested in pytorch-mfcc are comparing it to the libraries listed below
Sorting:
- ☆36Aug 30, 2019Updated 6 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- Calculate MFCC/Fbank feature for wav files☆15Nov 21, 2017Updated 8 years ago
- Targeted Adversarial Examples for Black Box Audio Systems☆70Aug 27, 2020Updated 5 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆20Sep 5, 2023Updated 2 years ago
- video cut powered by AI☆24Nov 15, 2022Updated 3 years ago
- ☆61Jan 31, 2023Updated 3 years ago
- Adversarial Attacks☆60Mar 22, 2019Updated 6 years ago
- ☆12Dec 20, 2018Updated 7 years ago
- A librosa STFT/Fbank/mfcc feature extration written up in PyTorch using 1D Convolutions.☆79Aug 19, 2022Updated 3 years ago
- Operating tools for texture bank files.☆10Nov 2, 2016Updated 9 years ago
- ☆10Oct 20, 2022Updated 3 years ago
- Vietnamese diacritics restoration☆14Jan 18, 2016Updated 10 years ago
- Devil-Whisper-Attack☆36Mar 31, 2025Updated 11 months ago
- Global Open Simulator☆10May 5, 2025Updated 9 months ago
- Implementation of Adversarial Attacks on GMM i-vector based Speaker Verification Systems (ICASSP2020) https://arxiv.org/abs/1911.03078☆35Mar 9, 2020Updated 5 years ago
- Implementation of Phase-aware speech enhancement with deep complex U-Net☆41Oct 3, 2023Updated 2 years ago
- Directional sparse filtering for blind speech separation☆10Jun 8, 2021Updated 4 years ago
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆15Oct 28, 2024Updated last year
- Code and some materials from the papers "Selection of Source Images Heavily Influences the Effectiveness of Adversarial Attacks" (BMVC 20…☆12Nov 23, 2021Updated 4 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- https://youtu.be/pE7UOYioPKk☆10Feb 16, 2023Updated 3 years ago
- Greedy Adaptive Dictionary (GAD) is a learning algorithm that sets out to find sparse atoms for speech signals.☆11Oct 1, 2018Updated 7 years ago
- evaluation of shot detection results using the RAI dataset☆10Jun 7, 2018Updated 7 years ago
- ☆12Dec 30, 2020Updated 5 years ago
- A Pytorch implementation of triplet loss on VoxCeleb1☆12Oct 16, 2019Updated 6 years ago
- Training a BERT model from scratch.☆11Oct 15, 2023Updated 2 years ago
- 开发成长路上☆10Dec 25, 2018Updated 7 years ago
- Third place of 2021 IEEE GRSS Data Fusion Contest: Track MSD☆10Mar 31, 2021Updated 4 years ago
- Adversarial attack and defense strategies for deep speaker recognition systems☆42Feb 18, 2021Updated 5 years ago
- This is the implementation of the paper "Adversarial Attacks on Spoofing Countermeasures of automatic speaker verification".☆42Mar 9, 2023Updated 2 years ago
- This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets re…☆13Oct 8, 2025Updated 4 months ago
- ☆11Mar 22, 2023Updated 2 years ago
- PyTorch implementation of Continuous Speech Separation☆12Oct 5, 2022Updated 3 years ago
- Fooling neural based speech recognition systems.☆14Jun 9, 2017Updated 8 years ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- ☆10Jan 29, 2019Updated 7 years ago
- Some PyTorch code for the Kaggle Speech Recognition Challenge☆12Feb 7, 2019Updated 7 years ago
- useful things that work with NVIDIA NeMo library☆14Jan 20, 2024Updated 2 years ago