A pytorch implementation of MFCC.
☆33Jun 1, 2022Updated 3 years ago
Alternatives and similar repositories for pytorch-mfcc
Users that are interested in pytorch-mfcc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆36Aug 30, 2019Updated 6 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- A librosa STFT/Fbank/mfcc feature extration written up in PyTorch using 1D Convolutions.☆79Aug 19, 2022Updated 3 years ago
- Targeted Adversarial Examples for Black Box Audio Systems☆69Aug 27, 2020Updated 5 years ago
- Adversarial Attacks☆21Oct 11, 2021Updated 4 years ago
- Calculate MFCC/Fbank feature for wav files☆15Nov 21, 2017Updated 8 years ago
- Speech Commands Recognition using end-to-end deep learning models in pytorch☆28Oct 8, 2020Updated 5 years ago
- video cut powered by AI☆24Nov 15, 2022Updated 3 years ago
- Adversarial Attacks☆61Mar 22, 2019Updated 7 years ago
- ICLR 2019 Paper, "Characterizing Audio Adversarial Examples using Temporal Dependency".☆12Apr 3, 2019Updated 6 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆20Sep 5, 2023Updated 2 years ago
- Official Implementation of "Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models"☆18Sep 6, 2023Updated 2 years ago
- 小样本检测☆18May 28, 2021Updated 4 years ago
- An implement of SPEECHSPLIT☆15Sep 12, 2020Updated 5 years ago
- ☆11Apr 18, 2021Updated 4 years ago
- Devil-Whisper-Attack☆36Mar 31, 2025Updated 11 months ago
- Repository for author masking☆13Oct 29, 2018Updated 7 years ago
- ☆61Jan 31, 2023Updated 3 years ago
- Code for "Improving Robustness of Vision Transformers by Reducing Sensitivity to Patch Corruptions"☆14Sep 3, 2023Updated 2 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆13Apr 8, 2020Updated 5 years ago
- Converts CLIP models to ONNX☆10Jan 17, 2023Updated 3 years ago
- ☆14Mar 16, 2020Updated 6 years ago
- ☆25Nov 23, 2021Updated 4 years ago
- A pytorch implementation of "Ensemble Adversarial Training : Attacks and Defenses"☆10Sep 4, 2019Updated 6 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆43Jul 17, 2020Updated 5 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- Code release for Grad-CAM Guided Attention Module for Fine-grained Visual Classification (MLSP 2022)☆13Aug 25, 2021Updated 4 years ago
- [AAAI 2024] V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models☆27Dec 14, 2023Updated 2 years ago
- ☆37Mar 30, 2021Updated 4 years ago
- Fooling neural based speech recognition systems.☆14Jun 9, 2017Updated 8 years ago
- Targeted Adversarial Examples on Speech-to-Text systems☆309Jul 24, 2022Updated 3 years ago
- Implementation of Adversarial Attacks on GMM i-vector based Speaker Verification Systems (ICASSP2020) https://arxiv.org/abs/1911.03078☆35Mar 9, 2020Updated 6 years ago
- A pytorch implementation of xvector embedding☆79Mar 28, 2020Updated 5 years ago
- 5th place solution for ACM MM2021 Robust Logo Detection Grand Challenge☆13Dec 25, 2022Updated 3 years ago
- Adversarial attack and defense strategies for deep speaker recognition systems☆41Feb 18, 2021Updated 5 years ago
- This is the implementation of the paper "Adversarial Attacks on Spoofing Countermeasures of automatic speaker verification".☆42Mar 9, 2023Updated 3 years ago
- Vietnamese diacritics restoration☆14Jan 18, 2016Updated 10 years ago
- The original code for the data providers and the datasets of the paper "Defining Benchmarks for Continual Few-Shot Learning".☆16Apr 15, 2020Updated 5 years ago
- Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024)☆78Feb 27, 2025Updated last year