echocatzh / torch-mfccLinks

A librosa STFT/Fbank/mfcc feature extration written up in PyTorch using 1D Convolutions.

☆78

Alternatives and similar repositories for torch-mfcc

Users that are interested in torch-mfcc are comparing it to the libraries listed below

Sorting:

yufan-aslp / AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…
☆122Updated 3 years ago
jingyonghou / KWS_Max-pooling_RHE
Mining effective negative training samples for keyword spotting (PyTorch)
☆62Updated 5 years ago
lenovo-voice / THE-2020-PERSONALIZED-VOICE-TRIGGER-CHALLENGE-BASELINE-SYSTEM
☆50Updated 4 years ago
FFSVC / FFSVC2022_Baseline_System
☆32Updated 2 years ago
yuyq96 / D-TDNN
PyTorch implementation of Densely Connected Time Delay Neural Network
☆88Updated 2 years ago
iariav / End-to-End-VAD
an Audio-Visual Voice Activity Detection using Deep Learning
☆49Updated 6 years ago
gemengtju / SpEx_Plus
SpEx+(tied) source code
☆86Updated 2 years ago
linan2 / Voice-activity-detection-VAD-paper-and-code
Voice activity detection (VAD) paper and code（From 198*~ ）and its classification.
☆101Updated last month
Ephrem-ETH / E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM
☆41Updated 2 years ago
zzpDapeng / speech_data_augment
A summary of speech data augment algorithms
☆69Updated 4 years ago
MihawkHu / DCASE2020_task1
Code for DCASE 2020 task 1a and task 1b.
☆87Updated 3 years ago
MrSupW / ICMC-ASR_Baseline
The baseline system for the ICASSP2024 ICMC-ASR Challenge.
☆52Updated last year
R1ckShi / AESRC2020
[ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…
☆55Updated 4 years ago
jsvir / vad
[Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection
☆32Updated 4 months ago
re9ulus / BC-ResNet
BC-ResNet for Keyword Spotting
☆39Updated 3 years ago
funcwj / aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
☆143Updated 2 years ago
dianwen-ng / Keyword-Spotting-ConvMixer
☆33Updated 2 years ago
Jasson-Chen / Add_noise_and_rir_to_speech
The purpose of this code base is to add a specified signal-to-noise ratio noise from MUSAN dataset to a pure speech signal and to generat…
☆29Updated 3 years ago
jiay7 / wenet_onlinedecode
Went online decode demo
☆30Updated 4 years ago
mycrazycracy / speaker-embedding-with-phonetic-information
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
☆45Updated 6 years ago
seorim0 / DCCRN-with-various-loss-functions
DCCRN with various loss functions
☆96Updated 2 years ago
felixfuyihui / AISHELL-4
☆128Updated 4 years ago
Windstudent / Complex-MTASSNet
Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.
☆93Updated 2 years ago
ConferencingSpeech / ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications
☆45Updated 3 years ago
iiscleap / E2E-NPLDA
End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation
☆22Updated 3 years ago
upskyy / Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆108Updated 3 years ago
RicherMans / CED
Source code for Consistent ensemble distillation for audio tagging
☆39Updated last month
hbredin / DomainAdversarialVoiceActivityDetection
Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"
☆23Updated 5 years ago
Anaesthesiaye / sound_event_detection_transformer
code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)
☆45Updated 3 years ago
nii-yamagishilab / Attention_Backend_for_ASV
Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances
☆50Updated 2 years ago