用于机器学习的语音特征提取,包含FBank和MFCC等,原理讲解和step by step的实现
☆53May 17, 2019Updated 6 years ago
Alternatives and similar repositories for SpeechProcessForMachineLearning
Users that are interested in SpeechProcessForMachineLearning are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of Continuous Speech Separation☆12Oct 5, 2022Updated 3 years ago
- deep neural network based workflow for noise suppression and signal recovery of real-world LIGO observational data☆16Mar 14, 2024Updated last year
- Calculate MFCC/Fbank feature for wav files☆15Nov 21, 2017Updated 8 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆67Nov 28, 2018Updated 7 years ago
- Flask webapp/endpoint that compares the user's speech with different accents and assigns similarity scores based on speed, voice (DTW/MFC…☆18Jun 27, 2017Updated 8 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LST…☆54Mar 24, 2023Updated 2 years ago
- Can audio-visual integration strengthen robustness under multimodal attacks?☆29Mar 31, 2022Updated 3 years ago
- 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition☆118Jun 22, 2022Updated 3 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Apr 11, 2022Updated 3 years ago
- A summary of speech data augment algorithms☆69Jan 12, 2021Updated 5 years ago
- ☆15Jun 30, 2025Updated 8 months ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆379Jul 21, 2022Updated 3 years ago
- ☆12Aug 2, 2024Updated last year
- Prior Sampling for high dimension data with domain knowledge.☆10Jan 11, 2022Updated 4 years ago
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,422Oct 20, 2021Updated 4 years ago
- 采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。☆70Jan 26, 2019Updated 7 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆139Jun 7, 2021Updated 4 years ago
- wechatter: An easy Conversation AI Chatbot Framework☆10Apr 15, 2021Updated 4 years ago
- C++ iterator that performs the cartesian product of many containers.☆12Jan 12, 2016Updated 10 years ago
- ☆10Jul 24, 2019Updated 6 years ago
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models☆27Feb 13, 2026Updated 3 weeks ago
- KENN - A Transformer Encoder for Gravitational Waves☆11Sep 29, 2025Updated 5 months ago
- 準備CPE大學程式能力檢定考試,CPE的考題皆出自於UVa。☆12Apr 17, 2017Updated 8 years ago
- It's yet another static site generator. Have you seen jekyll? hyde? Yup. Like those.☆49Aug 24, 2021Updated 4 years ago
- Study with M_Studio RPG Course☆12Oct 12, 2022Updated 3 years ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆17Nov 19, 2025Updated 3 months ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- Published by Packt☆11Jan 18, 2021Updated 5 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- Operating System for your EON Gold☆13Dec 19, 2018Updated 7 years ago
- PowerDEVS is an integrated tool for hybrid systems modeling and simulation based on the DEVS formalism.☆12Mar 20, 2021Updated 4 years ago
- Gravitational wave interferometer parameter optimisation game, written in Python and run in a Jupyter notebook.☆10Dec 18, 2018Updated 7 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system.☆97Sep 3, 2020Updated 5 years ago
- ☆42Nov 22, 2024Updated last year
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆808Apr 6, 2023Updated 2 years ago
- Moved to https://github.com/tier4/autoware_launch.☆11Mar 21, 2023Updated 2 years ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 5 months ago
- Keras BERT with pre-trained weights☆10Feb 10, 2019Updated 7 years ago