Speech Emotion Recognition
☆43Aug 22, 2023Updated 2 years ago
Alternatives and similar repositories for speech-emotion-webapp
Users that are interested in speech-emotion-webapp are comparing it to the libraries listed below
Sorting:
- ☆15Nov 11, 2024Updated last year
- ☆13Sep 1, 2023Updated 2 years ago
- A Python library for high-quality, fast, and customizable dynamic audio compression and peak limiting.☆15Oct 24, 2025Updated 4 months ago
- MFA acoustic model training based on Opencpop☆15Sep 23, 2022Updated 3 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆24Oct 19, 2023Updated 2 years ago
- text to speech☆10Mar 19, 2024Updated last year
- camera monitoring and alerts using deepstack☆13Jun 2, 2020Updated 5 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- We present a study of a neural network based method for speech emotion recognition, using audio-only features. In the studied scheme, the…☆11Jul 24, 2024Updated last year
- ☆11Nov 7, 2024Updated last year
- ☆14Jun 16, 2023Updated 2 years ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Apr 23, 2024Updated last year
- ☆23Feb 27, 2021Updated 5 years ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Aug 16, 2023Updated 2 years ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆35Aug 1, 2025Updated 7 months ago
- ☆26Sep 22, 2022Updated 3 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- 22人で童謡を5曲ずつ歌ってつくった歌唱データベースです。☆14Aug 7, 2022Updated 3 years ago
- 单独维护的中文TTS☆34Oct 28, 2022Updated 3 years ago
- ☆25Apr 18, 2025Updated 10 months ago
- Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition☆11Oct 24, 2023Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- 使用ONNXRuntime部署E2Pose人体关键点检测,一共包含20个onnx模型,依然是C++和Python两个版本的程序☆16Dec 15, 2022Updated 3 years ago
- End-To-End SpeechSynthesis system with knowledge distillation☆18Jul 16, 2022Updated 3 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 7 months ago
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- Speech Emotion Recognition (SER) in real-time, using Deep Neural Networks (DNN) of Long Short Memory Term (LSTM).☆116Mar 6, 2022Updated 4 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆19Feb 9, 2025Updated last year
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- ☆20May 7, 2025Updated 9 months ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Extension program for DIFF-SVC to make it more easy to use☆16Dec 26, 2022Updated 3 years ago
- Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.☆30Sep 16, 2022Updated 3 years ago
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆24Nov 12, 2025Updated 3 months ago
- Unsupervised WaveNet-based Singing Voice Conversion Using Pitch Augmentation and Two-phase Approach☆70Oct 27, 2022Updated 3 years ago
- ☆19Feb 2, 2023Updated 3 years ago