audeering / opensmileLinks
The Munich Open-Source Large-Scale Multimedia Feature Extractor
☆680Updated last year
Alternatives and similar repositories for opensmile
Users that are interested in opensmile are comparing it to the libraries listed below
Sorting:
- Python package for openSMILE☆281Updated 6 months ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆346Updated 8 months ago
- Crowd Sourced Emotional Multimodal Actors Dataset (CREMA-D)☆439Updated 3 months ago
- feature extraction from speech signals☆376Updated last week
- spafe: Simplified Python Audio Features Extraction☆475Updated 3 months ago
- OpenL3: Open-source deep audio and image embeddings☆523Updated 2 years ago
- A github repo of the openSMILE feature extraction tool.☆217Updated 3 years ago
- Large, modern dataset for speech recognition☆677Updated last year
- A Cooperative Voice Analysis Repository for Speech Technologies☆364Updated 4 years ago
- Python library for downloading, loading & working with sound datasets☆342Updated last month
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆445Updated 5 years ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆973Updated last year
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender …☆815Updated 5 months ago
- Acoustic feature extraction using Librosa library and openSMILE toolkit.使用Librosa音频处理库和openSMILE工具包,进行简单的声学特征提取☆201Updated 5 years ago
- A library for speech data augmentation in time-domain☆664Updated 3 years ago
- Metadata, scripts and baselines for the MTG-Jamendo dataset☆319Updated this week
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆329Updated last year
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,052Updated 5 months ago
- Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm☆705Updated 11 months ago
- Unsupervised Speech Decomposition Via Triple Information Bottleneck☆687Updated 8 months ago
- A Python wrapper for the high-quality vocoder "World"☆755Updated 5 months ago
- This is the GitHub page for publicly available emotional speech data.☆354Updated 3 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.☆860Updated 4 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,607Updated last year
- List of speech synthesis papers.☆1,045Updated last year
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆483Updated 3 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆367Updated 2 years ago
- An open source dataset for source separation☆429Updated last year
- ☆1,496Updated 10 months ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,336Updated last year