audeering / opensmile
The Munich Open-Source Large-Scale Multimedia Feature Extractor
☆617 Updated last year
Alternatives and similar repositories for opensmile:
Users who are interested in opensmile are comparing it to the libraries listed below.
- Python package for openSMILE ☆259 Updated last month
- feature extraction from speech signals ☆363 Updated last week
- spafe: Simplified Python Audio Features Extraction ☆464 Updated 6 months ago
- A Cooperative Voice Analysis Repository for Speech Technologies ☆353 Updated 4 years ago
- CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Has been designed for large scale gender … ☆774 Updated last week
- A collection of datasets for the purpose of emotion recognition/detection in speech. ☆309 Updated 3 months ago
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks… ☆502 Updated 2 years ago
- A GitHub repo of the openSMILE feature extraction tool. ☆213 Updated 3 years ago
- A library for speech data augmentation in the time domain ☆653 Updated 3 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc… ☆366 Updated last month
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition ☆475 Updated 3 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning. ☆983 Updated this week
- PyTorch implementation of deep audio embedding calculation ☆101 Updated last year
- List of speech synthesis papers. ☆1,017 Updated last year
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech. ☆240 Updated 2 years ago
- A Python library for working with Praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting … ☆320 Updated last year
- Understanding emotions from audio files using neural networks and multiple datasets. ☆416 Updated last year
- End-to-End Neural Diarization ☆386 Updated 3 years ago
- The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection" ☆376 Updated 5 months ago
- Audio processing using a PyTorch 1D convolution network ☆1,042 Updated 11 months ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR ☆930 Updated last year
- Large, modern dataset for speech recognition ☆656 Updated 10 months ago
- Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit ☆799 Updated last week
- A library for soundscape synthesis and augmentation ☆389 Updated 2 years ago
- Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks ☆429 Updated 4 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196 ☆308 Updated 4 years ago
- Command line utility for forced alignment using Kaldi ☆1,388 Updated last month
- OpenL3: Open-source deep audio and image embeddings ☆483 Updated last year
- SincNet is a neural architecture for efficiently processing raw audio samples. ☆1,146 Updated 3 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al. ☆580 Updated 2 years ago