naxingyu/opensmile

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/naxingyu/opensmile)

naxingyu / opensmile

A github repo of the openSMILE feature extraction tool.

☆221

Alternatives and similar repositories for opensmile

Users that are interested in opensmile are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

audeering / opensmile
View on GitHub
The Munich Open-Source Large-Scale Multimedia Feature Extractor
☆840Jan 26, 2026Updated 5 months ago
audeering / opensmile-python
View on GitHub
Python package for openSMILE
☆327Jan 26, 2026Updated 5 months ago
changjenyin / DNN_HMM_RNN_speech
View on GitHub
"Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015
☆21Nov 25, 2016Updated 9 years ago
Renovamen / Speech-Emotion-Recognition
View on GitHub
Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别
☆1,310Mar 25, 2023Updated 3 years ago
carl03q / AudioClassifier
View on GitHub
A CNN audio classifier via spectrogram images.
☆10Jul 21, 2017Updated 9 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
covarep / covarep
View on GitHub
A Cooperative Voice Analysis Repository for Speech Technologies
☆378Jul 27, 2020Updated 5 years ago
deadshot465 / novelcrafter-mcp
View on GitHub
An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.
☆11Dec 3, 2024Updated last year
naxingyu / lstm-rnn
View on GitHub
Portal of Johannes and Felix's RNN implementation and further modifications for ASR
☆21Nov 27, 2014Updated 11 years ago
ucbvislab / p2fa-vislab
View on GitHub
A script for audio/transcript alignment. Fork of p2fa.
☆69Mar 15, 2018Updated 8 years ago
luojie1024 / MossQA-mnbvc
View on GitHub
本项目主要对开源的MOSS SFT数据进行整理，转换成mnbvc多轮对话格式。MOSS-003涵盖用性、忠实性、无害性三个层面，共353w样本，MOSS-003 包含更细粒度的有用性类别标记、更广泛的无害性数据和更长对话轮数，共630w样本，
☆13Dec 3, 2023Updated 2 years ago
jameslyons / python_speech_features
View on GitHub
This library provides common speech features for ASR including MFCCs and filterbank energies.
☆2,423Oct 20, 2021Updated 4 years ago
magesh-technovator / awesome-ai-applications
View on GitHub
A Comprehensive survey on business use cases of AI that help them thrive in the digital economy
☆13Oct 7, 2020Updated 5 years ago
idnavid / speech_activity_detection
View on GitHub
Unsupervised speech activity detection system.
☆11Jul 2, 2018Updated 8 years ago
Sehaba95 / Emotions-recognition-from-audio-signal
View on GitHub
Emotions recognition from audio signal using OpenSmile, PCA and set of classifiers from Scikit-learn library
☆47Jun 13, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
akhil2495 / multi-modal-emotion-recognition
View on GitHub
A repository for emotion recognition from speech, text and mocap data from IEMOCAP dataset
☆13Dec 12, 2018Updated 7 years ago
AudioVisualEmotionChallenge / AVEC2018
View on GitHub
Baseline scripts of the 8th Audio/Visual Emotion Challenge (AVEC 2018)
☆62Jul 4, 2018Updated 8 years ago
hellolzc / SpeechEmotionRecognition-emodb
View on GitHub
Speech Emotion Recognition
☆27Jun 19, 2020Updated 6 years ago
RayanWang / Speech_emotion_recognition_BLSTM
View on GitHub
Bidirectional LSTM network for speech emotion recognition.
☆266Mar 31, 2019Updated 7 years ago
Kyubyong / tacotron_asr
View on GitHub
Speech Recognition Using Tacotron
☆164Sep 20, 2017Updated 8 years ago
google / mcic-coco
View on GitHub
☆24Dec 22, 2016Updated 9 years ago
glam-imperial / semantic_speech_emotion_recognition
View on GitHub
This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…
☆27Mar 18, 2021Updated 5 years ago
scosman / voicebox
View on GitHub
Exploration: using technology to aid people who lack both the ability to speak and fine motor control.
☆21Oct 24, 2024Updated last year
syang1993 / FFTNet
View on GitHub
A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder
☆94Jul 17, 2018Updated 8 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
CSLT-THU / IS2019-VAE
View on GitHub
Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"
☆11Mar 24, 2023Updated 3 years ago
i3thuan5 / hts_engine_python
View on GitHub
python wrap for hts engine
☆14Jan 30, 2018Updated 8 years ago
vmasrani / dementia_classifier
View on GitHub
Code for my masters thesis
☆18Jul 6, 2023Updated 3 years ago
yyxxrr739 / autosar-rag
View on GitHub
This is a AUTOSAR documents specific retriever based on LLM and RAG.
☆16Nov 12, 2024Updated last year
srvk / lm_build
View on GitHub
Adapting your own Language Model for Kaldi
☆63Jan 8, 2019Updated 7 years ago
emptymalei / sci2fi
View on GitHub
从科学到科幻
☆16Sep 25, 2015Updated 10 years ago
r9y9 / VoiceConversion.jl
View on GitHub
[Deprecated] Statistical Voice Conversion in Julia. See the website link for new library
☆38Apr 15, 2017Updated 9 years ago
xuanjihe / speech-emotion-recognition
View on GitHub
speech emotion recognition using a convolutional recurrent networks based on IEMOCAP
☆409Jul 8, 2019Updated 7 years ago
hkveeranki / speech-emotion-recognition
View on GitHub
Speaker independent emotion recognition
☆331Jun 26, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
leynos / novelcrafter-prompts
View on GitHub
☆14Apr 26, 2025Updated last year
david-yoon / multimodal-speech-emotion
View on GitHub
TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18
☆297Jun 17, 2024Updated 2 years ago
olix20 / google_keyword_detection_challenge
View on GitHub
https://www.kaggle.com/c/tensorflow-speech-recognition-challenge/
☆21Mar 1, 2018Updated 8 years ago
lingochamp / kaldi-ctc
View on GitHub
Connectionist Temporal Classification (CTC) Automatic Speech Recognition
☆295Mar 6, 2018Updated 8 years ago
rkadlec / asreader
View on GitHub
This is an implementation of the Attention Sum Reader model as presented in "Text Comprehension with the Attention Sum Reader Network" av…
☆98Sep 9, 2016Updated 9 years ago
rtmdrr / replicability-analysis-NLP
View on GitHub
☆15Oct 19, 2020Updated 5 years ago
aalto-speech / speaker-diarization
View on GitHub
Speaker diarization scripts, based on AaltoASR
☆191Jan 3, 2019Updated 7 years ago