cnlinxi / speech_emotion
Detect emotion from audio
☆13 · Nov 20, 2018 · Updated 7 years ago
Alternatives and similar repositories for speech_emotion
Users who are interested in speech_emotion are comparing it to the libraries listed below.
- Google's TPGST reimplementation. ☆34 · Dec 11, 2019 · Updated 6 years ago
- LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi. ☆22 · Jul 12, 2019 · Updated 6 years ago
- Onset-and-Offset-Aware Sound Event Detection ☆20 · Feb 10, 2025 · Updated last year
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Mar 30, 2021 · Updated 4 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni Hindi ASR challenge ☆15 · Mar 26, 2022 · Updated 3 years ago
- Evaluation of STT models for the German language ☆15 · Jan 22, 2022 · Updated 4 years ago
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement ☆16 · Jul 11, 2025 · Updated 7 months ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- PyTorch speech2text inference script for the NVIDIA openseq2seq wav2letter model variant ☆10 · Aug 12, 2019 · Updated 6 years ago
- A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code… ☆10 · Dec 25, 2019 · Updated 6 years ago
- Java Bindings for the C++ library DeepSpeech ☆10 · Jun 4, 2020 · Updated 5 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP) ☆11 · Dec 4, 2023 · Updated 2 years ago
- Sing any popular song with your voice ☆11 · Jul 10, 2022 · Updated 3 years ago
- Launch your speech synthesis within one minute. ☆12 · May 6, 2024 · Updated last year
- ☆14 · Jun 12, 2015 · Updated 10 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations. ☆36 · Jun 25, 2024 · Updated last year
- This repository provides data and code for the "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper. ☆16 · Jul 22, 2021 · Updated 4 years ago
- Implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING" ☆36 · Dec 8, 2019 · Updated 6 years ago
- ☆17 · Nov 25, 2019 · Updated 6 years ago
- Anonymous ICLR Submission ☆14 · Sep 25, 2019 · Updated 6 years ago
- C++ inference for EmotiVoice ☆16 · Jan 1, 2024 · Updated 2 years ago
- Example workflow for our data-centric speech benchmark ☆17 · Jul 6, 2023 · Updated 2 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa… ☆17 · May 15, 2015 · Updated 10 years ago
- This is the official repository of "Scalable Neural Vocoder from Range-Null Space Decomposition", which is submitted to TPAMI. ☆34 · Oct 11, 2025 · Updated 4 months ago
- ☆17 · Jul 22, 2024 · Updated last year
- This repository is for wake-word detection in speech using recurrent neural networks ☆17 · Feb 25, 2019 · Updated 6 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM ☆18 · May 17, 2024 · Updated last year
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone ☆35 · Feb 18, 2022 · Updated 3 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit… ☆22 · Jan 10, 2025 · Updated last year
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System ☆15 · Mar 31, 2019 · Updated 6 years ago
- ☆20 · Jul 22, 2022 · Updated 3 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias ☆20 · Jul 21, 2020 · Updated 5 years ago
- TPSE-GST Tacotron2 ☆14 · May 1, 2019 · Updated 6 years ago
- Scripts for training Kaldi for German speech recognition (ASR). ☆26 · Feb 11, 2021 · Updated 5 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION", which was accepted at EUSIPCO 2022 ☆15 · Jun 18, 2022 · Updated 3 years ago
- Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech ☆21 · Mar 21, 2022 · Updated 3 years ago
- Transcribe audio feeds into a public web UI ☆45 · Aug 31, 2022 · Updated 3 years ago
- MirasVoice is a data set consisting of speech samples from bilinguals to train a neural network for optimization of speaker verification algor… ☆19 · Mar 15, 2020 · Updated 5 years ago
- Speech Resynthesis and Language Modeling ☆27 · Jun 11, 2025 · Updated 8 months ago