deeplyinc/Nonverbal-Vocalization-Dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/deeplyinc/Nonverbal-Vocalization-Dataset)

deeplyinc / Nonverbal-Vocalization-Dataset

☆44

Alternatives and similar repositories for Nonverbal-Vocalization-Dataset

Users that are interested in Nonverbal-Vocalization-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

deeplyinc / Parent-Child-Vocal-Interaction-Dataset
View on GitHub
☆13Jan 13, 2022Updated 4 years ago
ajd12342 / paraspeechclap
View on GitHub
Codebase for 'ParaSpeechCLAP: A Dual-Encoder Speech-Text Model for Rich Stylistic Language-Audio Pretraining'
☆23Jun 20, 2026Updated last month
fredbuhl / EmoGator
View on GitHub
☆32Jun 30, 2023Updated 3 years ago
YuanGongND / vocalsound
View on GitHub
Dataset and baseline code for the VocalSound dataset (ICASSP2022).
☆165Nov 12, 2022Updated 3 years ago
voberoi / voice-search-with-whisper-duckdb-and-metaphone
View on GitHub
This repository is a voice search demo using OpenAI Whisper, DuckDB, and the Metaphone algorithm. The associate blog post is here: https:…
☆13May 15, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
DDATT / Vits2-onnx-cpp
View on GitHub
Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++
☆19Apr 17, 2024Updated 2 years ago
cyhuang-tw / robust-vc
View on GitHub
☆11May 7, 2022Updated 4 years ago
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
HumeAI / competitions
View on GitHub
Hume AI ML Competitions
☆31Apr 7, 2026Updated 3 months ago
JunhoKim94 / ASR_project
View on GitHub
This repository created for the NHN ASR hackathon competition.
☆11Sep 20, 2023Updated 2 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
ZackHodari / discrete_intonation
View on GitHub
Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…
☆17May 24, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
shtoshni / g2p
View on GitHub
Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models
☆15Feb 20, 2019Updated 7 years ago
JarbasAl / kaldi_spotter
View on GitHub
wake word spotting with kaldi
☆19Dec 3, 2020Updated 5 years ago
desh2608 / diarizer
View on GitHub
Clustering-based methods for overlapping diarization
☆84Jan 12, 2024Updated 2 years ago
SpeechClub / CDER_Metric
View on GitHub
CDER (Conversational Diarization Error Rate) Scoring Tool
☆22Sep 13, 2022Updated 3 years ago
Speech-Lab-IITM / English_ASR_Challenge
View on GitHub
English ASR Challenge organized by Speech Lab, IIT Madras
☆10Feb 3, 2021Updated 5 years ago
V-Cloak / V-Cloak
View on GitHub
☆25Dec 20, 2022Updated 3 years ago
mush42 / mantoq
View on GitHub
Arabic Grapheme-to-Phoneme (G2P) Conversion
☆16Mar 15, 2025Updated last year
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
SuperKogito / pydiogment
View on GitHub
Python library for audio augmentation
☆84Jul 6, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
FantSun / Speechflow
View on GitHub
Speechflow for emotion recognition related information decomposition
☆10Jul 27, 2021Updated 4 years ago
omine-me / LaughterSegmentation
View on GitHub
2024 Latest laughter detection & segmentaion model. Paper: "Robust Laughter Segmentation with Automatic Diverse Data Synthesis", Interspe…
☆66Sep 1, 2024Updated last year
bagustris / s3prl-ser
View on GitHub
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15Feb 28, 2026Updated 4 months ago
Aria-K-Alethia / laughter-synthesis
View on GitHub
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆77Jul 16, 2023Updated 3 years ago
hbwu-ntu / EmoCtrlTTS-Eval
View on GitHub
☆19Aug 23, 2024Updated last year
amazon-science / proteno
View on GitHub
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…
☆45May 25, 2021Updated 5 years ago
seanghay / vits.cpp
View on GitHub
VITS Inference using ONNX Runtime on C++
☆13Dec 25, 2023Updated 2 years ago
Taltt / FNSE-SBGAN
View on GitHub
FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks
☆20May 12, 2025Updated last year
NeuroLexDiagnostics / DigiPsych_Prosody
View on GitHub
UW DigiPsych Prosody Feature Extraction Repository
☆13May 16, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
finecodekr / addresskr
View on GitHub
☆11May 27, 2026Updated last month
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago
adasegroup / OSM-one-shot-multispeaker
View on GitHub
Framework for one-shot multispeaker system based on Deep Learning
☆19May 30, 2021Updated 5 years ago
knottwill / sesame-finetune
View on GitHub
Finetune Sesame AI's conversational speech model on new languages and voices. Blog post: https://blog.speechmatics.com/sesame-finetune
☆113Sep 27, 2025Updated 9 months ago
xinshengwang / robpitch
View on GitHub
A pitch detection model trained to be robust against noise and reverberation environments.
☆27Jan 21, 2025Updated last year
ogunlao / glowtts_stdp
View on GitHub
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆19Jun 5, 2023Updated 3 years ago
MiniXC / LightningFastSpeech2
View on GitHub
☆55Jan 13, 2023Updated 3 years ago