seungheondoh/hi_kia

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/seungheondoh/hi_kia)

seungheondoh / hi_kia

wake-up word emotion recognition [APSIPA 2022]

☆17

Alternatives and similar repositories for hi_kia

Users that are interested in hi_kia are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

revsic / torch-retriever-vc
View on GitHub
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
cyhuang-tw / robust-vc
View on GitHub
☆11May 7, 2022Updated 4 years ago
IU-SAIGE / pse
View on GitHub
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆23Mar 12, 2023Updated 3 years ago
TTS-Research / PEL-TTS
View on GitHub
☆14Aug 16, 2023Updated 2 years ago
olami-developers / olami-android-hotword-detect-sdk
View on GitHub
Hotword Detection (Wake Word Detection) Android library and sample codes
☆11Apr 9, 2018Updated 8 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
W-Wu / ERC-SLT22
View on GitHub
Code for "Distribution-based Emotion Recognition in Conversation"
☆18Feb 6, 2023Updated 3 years ago
MiniXC / LightningFastSpeech2
View on GitHub
☆55Jan 13, 2023Updated 3 years ago
freds0 / CML-TTS-Dataset
View on GitHub
CML-TTS: A Multilingual Dataset for Speech Synthesis
☆36Jul 31, 2024Updated last year
uark-cviu / Right2Talk
View on GitHub
[ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach
☆20Aug 2, 2021Updated 4 years ago
walker-hyf / MnTTS
View on GitHub
MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)
☆23Dec 5, 2022Updated 3 years ago
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
hcy71o / MB-iSTFT-VITS-with-AutoVocoder
View on GitHub
Incorporating AutoVocoder to MB-iSTFT-VITS
☆47Dec 1, 2022Updated 3 years ago
LeadingIndiaAI / Wake-UP-word-detection
View on GitHub
Wake-up-word(WUW)system is an emerging development in recent times. Voice interaction with systems have made life ease and aids in multi-…
☆18Mar 11, 2019Updated 7 years ago
lucasjinreal / textfrontend
View on GitHub
单独维护的中文TTS
☆34Oct 28, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
hcy71o / SNAC
View on GitHub
Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…
☆57Aug 7, 2023Updated 2 years ago
DDATT / Vits2-onnx-cpp
View on GitHub
Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++
☆19Apr 17, 2024Updated 2 years ago
MaxMax2016 / max-vc
View on GitHub
singing voice conversion without f0
☆23May 10, 2023Updated 3 years ago
tuanio / nextformer
View on GitHub
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10Dec 15, 2022Updated 3 years ago
Liangzheng-ZL / BEdit-TTS
View on GitHub
Speech samples and code of BEdit-TTS
☆34Oct 8, 2023Updated 2 years ago
seungheondoh / musical-word-embedding
View on GitHub
Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]
☆29Apr 23, 2024Updated 2 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
AsoSoft / AsoSoft-TTS-Speech-Corpus-for-Central-Kurdish
View on GitHub
AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech
☆23Jun 24, 2022Updated 4 years ago
r9y9 / icassp2020-espnet-tts-merlin-baseline
View on GitHub
ICASSP 2020 ESPnet-TTS: Merlin baseline system
☆37Oct 28, 2019Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
cpii-cai / PunCantonese
View on GitHub
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15Dec 3, 2024Updated last year
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
LianjiaTech / athena
View on GitHub
An open-source implementation of sequence-to-sequence based speech processing engine
☆39Jan 11, 2023Updated 3 years ago
michaelneri / unsupervised-audio-anomaly-detection
View on GitHub
Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …
☆11Nov 6, 2024Updated last year
giovana-morais / steme
View on GitHub
[ICASSP 2023] Tempo vs. Pitch: understanding self-supervised tempo estimation
☆13Aug 2, 2023Updated 2 years ago
Hayeonbang / PIAST
View on GitHub
A piano music dataset with Audio, Symbolic and Text labels
☆36Mar 6, 2025Updated last year
Sreyan88 / LipGER
View on GitHub
Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆19Jul 16, 2024Updated 2 years ago
hongwen-sun / speech-aligner
View on GitHub
speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…
☆15Dec 19, 2018Updated 7 years ago
prosodylab / prosobeast-annotation-tool
View on GitHub
☆41Feb 16, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
WangHelin1997 / Automatic_Speech_Annotator
View on GitHub
Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…
☆33Jun 14, 2024Updated 2 years ago
jisang93 / VISinger
View on GitHub
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆20May 12, 2023Updated 3 years ago
pengzhendong / streaming-vocos
View on GitHub
Streaming Vocos
☆31Jun 10, 2025Updated last year
haoheliu / ontology-aware-audio-tagging
View on GitHub
☆14Nov 22, 2022Updated 3 years ago
eloimoliner / audio-inpainting-diffusion
View on GitHub
☆74Apr 4, 2024Updated 2 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
Koziev / StressModel
View on GitHub
Neural model for prediction of stress position in Russian words
☆13Jun 22, 2025Updated last year