jh-cha-prml/JELLY

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jh-cha-prml/JELLY)

jh-cha-prml / JELLY

Code for the paper "JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis"

☆14

Alternatives and similar repositories for JELLY

Users that are interested in JELLY are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ECNU-Cross-Innovation-Lab / ENT
View on GitHub
[ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition
☆28Apr 11, 2024Updated 2 years ago
yangdongchao / Omni-AutoThink
View on GitHub
Adaptive Multimodal Reasoning via Reinforcement Learning
☆23Jan 11, 2026Updated 6 months ago
vivian556123 / NeurIPS2024-CoVoMix
View on GitHub
Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations
☆67Jan 16, 2025Updated last year
PeerGroup-JavaSpringBoot / SpringBootStudy
View on GitHub
🍀Spring Boot를 함께 공부하고 기록해나가는 공간입니다🍀
☆32Jul 8, 2022Updated 4 years ago
scutcsq / DWFormer
View on GitHub
DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)
☆69Jul 8, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
HappyColor / DrawSpeech_PyTorch
View on GitHub
☆25Nov 25, 2025Updated 8 months ago
Hertin / WavPrompt
View on GitHub
☆37Jun 30, 2022Updated 4 years ago
YuanGongND / llm_speech_emotion_challenge
View on GitHub
☆23Jun 24, 2024Updated 2 years ago
NeuroByte-Consulting / Speech-Emotion-Recognition-in-Tensorflow-Using-CNNs
View on GitHub
Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MF…
☆12Apr 28, 2025Updated last year
khfs / DuplexMamba
View on GitHub
☆18Mar 6, 2026Updated 4 months ago
gwh22 / LAFMA
View on GitHub
LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)
☆44Jun 13, 2024Updated 2 years ago
kimsunwiub / BLOOM-Net
View on GitHub
Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"
☆14Feb 13, 2022Updated 4 years ago
yongaifadian1 / MNV-17
View on GitHub
Qwen2.5-Omni fine-tuned on MNV-17 dataset for nonverbal vocalization recognition
☆31Nov 13, 2025Updated 8 months ago
cwang621 / blsp-emo
View on GitHub
BLSP-Emo: Towards Empathetic Large Speech-Language Models
☆62Jun 7, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Dalia-Sher / Speech-Emotion-Recognition-using-BLSTM-with-Attention
View on GitHub
We present a study of a neural network based method for speech emotion recognition, using audio-only features. In the studied scheme, the…
☆11Jul 24, 2024Updated 2 years ago
adelacvg / detail_tts
View on GitHub
All generative model in one for better TTS model
☆74Sep 8, 2024Updated last year
thuhcsi / VoxInstruct
View on GitHub
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling
☆100Nov 9, 2024Updated last year
zhenye234 / LLaSA_inference
View on GitHub
☆43Feb 8, 2025Updated last year
kaistmm / VoiceDiT
View on GitHub
[ICASSP2025] Official code for VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis
☆52Apr 9, 2025Updated last year
jisang93 / VISinger
View on GitHub
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆20May 12, 2023Updated 3 years ago
TomWang-NPU / MeCo
View on GitHub
AAMAS 2026: MeCo: Enhancing LLM-Empowered Multi-Robot Collaboration via Similar Task Memoization
☆32Jun 8, 2026Updated last month
nhut-ngnn / Multimodal-Speech-Emotion-Recognition
View on GitHub
A multimodal SER project combining BERT and ECAPA-TDNN with cross-attention-based fusion on the IEMOCAP dataset.
☆11Dec 9, 2024Updated last year
NVIDIA / nv-sflow
View on GitHub
A Python CLI workflow orchestrator with pluggable backends (e.g. local, Slurm) for running declarative YAML DAGs, collecting logs, and or…
☆38Updated this week
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
View on GitHub
☆12Jun 10, 2021Updated 5 years ago
prairie-schooner / wav2vec-vc
View on GitHub
☆10Mar 22, 2023Updated 3 years ago
y-ren16 / OV-InstructTTS
View on GitHub
☆22Jan 27, 2026Updated 6 months ago
hbwu-ntu / EmoCtrlTTS-Eval
View on GitHub
☆19Aug 23, 2024Updated last year
kaistmm / VoxMM
View on GitHub
☆23May 11, 2026Updated 2 months ago
zhuole1025 / LLMs_as_Visual_Explainers
View on GitHub
Official Repository for "LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions"
☆15Apr 20, 2025Updated last year
walker-hyf / NCSSD
View on GitHub
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆61Nov 1, 2024Updated last year
hs-oh-prml / ComVo
View on GitHub
[ICLR 2026] Official implementation of Toward Complex-Valued Neural Networks for Waveform Generation
☆20Apr 10, 2026Updated 3 months ago
glam-imperial / semantic_speech_emotion_recognition
View on GitHub
This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…
☆27Mar 18, 2021Updated 5 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
thuhcsi / mm2022-conversational-tts
View on GitHub
☆11May 9, 2023Updated 3 years ago
papercup-open-source / subscale-wavernn
View on GitHub
Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo
☆19Oct 8, 2020Updated 5 years ago
nlp-waseda / traveling-across-languages
View on GitHub
Official repo and evaluation implementation of KnowRecall and VisRecall
☆10May 22, 2025Updated last year
sh-lee-prml / PeriodWave
View on GitHub
The official Implementation of PeriodWave and PeriodWave-Turbo
☆225Apr 14, 2025Updated last year
yxlu-0102 / IDEA-TTS
View on GitHub
Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis
☆27Mar 21, 2025Updated last year
line / promptttspp
View on GitHub
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
☆86Oct 11, 2024Updated last year
pkgonan / kafka-listener
View on GitHub
Kafka runtime listener
☆17Nov 17, 2019Updated 6 years ago