ICASSP 2026 HumDial Challenge
☆35Dec 13, 2025Updated 2 months ago
Alternatives and similar repositories for Hum-Dial
Users that are interested in Hum-Dial are comparing it to the libraries listed below
- ☆30Sep 15, 2025Updated 5 months ago
- ☆18May 27, 2025Updated 9 months ago
- ☆29Nov 4, 2025Updated 3 months ago
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆63Dec 26, 2025Updated 2 months ago
- An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.☆220Jan 20, 2026Updated last month
- EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning [🔥The Exploration of R1 for General Audio-Vi…☆74May 18, 2025Updated 9 months ago
- Dataset☆30Jul 31, 2025Updated 7 months ago
- The official implementation of the DIFFA series for dLLM-based large audio language model☆59Feb 2, 2026Updated 3 weeks ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- ☆38Apr 3, 2025Updated 10 months ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆16Nov 19, 2025Updated 3 months ago
- Eureka-Audio: A 1.7B lightweight audio–language model that matches 7B–30B models on ASR, audio understanding, and paralinguistic reasonin…☆25Updated this week
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆43Mar 3, 2025Updated 11 months ago
- Vox-Profile Benchmark☆69Feb 16, 2026Updated last week
- ☆31Oct 28, 2025Updated 4 months ago
- A text normalization framework using GBM and human-generated features☆10Feb 4, 2020Updated 6 years ago
- This repository follows Luke Smith's LaTeX Resume Tutorial.☆10Jun 21, 2019Updated 6 years ago
- A GMM-based isolated-word speech recognition system for the digits 0-9☆10Sep 29, 2020Updated 5 years ago
- NEAL (Nature+Energy Audio Labeller) is an open-source interactive audio data annotation tool.☆16Apr 7, 2025Updated 10 months ago
- LaTeX tutorial using Texmaker. This repository follows [Michelle Krummel's Tutorial](https://www.youtube.com/watch?v=SoDv0qhyysQ&list=PL1…☆11Jun 14, 2018Updated 7 years ago
- PyTorch implementation of "DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement".☆14Apr 4, 2023Updated 2 years ago
- People who suffer from low vision, sight and visual impairment are not able to see words and letters in ordinary newsprint, books and mag…☆10Oct 1, 2020Updated 5 years ago
- Improving beat tracking algorithms with recurrent neural networks.☆11Jan 7, 2019Updated 7 years ago
- PyTorch implementation of the paper Using Pairwise Link Prediction and Graph Attention Networks for Music Structure Analysis presented at…☆20Apr 2, 2025Updated 10 months ago
- PyTorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T…☆12Mar 9, 2024Updated last year
- Facial Alignment for Anime Styled Faces☆10Mar 26, 2021Updated 4 years ago
- ☆16Nov 11, 2025Updated 3 months ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.☆481Nov 23, 2025Updated 3 months ago
- An object-oriented interface for abstracting away the ugly parts of ad server APIs☆14Apr 8, 2016Updated 9 years ago
- ☆11Dec 17, 2025Updated 2 months ago
- pix2pix TensorFlow Implementation☆13Nov 20, 2018Updated 7 years ago
- ☆13Jan 12, 2023Updated 3 years ago
- ☆11Feb 5, 2019Updated 7 years ago
- Final training script from the Hugging Face Whisper fine-tuning event, intended to get the best results from a fine-tuned model.☆12Dec 24, 2022Updated 3 years ago
- Data generator for stereo sound event localization and detection task of DCASE 2025 challenge☆14Jul 17, 2025Updated 7 months ago
- ☆13Dec 22, 2023Updated 2 years ago
- This repository follows the LaTeX tutorial by [권현우](https://www.youtube.com/watch?v=V1Q6vEuoAQ0&list=PLSS68lwkeqyOH6KEHpCAmCWVSSKbciz3A).☆11Jul 16, 2018Updated 7 years ago
- This repository reimplements a variety of published GANs☆11Dec 4, 2018Updated 7 years ago