slp-rl / WhiStressView external linksLinks
The official repo of "WhiStress: Enriching Transcriptions with Sentence Stress Detection" (Interspeech 2025)
☆35Jul 24, 2025Updated 6 months ago
Alternatives and similar repositories for WhiStress
Users that are interested in WhiStress are comparing it to the libraries listed below
Sorting:
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- TEAL: New Selection Strategy for Small Buffers in Experience Replay Class Incremental Learning☆17Jan 21, 2025Updated last year
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated last year
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆26Mar 13, 2025Updated 11 months ago
- Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decou…☆14May 6, 2025Updated 9 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆14Jun 28, 2024Updated last year
- ☆19Nov 4, 2025Updated 3 months ago
- [AAAI 2025] Official Implementation for "Click2Mask: Local Editing with Dynamic Mask Generation" Paper.☆20Jan 22, 2026Updated 3 weeks ago
- ☆17Mar 1, 2024Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation☆24Apr 12, 2024Updated last year
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)☆48Aug 15, 2025Updated 6 months ago
- Text-to-Speech Latency Benchmark☆22Jan 16, 2026Updated 3 weeks ago
- ☆22Jun 24, 2024Updated last year
- Versatile Evaluation of Speech and Audio☆389Dec 9, 2025Updated 2 months ago
- Code for AccentDB.☆23May 28, 2021Updated 4 years ago
- Evaluate your agent memory on real-world dialogues, not LLM-simulated dialogues.☆36Jul 3, 2025Updated 7 months ago
- A collection of papers related to speech model compression☆26Jul 31, 2023Updated 2 years ago
- In-car multi-channel speech transcription system of AISHELL-5.☆40Jun 9, 2025Updated 8 months ago
- Universal multilingual automatic speech transcription into IPA☆75Feb 28, 2025Updated 11 months ago
- Controlled audio inpainting using SD-fine tuned model Riffusion in a ControlNet Architecture☆33May 31, 2023Updated 2 years ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆35Feb 5, 2026Updated last week
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Aug 29, 2023Updated 2 years ago
- ☆31Jul 13, 2023Updated 2 years ago
- Prepend universal audio attack segment to mute Whisper☆36Jan 22, 2025Updated last year
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily☆14Jan 6, 2025Updated last year
- ☆97Oct 16, 2025Updated 3 months ago
- ☆27Updated this week
- A pytorch implementation of D3Net.☆11Aug 8, 2021Updated 4 years ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆81Jun 7, 2024Updated last year
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Sep 1, 2023Updated 2 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆150Jan 16, 2024Updated 2 years ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆83Jan 7, 2023Updated 3 years ago
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)☆152Sep 14, 2023Updated 2 years ago
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆43Sep 7, 2025Updated 5 months ago
- Russian phonetical transcription☆11Nov 19, 2025Updated 2 months ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆39Mar 4, 2024Updated last year
- A benchmark dataset designed to support the development and evaluation of large language models (LLMs) for conversational mental health a…☆17Feb 24, 2025Updated 11 months ago