The official repo of "WhiStress: Enriching Transcriptions with Sentence Stress Detection" (Interspeech 2025)
☆37Jul 24, 2025Updated 9 months ago
Alternatives and similar repositories for WhiStress
Users that are interested in WhiStress are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TEAL: New Selection Strategy for Small Buffers in Experience Replay Class Incremental Learning☆17Jan 21, 2025Updated last year
- [AAAI 2025] Official Implementation for "Click2Mask: Local Editing with Dynamic Mask Generation" Paper.☆21Jan 22, 2026Updated 3 months ago
- ☆20Mar 5, 2026Updated 2 months ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)☆49Aug 15, 2025Updated 8 months ago
- Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decou…☆15May 6, 2025Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation☆26Apr 12, 2024Updated 2 years ago
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆27Mar 13, 2025Updated last year
- Text-to-Speech Benchmark☆24Apr 2, 2026Updated last month
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated 2 years ago
- ☆16Apr 2, 2025Updated last year
- In-car multi-channel speech transcription system of AISHELL-5.☆43Jun 9, 2025Updated 11 months ago
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"☆109Jun 12, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆17Mar 1, 2024Updated 2 years ago
- This is the repository for "SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Recognition"☆16Oct 8, 2024Updated last year
- An official implementation of ProbeGen☆13Oct 20, 2024Updated last year
- Official PyTorch Implementation for the "Unsupervised Model Tree Heritage Recovery" paper (ICLR 2025).☆63Jul 1, 2025Updated 10 months ago
- ☆23Jun 24, 2024Updated last year
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…☆230Mar 14, 2026Updated last month
- Versatile Evaluation of Speech and Audio☆401Updated this week
- ☆30Apr 29, 2026Updated last week
- ☆11Mar 22, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆12Jul 6, 2023Updated 2 years ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆39Feb 5, 2026Updated 3 months ago
- Source Code for Graph Anomaly Detection with Unsupervised GNNs (ICDM2022)☆12Oct 18, 2022Updated 3 years ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Official repo and evaluation implementation of KnowRecall and VisRecall☆10May 22, 2025Updated 11 months ago
- A collection of papers related to speech model compression☆26Jul 31, 2023Updated 2 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆64Apr 29, 2021Updated 5 years ago
- TuneTables is a tabular classifier that implements prompt tuning for frozen prior-fitted networks.☆24Mar 31, 2025Updated last year
- ☆15Nov 10, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for AccentDB.☆23May 28, 2021Updated 4 years ago
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆29Apr 20, 2026Updated 2 weeks ago
- ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models☆44Nov 18, 2025Updated 5 months ago
- ☆17Jul 23, 2025Updated 9 months ago
- ☆17Jul 14, 2023Updated 2 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆151Jan 16, 2024Updated 2 years ago
- Universal multilingual automatic speech transcription into IPA☆80Feb 28, 2025Updated last year