The official repo of "WhiStress: Enriching Transcriptions with Sentence Stress Detection" (Interspeech 2025)
☆37Jul 24, 2025Updated 8 months ago
Alternatives and similar repositories for WhiStress
Users that are interested in WhiStress are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TEAL: New Selection Strategy for Small Buffers in Experience Replay Class Incremental Learning☆17Jan 21, 2025Updated last year
- ☆20Mar 5, 2026Updated last month
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)☆49Aug 15, 2025Updated 8 months ago
- Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decou…☆15May 6, 2025Updated 11 months ago
- Crowdsourced and Automatic Speech Prominence Estimation☆26Apr 12, 2024Updated 2 years ago
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆27Mar 13, 2025Updated last year
- Text-to-Speech Benchmark☆23Apr 2, 2026Updated 2 weeks ago
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated 2 years ago
- ☆16Apr 2, 2025Updated last year
- ☆14Dec 1, 2025Updated 4 months ago
- In-car multi-channel speech transcription system of AISHELL-5.☆42Jun 9, 2025Updated 10 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"☆108Jun 12, 2025Updated 10 months ago
- ☆17Mar 1, 2024Updated 2 years ago
- This is the repository for "SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Recognition"☆16Oct 8, 2024Updated last year
- An official implementation of ProbeGen☆13Oct 20, 2024Updated last year
- ☆23Jun 24, 2024Updated last year
- Official PyTorch Implementation for the "Unsupervised Model Tree Heritage Recovery" paper (ICLR 2025).☆63Jul 1, 2025Updated 9 months ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…☆229Mar 14, 2026Updated last month
- Versatile Evaluation of Speech and Audio☆400Dec 9, 2025Updated 4 months ago
- ☆30Jan 22, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆11Mar 22, 2023Updated 3 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆38Feb 5, 2026Updated 2 months ago
- Source Code for Graph Anomaly Detection with Unsupervised GNNs (ICDM2022)☆12Oct 18, 2022Updated 3 years ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- A collection of papers related to speech model compression☆26Jul 31, 2023Updated 2 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆64Apr 29, 2021Updated 4 years ago
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆29Oct 9, 2025Updated 6 months ago
- TuneTables is a tabular classifier that implements prompt tuning for frozen prior-fitted networks.☆23Mar 31, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆15Nov 10, 2025Updated 5 months ago
- Code for AccentDB.☆23May 28, 2021Updated 4 years ago
- ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models☆42Nov 18, 2025Updated 4 months ago
- ☆17Jul 23, 2025Updated 8 months ago
- ☆17Jul 14, 2023Updated 2 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆150Jan 16, 2024Updated 2 years ago
- Universal multilingual automatic speech transcription into IPA☆77Feb 28, 2025Updated last year