☆34Jun 15, 2021Updated 4 years ago
Alternatives and similar repositories for Phoneme-BERT
Users that are interested in Phoneme-BERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- This setup allows to train end-to-end neural models for spoken language understanding (SLU).☆11Jun 12, 2023Updated 2 years ago
- ☆20Sep 2, 2024Updated last year
- ☆11Oct 24, 2022Updated 3 years ago
- Deep Articulatory Synthesis and Inversion☆55Feb 14, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Source code to "SliTraNet: Automatic Detection of Slide Transitions in Lecture Videos using Convolutional Neural Networks"☆10Dec 17, 2023Updated 2 years ago
- ☆13Oct 11, 2024Updated last year
- Speech understanding system training toolkit, including tasks of ASR, SSL, LM, etc.☆11Feb 12, 2026Updated last month
- ☆26Sep 22, 2022Updated 3 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 9 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- ☆17Jul 22, 2024Updated last year
- ☆11Mar 22, 2023Updated 3 years ago
- Hierarchical Context Tagger for utterance rewriting☆13Mar 27, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This repository contains the VLEngagement dataset and the helper functions/ tools required to work with the dataset.☆16Dec 3, 2021Updated 4 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆48Mar 25, 2022Updated 4 years ago
- ☆10Oct 16, 2025Updated 5 months ago
- ☆19Mar 22, 2024Updated 2 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆19Jun 5, 2023Updated 2 years ago
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆35Oct 23, 2025Updated 5 months ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12May 13, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Jan 17, 2024Updated 2 years ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated 11 months ago
- ☆25Apr 24, 2019Updated 6 years ago
- ☆19Feb 2, 2023Updated 3 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆46May 12, 2023Updated 2 years ago
- 4G GPU & 10 Minutes for train☆12Aug 9, 2023Updated 2 years ago
- ☆11Aug 26, 2021Updated 4 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- Active noise controller (ANC) design: a practical primer☆13Jan 8, 2026Updated 2 months ago
- Pre-train Embedding in LightFM Recommender System Framework☆11Apr 28, 2019Updated 6 years ago
- High-performance, semantic turn detection for conversational AI☆36Oct 1, 2025Updated 5 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- ☆11Nov 5, 2021Updated 4 years ago
- VOICEVOX ENGINE、VOICEVOX NEMO ENGINE、COEIROINK用コ マンドラインクライアント。複数のエンジンを使用した並列処理もできます☆11May 4, 2024Updated last year