A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.
☆30Mar 14, 2025Updated last year
Alternatives and similar repositories for allophant
Users that are interested in allophant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Dec 22, 2023Updated 2 years ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆263May 9, 2022Updated 4 years ago
- phone inventory library☆17May 15, 2023Updated 3 years ago
- Universal multilingual automatic speech transcription into IPA☆80Feb 28, 2025Updated last year
- REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR☆15Dec 11, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Sep 19, 2022Updated 3 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆46May 12, 2023Updated 3 years ago
- Detect and remove or lower the volume of breathing in speech recordings.☆15May 14, 2025Updated last year
- A family of efficient speech models for multilingual phone recognition☆64Feb 12, 2026Updated 4 months ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 7 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated last year
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆21Nov 3, 2025Updated 7 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆21Mar 17, 2025Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated last year
- ☆46Mar 17, 2026Updated 3 months ago
- Training code and dataset cleasing with Sidon☆128Apr 24, 2026Updated last month
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Feb 18, 2024Updated 2 years ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆46Jun 4, 2026Updated 2 weeks ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆63Jul 1, 2025Updated 11 months ago
- A phoneme-allophone database for many languages☆53May 19, 2020Updated 6 years ago
- A neural speech codec based on discrete WavLM representations☆26Aug 28, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆52Jun 24, 2025Updated 11 months ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆174Jun 9, 2023Updated 3 years ago
- A collection of utilities for handling IPA phones.☆27Sep 24, 2023Updated 2 years ago
- ☆14Aug 19, 2024Updated last year
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Nov 14, 2023Updated 2 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 10 months ago
- ☆20Sep 20, 2024Updated last year
- Converts Mandarin Chinese pinyin notation to IPA (international phonetic alphabet) notation☆19Nov 28, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Vocal Tract Area Estimation by Gradient Descent☆39Jul 16, 2023Updated 2 years ago
- ☆29Sep 5, 2024Updated last year
- LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation …☆98Dec 28, 2024Updated last year
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆70Nov 1, 2024Updated last year
- Variable Bitrate Residual Vector Quantization for Audio Coding☆53May 1, 2025Updated last year
- Repository for multilingual speech data resources for native languages of Zambia.☆22Oct 9, 2024Updated last year
- ☆26Nov 2, 2022Updated 3 years ago