A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.
☆29Mar 14, 2025Updated last year
Alternatives and similar repositories for allophant
Users that are interested in allophant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Dec 22, 2023Updated 2 years ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆261May 9, 2022Updated 3 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- Universal multilingual automatic speech transcription into IPA☆78Feb 28, 2025Updated last year
- REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR☆15Dec 11, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Sep 19, 2022Updated 3 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆46May 12, 2023Updated 2 years ago
- Detect and remove or lower the volume of breathing in speech recordings.☆14May 14, 2025Updated 11 months ago
- A family of efficient speech models for multilingual phone recognition☆54Feb 12, 2026Updated 2 months ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 5 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆20Apr 10, 2025Updated last year
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆20Nov 3, 2025Updated 5 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆19Mar 17, 2025Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated last year
- ☆43Mar 17, 2026Updated last month
- Training code and dataset cleasing with Sidon☆97Mar 29, 2026Updated 2 weeks ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Feb 18, 2024Updated 2 years ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆45Mar 17, 2026Updated last month
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆62Jul 1, 2025Updated 9 months ago
- A phoneme-allophone database for many languages☆53May 19, 2020Updated 5 years ago
- ☆52Jun 24, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A neural speech codec based on discrete WavLM representations☆26Aug 28, 2024Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆174Jun 9, 2023Updated 2 years ago
- A collection of utilities for handling IPA phones.☆27Sep 24, 2023Updated 2 years ago
- ☆14Aug 19, 2024Updated last year
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Nov 14, 2023Updated 2 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 8 months ago
- ☆20Sep 20, 2024Updated last year
- Converts Mandarin Chinese pinyin notation to IPA (international phonetic alphabet) notation☆18Nov 28, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆69Nov 1, 2024Updated last year
- Vocal Tract Area Estimation by Gradient Descent☆39Jul 16, 2023Updated 2 years ago
- ☆28Sep 5, 2024Updated last year
- LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation …☆93Dec 28, 2024Updated last year
- Variable Bitrate Residual Vector Quantization for Audio Coding☆50May 1, 2025Updated 11 months ago
- Repository for multilingual speech data resources for native languages of Zambia.☆20Oct 9, 2024Updated last year
- ☆26Nov 2, 2022Updated 3 years ago