Lhx94As/PHO-LID

PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification

☆21

Alternatives and similar repositories for PHO-LID

Users who are interested in PHO-LID are comparing it to the libraries listed below.

skit-ai / Map-Mix
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix (work accepted at I…
☆18 • Feb 17, 2023 • Updated 3 years ago
Lhx94As / Awesome-Spoken-Language-Identification
An awesome spoken LID repository. (Work in progress)
☆109 • Apr 22, 2024 • Updated 2 years ago
rhoposit / icassp2021
☆15 • May 8, 2021 • Updated 5 years ago
jreremy / conformer
PyTorch implementation of Conformer with a training script for end-to-end speech recognition on the LibriSpeech dataset.
☆29 • May 1, 2024 • Updated 2 years ago
py-lidbox / lidbox
End-to-end spoken language identification out of the box.
☆48 • Dec 13, 2020 • Updated 5 years ago
msh9184 / ska-tdnn
☆26 • Nov 2, 2022 • Updated 3 years ago
TakHemlata / T-EER
Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"
☆14 • Sep 25, 2023 • Updated 2 years ago
zds-potato / multilingual-phonetic-sv
☆10 • Dec 22, 2023 • Updated 2 years ago
qinxiaoyi / Simple-Attention-Module-based-Speaker-Verification-with-Iterative-Noisy-Label-Detection
☆12 • Jun 14, 2022 • Updated 4 years ago
fedderrico / ubm_map_diarization
Speaker diarization with GMM-UBM and MAP Adaptation
☆31 • Sep 13, 2018 • Updated 7 years ago
YChenL / DS-TDNN
Official implementation of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch
☆41 • Aug 31, 2023 • Updated 2 years ago
backspacetg / distilXLSR
Models and code for the INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13 • Mar 30, 2025 • Updated last year
the-bird-F / Expressive-Vectors
[ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis
☆40 • Dec 24, 2025 • Updated 7 months ago
zyzisyz / mfa_conformer
☆160 • Jan 9, 2023 • Updated 3 years ago
microsoft / NoAudioCaptioning
Repository for "Training Audio Captioning Models without Audio"
☆10 • Sep 26, 2023 • Updated 2 years ago
media-sec-lab / BAM
The PyTorch implementation of BAM for PartialSpoof audio localization.
☆33 • Aug 16, 2024 • Updated last year
msplabresearch / MSP-Podcast_Challenge_IS2025
MSP-Podcast Challenge Baseline Code for Interspeech 2025
☆29 • Dec 4, 2024 • Updated last year
shkim816 / acnn_speaker_recog
acnn for text-independent speaker recognition
☆10 • Feb 8, 2022 • Updated 4 years ago
IDRnD / redimnet
The official PyTorch implementation of the Interspeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"
☆205 • Jul 9, 2026 • Updated 2 weeks ago
hyperion-ml / hyperion
Python toolkit for speech processing
☆72 • Updated this week
SWivid / AUV
An All-in-One Speech, Sound, Music Codec with Single Nested Codebook
☆28 • Oct 11, 2025 • Updated 9 months ago
Choddeok / Affectron
[ACL 2026 Findings] Affectron: Emotional Speech Synthesis with Affective and Contextually Aligned Nonverbal Vocalizations
☆20 • Jul 16, 2026 • Updated last week
ECNU-Cross-Innovation-Lab / ENT
[ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition
☆28 • Apr 11, 2024 • Updated 2 years ago
zkmkarlsruhe / language-identification
Spoken Language Identification on Common Voice and AudioSet using Deep Learning
☆42 • Feb 4, 2026 • Updated 5 months ago
YMLLG / SPEECHFAKE
SpeechFake: A Large-Scale Multilingual Speech Deepfake Dataset Incorporating Cutting-Edge Generation Methods
☆27 • Aug 13, 2025 • Updated 11 months ago
Jungjee / INTERSPEECH2023_T6
Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning
☆23 • Aug 20, 2023 • Updated 2 years ago
Liu-Tianchi / Golden-Gemini-for-Speaker-Verification
Official release of pretrained models and code for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'
☆15 • Jan 20, 2025 • Updated last year
iot-salzburg / nearest-advocate
A time delay estimation method for event-based time-series data. Time delay estimation is also known as the correction of time offsets an…
☆16 • Dec 3, 2025 • Updated 7 months ago
emirdemirel / ASA_ICASSP2021
A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…
☆15 • Oct 13, 2022 • Updated 3 years ago
tarun360 / SpeakerProfiling
Estimating the age, height, and gender of a speaker from their speech signal.
☆15 • Sep 19, 2022 • Updated 3 years ago
haideraltahan / CLAR
☆18 • Apr 12, 2021 • Updated 5 years ago
LvHang / pitch
A standalone pitch extractor
☆13 • Oct 19, 2017 • Updated 8 years ago
ayh2bxa / realtime_nkf_aec
☆18 • Dec 27, 2023 • Updated 2 years ago
Lhx94As / E2E-language-diarization
Source code for the paper "End-to-End Language Diarization for Bilingual Code-switching Speech"
☆19 • Jan 23, 2022 • Updated 4 years ago
nipunmanral / Spoken-Language-Identification
Implements a GRU/LSTM model in Keras and trains it to classify languages using MFCC features
☆25 • Aug 2, 2024 • Updated last year
cgaroufis / MSCOL_SMC23
Code for reproducing the experiments and results of "Multi-Source Contrastive Learning from Musical Audio", accepted for publication in S…
☆17 • Nov 13, 2023 • Updated 2 years ago
rossellhayes / ipa
🗣️ Convert between phonetic alphabets
☆11 • Feb 7, 2022 • Updated 4 years ago
mutiann / speech_rankings
A CSRankings-like index for speech researchers
☆35 • Oct 16, 2024 • Updated last year
BakerBunker / SALT
[ASRU 2023] Code for the paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation
☆23 • Aug 13, 2024 • Updated last year