zkmkarlsruhe/language-identification

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zkmkarlsruhe/language-identification)

zkmkarlsruhe / language-identification

Spoken Language Identification on Common Voice and AudioSet using Deep Learning

☆42

Alternatives and similar repositories for language-identification

Users that are interested in language-identification are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Lhx94As / Awesome-Spoken-Language-Identification
View on GitHub
An awesome spoken LID repository. (Working in progress
☆109Apr 22, 2024Updated 2 years ago
AsoSoft / AsoSoft-Speech-Corpus
View on GitHub
AsoSoft Speech Corpus can be used for spoken language processing tasks in Central Kurdish such as speech recognition, speaker recognition…
☆10Mar 8, 2022Updated 4 years ago
alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
skit-ai / Map-Mix
View on GitHub
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…
☆18Feb 17, 2023Updated 3 years ago
Lhx94As / PHO-LID
View on GitHub
PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification
☆21Aug 24, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
vadimkantorov / inferspeech
View on GitHub
PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant
☆10Aug 12, 2019Updated 6 years ago
nipunmanral / Spoken-Language-Identification
View on GitHub
Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features
☆25Aug 2, 2024Updated last year
sarahjuan / iban
View on GitHub
☆14Jun 12, 2015Updated 11 years ago
SpeechFlow-io / Spoken_language_identification
View on GitHub
A TensorFlow-based spoken language identification
☆100Mar 22, 2023Updated 3 years ago
henryleu / go-vad
View on GitHub
golang vad (voice activity detection) library based on webrtc
☆12Dec 13, 2021Updated 4 years ago
charlesliucn / LanMIT
View on GitHub
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22Jul 12, 2019Updated 7 years ago
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
miguelballesteros / LSTM-punctuation
View on GitHub
☆11Feb 17, 2017Updated 9 years ago
coryshain / dnnseg
View on GitHub
☆11Mar 20, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
coqui-ai / stt-model-manager
View on GitHub
Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo
☆26Mar 24, 2023Updated 3 years ago
iot-salzburg / nearest-advocate
View on GitHub
A time delay estimation method for event-based time-series data. Time delay estimation is also known as the correction of time offsets an…
☆16Dec 3, 2025Updated 7 months ago
HPI-DeepLearning / crnn-lid
View on GitHub
Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks
☆105Apr 11, 2018Updated 8 years ago
parolteknologio / stt-esperanto
View on GitHub
Deepspeech/Coqui AI speech to text systems in Esperanto. - Parolrekoniloj en Esperanto uzante Deepspeech/Coqui Ai.
☆11Jan 11, 2022Updated 4 years ago
bagustris / s3prl-ser
View on GitHub
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15Feb 28, 2026Updated 5 months ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
mayukhnair / deepspeech-colab
View on GitHub
Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory
☆16Mar 18, 2019Updated 7 years ago
domcross / german-stt-evaluation
View on GitHub
Evaluation of STT models for german language
☆16Jan 22, 2022Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
MozillaFoundation / engineering-handbook
View on GitHub
Mozilla Foundation's Engineering Handbook
☆13Sep 20, 2021Updated 4 years ago
coqui-ai / inference-engine
View on GitHub
Coqui Inference Engine
☆41Aug 3, 2021Updated 4 years ago
rossellhayes / ipa
View on GitHub
🗣️ Convert between phonetic alphabets
☆11Feb 7, 2022Updated 4 years ago
mikex86 / DeepSpeech-Java-Bindings
View on GitHub
Java Bindings for the C++ library DeepSpeech
☆10Jun 4, 2020Updated 6 years ago
hbwu-ntu / EmoCtrlTTS-Eval
View on GitHub
☆19Aug 23, 2024Updated last year
CMsmartvoice / Unet-TTS
View on GitHub
One-shot TTS with Improved Unseen Speaker and Style Transfer
☆37Mar 2, 2022Updated 4 years ago
ductuantruong / speaker_age_estimation_ssl_study
View on GitHub
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Oct 19, 2022Updated 3 years ago
babe269 / performant
View on GitHub
A toolset for easy formant extraction and visualization from wav files and TTS models
☆33Sep 2, 2022Updated 3 years ago
sigmeta / g2p-kd
View on GitHub
Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion
☆20Jul 9, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HarikalarKutusu / 3d-voice-chess
View on GitHub
A voice driven 3D chess game for learning Voice AI
☆17Jul 6, 2022Updated 4 years ago
google-research-datasets / uninum
View on GitHub
A database of number names for 186 languages, locales, and scripts
☆67Mar 3, 2023Updated 3 years ago
common-voice / cv-sentence-extractor
View on GitHub
Scraping Wikipedia for fair use sentences
☆54Jan 25, 2024Updated 2 years ago
lab260ru / balalaika
View on GitHub
[INTERSPEECH 2026] Official code for "Balalaika: Data-Centric, Prosody-Aware Annotation Pipeline for Russian Speech"
☆21Jul 19, 2026Updated last week
VITA-Group / Audio-Lottery
View on GitHub
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆32Apr 8, 2022Updated 4 years ago
isca-sig-rosp / ISCA-SIG-RoSP
View on GitHub
Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)
☆11Dec 4, 2023Updated 2 years ago
global-asp / asp-source
View on GitHub
Source stories from the African Storybook Project in Markdown format
☆22Jan 25, 2026Updated 6 months ago