py-lidbox/lidbox

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/py-lidbox/lidbox)

py-lidbox / lidbox

End-to-end spoken language identification out of the box.

☆48

Alternatives and similar repositories for lidbox

Users that are interested in lidbox are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Lhx94As / Awesome-Spoken-Language-Identification
View on GitHub
An awesome spoken LID repository. (Working in progress
☆109Apr 22, 2024Updated 2 years ago
HPI-DeepLearning / crnn-lid
View on GitHub
Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks
☆105Apr 11, 2018Updated 8 years ago
PiSchool / spoken-language-id
View on GitHub
Spoken Language Identification from Short Utterances
☆13Jul 6, 2022Updated 4 years ago
nipunmanral / Spoken-Language-Identification
View on GitHub
Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features
☆25Aug 2, 2024Updated last year
pedrocolon93 / ivectormatlabmsrit
View on GitHub
I-Vector Speaker recognition system implemented with MSRIT in matlab
☆15Jan 12, 2016Updated 10 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
vzxxbacq / PLDA
View on GitHub
This is a implementation of kaldi-plda.
☆15Jun 9, 2018Updated 8 years ago
Lhx94As / PHO-LID
View on GitHub
PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification
☆21Aug 24, 2023Updated 2 years ago
tomasz-oponowicz / spoken_language_identification
View on GitHub
Identify a spoken language using artificial intelligence (LID).
☆124Jul 10, 2018Updated 8 years ago
CatalinTiseanu / spoken-language-identification
View on GitHub
Winning 10,000$ submission for the Spoken Language Identification challenge on TopCoder
☆17Nov 28, 2017Updated 8 years ago
cadia-lvl / kaldi-speaker-diarization
View on GitHub
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
☆17Aug 12, 2024Updated last year
BUTSpeechFIT / ASR-hybrid-decoding
View on GitHub
☆17Nov 25, 2019Updated 6 years ago
burrmill / burrmill
View on GitHub
BurrMill core
☆22Nov 2, 2021Updated 4 years ago
dominickrei / MatchboxNet
View on GitHub
An implementation of MatchboxNet
☆13May 4, 2022Updated 4 years ago
kleinzcy / speech_signal_processing
View on GitHub
☆15Jul 15, 2019Updated 7 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
diff7 / tts-king
View on GitHub
a repository for trainabale tts multi speaker
☆14Nov 28, 2021Updated 4 years ago
hlt-bme-hu / hunspeech
View on GitHub
☆14Jan 24, 2017Updated 9 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
rajivpoddar / mmse-port
View on GitHub
MMSE STSA Speech enhancement
☆15Aug 24, 2015Updated 10 years ago
yogihbti / ccfdHMM
View on GitHub
Credit Card Fraud Detection using HMM ( Hidden Markow Model)
☆12Nov 2, 2017Updated 8 years ago
hyperion-ml / hyperion
View on GitHub
Python toolkit for speech processing
☆72Updated this week
open-speech / tf_kaldi_io
View on GitHub
A python package that make tensorflow be able to read "Kaldi" scp/ark in an elegant way. May kaldi user happy to enter tensorflow world.
☆40Nov 26, 2018Updated 7 years ago
cadia-lvl / punctuation-prediction
View on GitHub
Support tools for punctuation and boundary detection for ASR output.
☆55Dec 8, 2022Updated 3 years ago
pietz / language-recognition
View on GitHub
CNN to classify samples of voice recordings into the language that was spoken
☆44Apr 8, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ChristopherCarignan / audio2stl
View on GitHub
Converts an audio file to a 3D spectrogram and (optionally) saves as a stereolithography (STL) file for 3D printing
☆22Oct 31, 2021Updated 4 years ago
domcross / german-stt-evaluation
View on GitHub
Evaluation of STT models for german language
☆16Jan 22, 2022Updated 4 years ago
microsoft / NoAudioCaptioning
View on GitHub
Repository for "Training Audio Captioning Models without Audio"
☆10Sep 26, 2023Updated 2 years ago
janson9192 / autokws2021
View on GitHub
☆13Mar 25, 2021Updated 5 years ago
sumansamui / EMG_Signal_Classification
View on GitHub
This is an on-going project repository
☆15Feb 10, 2024Updated 2 years ago
izlandman / iVector
View on GitHub
introduction to iVectors with available speech data
☆11Mar 4, 2016Updated 10 years ago
EMRAI / emrai-synthetic-diarization-corpus
View on GitHub
☆22Sep 24, 2018Updated 7 years ago
christianvazquez7 / ivector
View on GitHub
☆17Dec 11, 2014Updated 11 years ago
PiotrTa / Huawei-Challenge-Speaker-Identification
View on GitHub
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.
☆36Oct 4, 2019Updated 6 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
uhh-lt / bbb-live-subtitles
View on GitHub
BBB plugin for automatic subtitles in conference calls
☆28Apr 14, 2022Updated 4 years ago
aiola-lab / drax
View on GitHub
Drax: Speech Recognition with Discrete Flow Matching
☆75Oct 15, 2025Updated 9 months ago
mrusci / ondevice-learning-kws
View on GitHub
Test Framework for few-shot open set KWS
☆45Nov 8, 2024Updated last year
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
tokheim / iVector
View on GitHub
☆19Jun 25, 2012Updated 14 years ago
stas6626 / IDRnd
View on GitHub
ID R&D Voice Antispoofing Challenge Solution
☆11Jul 27, 2019Updated 6 years ago
harvard-edge / multilingual_kws
View on GitHub
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
☆190Dec 6, 2024Updated last year