bsxfan/meta-embeddings

Meta-embeddings are a probabilistic generalization of embeddings in machine learning.

☆23

Alternatives and similar repositories for meta-embeddings

Users that are interested in meta-embeddings are comparing it to the libraries listed below.

luferrer / DCA-PLDA
Discriminative Condition-Aware PLDA
☆46 · Jul 23, 2024 · Updated 2 years ago
sanphiee / MOT-sGPLDA-SRE14
Multiobjective Optimization Training of PLDA for Speaker Verification
☆10 · Jun 14, 2018 · Updated 8 years ago
joonson / voxsrc_2019
VoxSRC Challenge
☆31 · Jun 11, 2019 · Updated 7 years ago
lucasondel / amdtk
☆12 · Feb 26, 2018 · Updated 8 years ago
yinruiqing / change_detection
Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks
☆67 · Jul 14, 2020 · Updated 6 years ago
BirdVox / PCEN-SNR
Audio activity detector based on per-channel energy normalization (PCEN)
☆32 · Nov 16, 2018 · Updated 7 years ago
BUTSpeechFIT / x-vector-kaldi-tf
Tensorflow implementation of x-vector topology on top of Kaldi recipe
☆118 · Nov 5, 2019 · Updated 6 years ago
nttcslab-sp / torchain
WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
☆20 · Feb 20, 2019 · Updated 7 years ago
deepaudio / deepaudio-speaker
neural network based speaker embedder
☆24 · Jan 7, 2023 · Updated 3 years ago
yinruiqing / diarization_with_neural_approach
☆14 · Aug 9, 2018 · Updated 7 years ago
Lallapallooza / fast-audiomentations
⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
☆38 · May 8, 2026 · Updated 2 months ago
rakshithShetty / dnn-speech
This is the repository for the term project Speech Recognition using Deep Neural Networks for the course ELEC-E5510-Speech Recognition
☆12 · Dec 8, 2015 · Updated 10 years ago
deepvk / muse
🎵 muse: Music Separation
☆11 · Feb 14, 2024 · Updated 2 years ago
feerci / feerci
FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates
☆12 · Mar 13, 2024 · Updated 2 years ago
BUTSpeechFIT / DVBx
Discriminative Training of VBx Diarization
☆28 · Sep 23, 2024 · Updated last year
tzyll / kaldi
ASR cases for speech handbook at CSLT-THU, based on Kaldi toolkit and Thchs30 database, in egs/cslt_cases.
☆107 · Mar 12, 2021 · Updated 5 years ago
EMRAI / emrai-synthetic-diarization-corpus
☆22 · Sep 24, 2018 · Updated 7 years ago
i3thuan5 / FaNT
Filtering and Noise Adding Tool
☆29 · May 27, 2022 · Updated 4 years ago
asogaard / Wavenet
C++ package for learning optimal wavelet bases using a neural network approach.
☆14 · Dec 2, 2016 · Updated 9 years ago
rafaelvalle / asrgen
Attacking Speaker Recognition with Deep Generative Models
☆34 · Mar 24, 2023 · Updated 3 years ago
tstafylakis / Speaker-Embeddings-Correlation-Pooling
Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"
☆11 · Sep 20, 2021 · Updated 4 years ago
jefflai108 / Attentive-Filtering-Network
University of Edinburgh-Johns Hopkins University's system for ASVspoof 2017 Version 2.0 dataset.
☆50 · May 1, 2019 · Updated 7 years ago
kaituoxu / kaldi-ktnet1
Kaldi extended by Kaituo XU with new features in nnet1.
☆12 · Dec 16, 2018 · Updated 7 years ago
juanmc2005 / torch-plda
PyTorch implementation of PLDA as described in https://ravisoji.com/assets/papers/ioffe2006probabilistic.pdf
☆15 · Oct 16, 2020 · Updated 5 years ago
MiuLab / Spk-Dialogue
Speaker Role Contextual Model for Dialogues
☆15 · Sep 30, 2017 · Updated 8 years ago
qqueing / speaker_embedding-pytorch
"An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation
☆19 · Oct 8, 2018 · Updated 7 years ago
GrantL10 / My-Python-Codes-for-Acoustics
Basic Tools
☆13 · Dec 18, 2021 · Updated 4 years ago
Miffyli / asv-cm-reinforce
Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE
☆13 · Mar 31, 2021 · Updated 5 years ago
talhanai / kaldi-diar-latte
steps to perform text-based speaker diarization with kaldi toolkit
☆12 · Nov 2, 2018 · Updated 7 years ago
idnavid / speech_activity_detection
Unsupervised speech activity detection system.
☆11 · Jul 2, 2018 · Updated 8 years ago
mycrazycracy / speaker-embedding-with-phonetic-information
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
☆45 · Jul 10, 2019 · Updated 7 years ago
antklen / idrnd_antispoofing_solution
2nd place solution for ID R&D Voice Antispoofing Challenge
☆15 · Aug 22, 2019 · Updated 6 years ago
Idlak / Living-Audio-Dataset
A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …
☆43 · Aug 3, 2022 · Updated 3 years ago
lifeiteng / Rabbit
Explore Text-To-Speech
☆25 · Jun 22, 2018 · Updated 8 years ago
kamperh / globalphone_awe
Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.
☆11 · Nov 3, 2020 · Updated 5 years ago
qqueing / DeepSpeaker-pytorch
Speaker embedding (verification and recognition) using Pytorch
☆369 · Jul 24, 2020 · Updated 6 years ago
alpoktem / punkProse
Punctuation generation for speech transcripts using lexical and prosodic features
☆42 · Mar 5, 2019 · Updated 7 years ago
DanielMengLiu / DeepLip
deep-learning based audio-visual lip biometrics
☆15 · May 9, 2023 · Updated 3 years ago
dansoutner / LSTMLM
Simple LSTM language modelling toolkit
☆10 · Oct 21, 2022 · Updated 3 years ago