bsxfan / meta-embeddingsView external linksLinks
Meta-embeddings are a probabilistic generalization of embeddings in machine learning.
☆23Nov 23, 2018Updated 7 years ago
Alternatives and similar repositories for meta-embeddings
Users that are interested in meta-embeddings are comparing it to the libraries listed below
Sorting:
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 9 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation☆19Oct 8, 2018Updated 7 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Nov 8, 2018Updated 7 years ago
- Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE☆13Mar 31, 2021Updated 4 years ago
- Discriminative Condition-Aware PLDA☆45Jul 23, 2024Updated last year
- C++ package for learning optimal wavelet bases using a neural network approach.☆14Dec 2, 2016Updated 9 years ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆29Nov 16, 2018Updated 7 years ago
- VoxSRC Challenge☆31Jun 11, 2019Updated 6 years ago
- University of Edinbrugh-Johns Hopkins University's system for ASVspoof 2017 Version 2.0 dataset.☆50May 1, 2019Updated 6 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- speech engine training projects☆29Apr 19, 2021Updated 4 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- ☆106Mar 12, 2021Updated 4 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Dec 17, 2017Updated 8 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- neural network based speaker embedder☆25Jan 7, 2023Updated 3 years ago
- Tensorflow implementation of x-vector topology on top of Kaldi recipe☆120Nov 5, 2019Updated 6 years ago
- Audio source separation using CASA approaches in Python.☆11Apr 2, 2015Updated 10 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Jul 10, 2019Updated 6 years ago
- Speaker Role Contextual Model for Dialogues☆15Sep 30, 2017Updated 8 years ago
- Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"☆11Mar 24, 2023Updated 2 years ago
- NNSVS向けの教師データのラベル作成支援ツールです。☆10Apr 5, 2023Updated 2 years ago
- This ist the repository for the term project Speech Recognition using Deep Neural Networks for the course ELEC-E5510-Speech Recognition☆12Dec 8, 2015Updated 10 years ago
- Interspeech 2019 tutorial materials☆49Sep 26, 2019Updated 6 years ago
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates☆12Mar 13, 2024Updated last year
- Simple LSTM language modelling toolkit☆10Oct 21, 2022Updated 3 years ago
- Punctuation generation for speech transcripts using lexical and prosodic features☆42Mar 5, 2019Updated 6 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago