theolepage/ssl-for-slr

Collection of self-supervised models for speaker and language recognition tasks.

☆19

Alternatives and similar repositories for ssl-for-slr

Users that are interested in ssl-for-slr are comparing it to the libraries listed below.

theolepage / prophecy
A tiny deep neural network framework developed from scratch in C++ and CUDA.
☆13 · Feb 18, 2021 · Updated 5 years ago
theolepage / sslsv
Toolkit for training and evaluating Self-Supervised Learning (SSL) frameworks for Speaker Verification (SV).
☆39 · Jun 25, 2026 · Updated last month
drguigui1 / participating-media-cloud-rendering
☆11 · Jul 27, 2021 · Updated 5 years ago
gzhu06 / Y-vector
Y-vector: Multiscale Waveform Encoder for Speaker Embedding
☆24 · Jul 16, 2024 · Updated 2 years ago
hyperion-ml / hyperion
Python toolkit for speech processing
☆72 · Updated this week
liyunlongaaa / AD-TUNING
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11 · Feb 23, 2024 · Updated 2 years ago
joonson / voxceleb_unsupervised
Augmentation adversarial training for self-supervised speaker recognition
☆77 · Aug 15, 2021 · Updated 4 years ago
xcmyz / Tacotron2-Pytorch
follow NVIDIA, simplify it and support data parallel.
☆13 · Sep 26, 2019 · Updated 6 years ago
msh9184 / contrastive-equilibrium-learning
☆21 · Apr 6, 2021 · Updated 5 years ago
mborsdorf / UniversalSpeakerExtraction
☆15 · Sep 6, 2021 · Updated 4 years ago
shkim816 / acnn_speaker_recog
acnn for text-independent speaker recognition
☆10 · Feb 8, 2022 · Updated 4 years ago
TaoRuijie / Loss-Gated-Learning
ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'
☆92 · May 29, 2023 · Updated 3 years ago
jefflai108 / pytorch-kaldi-neural-speaker-embeddings
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
☆136 · Jan 27, 2020 · Updated 6 years ago
Levent9 / Zero-shot-FaceVC
☆19 · Mar 2, 2024 · Updated 2 years ago
gaziduc / space-war
Space War is a shoot'em up game where you pilot a spaceship and your goal is to destroy all the enemies ship.
☆12 · Apr 13, 2025 · Updated last year
serdarozsoy / corinfomax-ssl
PyTorch implementation of CorInfoMax
☆23 · Dec 26, 2022 · Updated 3 years ago
qinxiaoyi / Simple-Attention-Module-based-Speaker-Verification-with-Iterative-Noisy-Label-Detection
☆12 · Jun 14, 2022 · Updated 4 years ago
sathibault / computer_architecture_class
Resources from my class on computer architecture design
☆10 · Apr 25, 2018 · Updated 8 years ago
seongmin-kye / CAP
Cross attentive pooling for speaker verification (IEEE SLT, 2021)
☆12 · Dec 14, 2020 · Updated 5 years ago
Kahsolt / TransTacoS-RetuneGAN
A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.
☆15 · May 25, 2022 · Updated 4 years ago
RookieJunChen / dns_mos_calculate
Code for calculate DNS_MOS.
☆43 · Dec 18, 2022 · Updated 3 years ago
NoneOfAllOfTheAbove / ocr
An Optical Character Recognition software based on a simple neural network created from scratch in C.
☆19 · Apr 5, 2019 · Updated 7 years ago
zyzisyz / mfa_conformer
☆160 · Jan 9, 2023 · Updated 3 years ago
chenllliang / CTDNN
MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition
☆11 · Dec 4, 2021 · Updated 4 years ago
vincenzo-scotti / ITAcotron_2
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
☆10 · Oct 3, 2022 · Updated 3 years ago
SiD3W4y / zelda-alttp-re
Tools and documentation about 'A Link to the Past' (GBA) internals.
☆11 · Mar 15, 2020 · Updated 6 years ago
KunZhou9646 / controllable_evc_code
This is the code for controllable EVC framework for seen and unseen emotion generation.
☆45 · Nov 3, 2021 · Updated 4 years ago
b04901014 / FG-transformer-TTS
Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.
☆90 · Mar 5, 2022 · Updated 4 years ago
ranchlai / speaker-verification
Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN
☆97 · Sep 15, 2021 · Updated 4 years ago
msaroufim / ML-devops
Helper scripts I use to run many experiments in the morning to check at night
☆20 · Jun 14, 2021 · Updated 5 years ago
liyongze / lstm_speaker_verification
☆35 · Apr 8, 2019 · Updated 7 years ago
duyichao / E2E-ST-TDA
Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"
☆17 · Dec 23, 2021 · Updated 4 years ago
PoCInnovation / Deep-PoC
Deep-PoC is a deepFake detection tool designed to detect deepfakes from videos or images using artificial intelligence.
☆14 · Sep 23, 2021 · Updated 4 years ago
JonnyKong / Udemy-ShellScripting
Udemy: Shell Scripting and Command Line Tasks
☆12 · Mar 13, 2018 · Updated 8 years ago
atomicoo / Tacotron2-PyTorch
PyTorch implementation of Tacotron-2.
☆14 · May 17, 2021 · Updated 5 years ago
quanghuy0497 / The_PhD_Guidance
Reasons, motivations, and resources that help pursuing the doctoral degree in Computer Science
☆15 · Mar 5, 2022 · Updated 4 years ago
superannotateai / model-deployment-tutorials
☆14 · Jun 8, 2021 · Updated 5 years ago
bsxfan / PYLLR
Python toolkit for likelihood-ratio calibration of binary classifiers
☆25 · Feb 21, 2023 · Updated 3 years ago
Jiang-Yidi / TS-TalkNet
INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues
☆61 · May 29, 2023 · Updated 3 years ago