Collection of self-supervised models for speaker and language recognition tasks.
☆19Jan 18, 2022Updated 4 years ago
Alternatives and similar repositories for ssl-for-slr
Users that are interested in ssl-for-slr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A tiny deep neural network framework developed from scratch in C++ and CUDA.☆13Feb 18, 2021Updated 5 years ago
- Toolkit for training and evaluating Self-Supervised Learning (SSL) frameworks for Speaker Verification (SV).☆37Feb 12, 2026Updated last month
- ☆11Jul 27, 2021Updated 4 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆24Jul 16, 2024Updated last year
- Python toolkit for speech processing☆72Updated this week
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated 2 years ago
- Augmentation adversarial training for self-supervised speaker recognition☆78Aug 15, 2021Updated 4 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Sep 26, 2019Updated 6 years ago
- ☆21Apr 6, 2021Updated 4 years ago
- ☆15Sep 6, 2021Updated 4 years ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆92May 29, 2023Updated 2 years ago
- ☆19Mar 2, 2024Updated 2 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Jan 27, 2020Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆12Jun 14, 2022Updated 3 years ago
- PyTorch implementation of CorInfoMax☆23Dec 26, 2022Updated 3 years ago
- Resources from my class on computer architecture design☆10Apr 25, 2018Updated 7 years ago
- Cross attentive pooling for speaker verification (IEEE SLT, 2021)☆12Dec 14, 2020Updated 5 years ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15May 25, 2022Updated 3 years ago
- Code for calculate DNS_MOS.☆43Dec 18, 2022Updated 3 years ago
- ☆159Jan 9, 2023Updated 3 years ago
- An Optical Character Recognition software based on a simple neural network created from scratch in C.☆19Apr 5, 2019Updated 6 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆45Nov 3, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆97Sep 15, 2021Updated 4 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆90Mar 5, 2022Updated 4 years ago
- Helper scripts I use to run many experiments in the morning to check at night☆20Jun 14, 2021Updated 4 years ago
- ☆35Apr 8, 2019Updated 6 years ago
- A PyTorch 1.0 implementation of the convolutions described in SincNet☆33Jan 30, 2019Updated 7 years ago
- Udemy: Shell Scripting and Command Line Tasks☆12Mar 13, 2018Updated 8 years ago
- PyTorch implementation of Tacotron-2. Tacotron-2 的 PyTorch 实现。☆14May 17, 2021Updated 4 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆58May 29, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Python toolkit for likelihood-ratio calibration of binary classifiers☆25Feb 21, 2023Updated 3 years ago
- This my implementation of sphereface using Pytorch on MNIST☆10Apr 5, 2019Updated 6 years ago
- Official implementation of AAAI'2022 paper "Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement"☆17Dec 23, 2021Updated 4 years ago
- Unofficial PyTorch Implementation of StarGAN-ZSVC☆14Aug 5, 2021Updated 4 years ago
- In defence of metric learning for speaker recognition☆1,164Mar 26, 2024Updated 2 years ago
- Tools and documentation about 'A Link to the Past' (GBA) internals.☆11Mar 15, 2020Updated 6 years ago
- 🏆🏅 Repository for the GEB team's winning solutions in the IEEE Hybrid Energy Forecasting and Trading Competition (HEFTCom).☆28Oct 4, 2025Updated 5 months ago