Wadaboa/titanet

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Wadaboa/titanet)

Wadaboa / titanet

Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO

☆69

Alternatives and similar repositories for titanet

Users that are interested in titanet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tango4j / llm_speaker_tagging
View on GitHub
SLT 2024 Challenge: Post-ASR-Speaker-Tagging
☆16Jun 16, 2024Updated 2 years ago
BUTSpeechFIT / AMI-diarization-setup
View on GitHub
☆54Oct 17, 2023Updated 2 years ago
yuyq96 / D-TDNN
View on GitHub
PyTorch implementation of Densely Connected Time Delay Neural Network
☆91May 4, 2023Updated 3 years ago
HaoFengyuan / EEND-IAAE
View on GitHub
The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…
☆11Aug 27, 2023Updated 2 years ago
Xflick / EEND_PyTorch
View on GitHub
A PyTorch implementation of End-to-End Neural Diarization
☆110Jun 19, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
tango4j / Auto-Tuning-Spectral-Clustering
View on GitHub
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
☆125Apr 8, 2022Updated 4 years ago
YChenL / DS-TDNN
View on GitHub
Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch
☆41Aug 31, 2023Updated 2 years ago
nii-yamagishilab / Attention_Backend_for_ASV
View on GitHub
Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances
☆50Oct 27, 2022Updated 3 years ago
1qh / StreamlitVision
View on GitHub
Web UI for seamless interaction with various Computer Vision tasks, featuring highly configurable visual elements.
☆13Mar 3, 2025Updated last year
manojpamk / pytorch_xvectors
View on GitHub
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
☆321Nov 11, 2020Updated 5 years ago
clovaai / voxceleb_trainer
View on GitHub
In defence of metric learning for speaker recognition
☆1,170Apr 22, 2026Updated 3 months ago
msh9184 / contrastive-equilibrium-learning
View on GitHub
☆21Apr 6, 2021Updated 5 years ago
jagabandhumishra / W2V-E2E-Language-Diarization
View on GitHub
☆11Sep 4, 2023Updated 2 years ago
v-nhandt21 / ViMFA
View on GitHub
Montreal Forced Aligner for Vietnamese
☆15Oct 23, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zabir-nabil / awesome-speaker-recognition-verification
View on GitHub
A curated list of awesome speaker recognition/verification papers, projects, datasets, and competition.
☆15Aug 29, 2021Updated 4 years ago
BUTSpeechFIT / EEND
View on GitHub
☆95Apr 24, 2025Updated last year
msh9184 / ska-tdnn
View on GitHub
☆26Nov 2, 2022Updated 3 years ago
alumae / sv_score_calibration
View on GitHub
Score calibration for speaker verification
☆25Dec 13, 2019Updated 6 years ago
BUTSpeechFIT / DiaPer
View on GitHub
☆69Feb 8, 2024Updated 2 years ago
nttcslab-sp / EEND-vector-clustering
View on GitHub
This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…
☆81Oct 18, 2022Updated 3 years ago
fgnt / paderbox
View on GitHub
Paderbox: A collection of utilities for audio / speech processing
☆43Jul 21, 2025Updated last year
joonson / voxconverse
View on GitHub
Spot the conversation: speaker diarisation in the wild
☆171Jul 26, 2022Updated 4 years ago
WWH98932 / Audio-Classification-Models
View on GitHub
Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.
☆24Sep 27, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
antklen / idrnd_antispoofing_solution
View on GitHub
2nd place solution for ID R&D Voice Antispoofing Challenge
☆15Aug 22, 2019Updated 6 years ago
ranchlai / speaker-verification
View on GitHub
Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN
☆97Sep 15, 2021Updated 4 years ago
koshian2 / inception-vae
View on GitHub
Variational Auto Encoder using Inception module in PyTorch
☆21Sep 19, 2018Updated 7 years ago
Audio-WestlakeU / FS-EEND
View on GitHub
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …
☆183May 7, 2026Updated 2 months ago
FrenchKrab / IS2023-powerset-diarization
View on GitHub
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
☆96Oct 18, 2023Updated 2 years ago
google / speaker-id
View on GitHub
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…
☆453Aug 12, 2025Updated 11 months ago
zhaoyi2 / xvector-cnceleb
View on GitHub
kaldi based x-vector trained on Cn-Celeb
☆13Sep 22, 2020Updated 5 years ago
wq2012 / SpectralCluster
View on GitHub
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
☆553Sep 25, 2024Updated last year
VinAIResearch / PhoST
View on GitHub
A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)
☆26Jun 5, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
wenet-e2e / wespeaker
View on GitHub
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
☆1,369Jul 8, 2026Updated 3 weeks ago
terry-yip / speech-to-text
View on GitHub
Speaker diarization and speech to text
☆14Dec 17, 2020Updated 5 years ago
nryant / dscore
View on GitHub
Diarization scoring tools.
☆268Apr 8, 2026Updated 3 months ago
tango4j / Python-Speaker-Diarization
View on GitHub
Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"
☆11Apr 6, 2020Updated 6 years ago
fsoft-ailab / Poem-Generator
View on GitHub
☆35Aug 27, 2021Updated 4 years ago
tranquyenbk173 / BERT_ITE
View on GitHub
Official implementation of "From Implicit to Explicit Feedback: A deep neural network for modeling sequential behaviours and long-short t…
☆19Oct 16, 2025Updated 9 months ago
kirbyj / vPhon
View on GitHub
A Vietnamese phonetizer
☆55May 29, 2024Updated 2 years ago