Wadaboa / titanetView external linksLinks
Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO
☆68Oct 30, 2022Updated 3 years ago
Alternatives and similar repositories for titanet
Users that are interested in titanet are comparing it to the libraries listed below
Sorting:
- ☆53Oct 17, 2023Updated 2 years ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆16Jun 16, 2024Updated last year
- Web UI for seamless interaction with various Computer Vision tasks, featuring highly configurable visual elements.☆14Mar 3, 2025Updated 11 months ago
- PyTorch implementation of Densely Connected Time Delay Neural Network☆90May 4, 2023Updated 2 years ago
- A curated list of awesome speaker recognition/verification papers, projects, datasets, and competition.☆15Aug 29, 2021Updated 4 years ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)☆787Apr 11, 2024Updated last year
- Paderbox: A collection of utilities for audio / speech processing☆43Jul 21, 2025Updated 6 months ago
- A PyTorch implementation of End-to-End Neural Diarization☆109Jun 19, 2023Updated 2 years ago
- 2nd place solution for ID R&D Voice Antispoofing Challenge☆15Aug 22, 2019Updated 6 years ago
- Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Aug 31, 2023Updated 2 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆79Oct 18, 2022Updated 3 years ago
- Official repository of NeXt-TDNN for speaker verification☆81Oct 10, 2024Updated last year
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆125Apr 8, 2022Updated 3 years ago
- ☆91Apr 24, 2025Updated 9 months ago
- In defence of metric learning for speaker recognition☆1,161Mar 26, 2024Updated last year
- Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit☆1,203Updated this week
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆22Jun 5, 2025Updated 8 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆93Oct 18, 2023Updated 2 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆320Nov 11, 2020Updated 5 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆50Oct 27, 2022Updated 3 years ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆166Dec 12, 2025Updated 2 months ago
- A Vietnamese phonetizer☆53May 29, 2024Updated last year
- ☆66Feb 8, 2024Updated 2 years ago
- 全国書誌データから作成した振り仮名のデータセット☆28Sep 21, 2021Updated 4 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆24Sep 27, 2020Updated 5 years ago
- Deep Speech Distances PyTorch☆29Feb 21, 2022Updated 3 years ago
- ☆21Apr 6, 2021Updated 4 years ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Dec 14, 2023Updated 2 years ago
- Spot the conversation: speaker diarisation in the wild☆157Jul 26, 2022Updated 3 years ago
- Sound source localization using SRP-PHAT☆25Feb 17, 2019Updated 6 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆439Aug 12, 2025Updated 6 months ago
- Score calibration for speaker verification☆26Dec 13, 2019Updated 6 years ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆66Jan 1, 2025Updated last year
- Diarization scoring tools.☆263Mar 28, 2023Updated 2 years ago
- A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.☆27Nov 18, 2021Updated 4 years ago
- Bluetooth plugin for Flutter☆10Dec 19, 2022Updated 3 years ago
- Few-Shot Keyword Spotting☆70Apr 11, 2021Updated 4 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- ☆30Jul 21, 2022Updated 3 years ago