alumae/torch-xvectors-wav

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alumae/torch-xvectors-wav)

alumae / torch-xvectors-wav

☆22

Alternatives and similar repositories for torch-xvectors-wav

Users that are interested in torch-xvectors-wav are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sveinnpalsson / sourceseparation
View on GitHub
☆12Oct 9, 2025Updated 9 months ago
alumae / voxlingua107_sb
View on GitHub
VoxLingua107 recipe for SpeechBrain
☆13Jul 3, 2021Updated 5 years ago
avryhof / speech_recognition
View on GitHub
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆13Mar 9, 2022Updated 4 years ago
desh2608 / pytorch-tdnn
View on GitHub
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
☆41Dec 18, 2020Updated 5 years ago
alumae / online_speaker_change_detector
View on GitHub
Online streaming speaker change detection model in Pytorch
☆44Apr 14, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
tommy-fox / streaming-source-separation
View on GitHub
Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.
☆21Dec 8, 2022Updated 3 years ago
asteroid-team / pytorch-pit
View on GitHub
Permutation invariant training in PyTorch
☆13Oct 2, 2020Updated 5 years ago
djmoffat / pyCompressor
View on GitHub
A python implementation of a traditional Dynamic Range Compressor
☆14Oct 30, 2020Updated 5 years ago
WangHelin1997 / Automatic_Speech_Annotator
View on GitHub
Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…
☆33Jun 14, 2024Updated 2 years ago
gzhu06 / Y-vector
View on GitHub
Y-vector: Multiscale Waveform Encoder for Speaker Embedding
☆24Jul 16, 2024Updated 2 years ago
BiSinger-SVS / BiSinger
View on GitHub
Bilingual Singing Voice Synthesis
☆18Mar 25, 2024Updated 2 years ago
onolab-tmu / code_2020ICASSP_five
View on GitHub
Fast Independent Vector Extraction: Code and data to reproduce the results from the paper.
☆25May 7, 2020Updated 6 years ago
SELMA-project / ml4audio
View on GitHub
audio, NLP, ML with huggingface, nvidia/nemo, speechbrain
☆11Sep 4, 2023Updated 2 years ago
SiddGururani / Pytorch-TDNN
View on GitHub
☆99Dec 20, 2017Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
yinruiqing / diarization_with_neural_approach
View on GitHub
☆14Aug 9, 2018Updated 7 years ago
bootphon / learnable-strf
View on GitHub
Learnable STRF, from Riad et al. 2021 JASA
☆13Aug 21, 2021Updated 4 years ago
BUTSpeechFIT / mt-asr-data-prep
View on GitHub
☆25Feb 26, 2026Updated 5 months ago
mikex86 / DeepSpeech-Java-Bindings
View on GitHub
Java Bindings for the C++ library DeepSpeech
☆10Jun 4, 2020Updated 6 years ago
42io / tflite_kws
View on GitHub
☆13May 1, 2026Updated 2 months ago
yuhangear / wenet-android
View on GitHub
☆13Oct 27, 2021Updated 4 years ago
nglehuy / ctc_decoders
View on GitHub
Baidu's CTC Decoders, including Greedy, Beam Search and Beam Search with KenLM Language Model
☆24Oct 28, 2023Updated 2 years ago
poleval / 2021-punctuation-restoration
View on GitHub
PolEval 2021 Task 1
☆15Jun 28, 2022Updated 4 years ago
jimmy-ren / lstm_speaker_naming_aaai16
View on GitHub
Code to demonstrate multimodal LSTM
☆34Sep 5, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
MTG / Podcastmix
View on GitHub
PodcastMix A dataset for separating music and speech in podcasts.
☆44Aug 20, 2024Updated last year
vinusankars / ESOLA
View on GitHub
Epoch-synchronous overlap-add (ESOLA) for time-and pitch-scale modification of speech signals.
☆23Jul 24, 2020Updated 6 years ago
iiscleap / NeuralPlda
View on GitHub
Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)
☆99Apr 20, 2020Updated 6 years ago
talker93 / oneMinTTS
View on GitHub
Launch your speech synthesis within one minute.
☆12May 6, 2024Updated 2 years ago
tencent-ailab / 3m-asr
View on GitHub
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
☆119Jun 22, 2022Updated 4 years ago
TartuNLP / tts_preprocess_et
View on GitHub
Estonian text-to-speech text normalization pipeline
☆14Dec 17, 2025Updated 7 months ago
yichen14 / FastAdaSP
View on GitHub
Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)
☆17Nov 14, 2024Updated last year
popcornell / MicRank
View on GitHub
MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.
☆22Apr 8, 2021Updated 5 years ago
alumae / sv_score_calibration
View on GitHub
Score calibration for speaker verification
☆25Dec 13, 2019Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Scarfmonster / HiFiPLN
View on GitHub
Multispeaker Community Vocoder Model for DiffSinger
☆39Aug 11, 2025Updated 11 months ago
DicioTeam / dicio-skill
View on GitHub
Assistance component base for Dicio assistant components
☆13Apr 23, 2026Updated 3 months ago
hanayashiki / AsrService
View on GitHub
asr service based on kaldi
☆17Dec 8, 2022Updated 3 years ago
taeyoun811 / Whisfusion
View on GitHub
Whisfusion: Parallel ASR Decoding via a Diffusion Transformer
☆31Aug 22, 2025Updated 11 months ago
etri / kmsav
View on GitHub
☆14Oct 25, 2024Updated last year
fgnt / mms_msg
View on GitHub
Multipurpose Multi Speaker Mixture Signal Generator
☆46Feb 6, 2025Updated last year
onolab-tmu / code_2020ICASSP_iss
View on GitHub
Code to reproduce the experiments in the paper "Fast and stable blind source separation with rank-1 updates" presented at ICASSP 2020.
☆22Apr 14, 2020Updated 6 years ago