gzhu06/Y-vector

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/gzhu06/Y-vector)

gzhu06 / Y-vector

Y-vector: Multiscale Waveform Encoder for Speaker Embedding

☆24

Alternatives and similar repositories for Y-vector

Users that are interested in Y-vector are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

theolepage / ssl-for-slr
View on GitHub
Collection of self-supervised models for speaker and language recognition tasks.
☆19Jan 18, 2022Updated 4 years ago
thuhcsi / icassp2021-emotion-tts
View on GitHub
Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/
☆34Mar 17, 2023Updated 3 years ago
shkim816 / temporal_dynamic_cnn
View on GitHub
TDY-CNN for text-independent speaker verification
☆19Nov 7, 2022Updated 3 years ago
alumae / torch-xvectors-wav
View on GitHub
☆22Jun 30, 2021Updated 5 years ago
cvqluu / MTL-Speaker-Embeddings
View on GitHub
Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…
☆26Oct 5, 2022Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
chenllliang / CTDNN
View on GitHub
MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition
☆11Dec 4, 2021Updated 4 years ago
miquelindia90 / DoubleAttentionSpeakerVerification
View on GitHub
Pytorch implemenation of the model proposed in the paper: Double Multi-Head Attention for Speaker Verification
☆19Jul 25, 2024Updated last year
zyzisyz / mfa_conformer
View on GitHub
☆160Jan 9, 2023Updated 3 years ago
wngh1187 / RawNeXt
View on GitHub
Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…
☆25Jun 22, 2022Updated 4 years ago
Takaaki-Saeki / simplified_neural_source_filter
View on GitHub
PyTorch implementation of simplified neural source filter model (s-nsf)
☆14Aug 4, 2021Updated 4 years ago
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
ranchlai / speaker-verification
View on GitHub
Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN
☆98Sep 15, 2021Updated 4 years ago
shkim816 / acnn_speaker_recog
View on GitHub
acnn for text-independent speaker recognition
☆10Feb 8, 2022Updated 4 years ago
Kahsolt / TransTacoS-RetuneGAN
View on GitHub
A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.
☆15May 25, 2022Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
yuyq96 / D-TDNN
View on GitHub
PyTorch implementation of Densely Connected Time Delay Neural Network
☆91May 4, 2023Updated 3 years ago
monglechap / fluenttts
View on GitHub
FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS
☆20Nov 15, 2022Updated 3 years ago
xcmyz / Tacotron2-Pytorch
View on GitHub
follow NVIDIA, simplify it and support data parallel.
☆13Sep 26, 2019Updated 6 years ago
groadabike / Kaldi-Dsing-task
View on GitHub
DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.
☆19Jul 9, 2026Updated 2 weeks ago
luferrer / DCA-PLDA
View on GitHub
Discriminative Condition-Aware PLDA
☆46Jul 23, 2024Updated 2 years ago
yzyouzhang / Audio_Research_in_US
View on GitHub
Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…
☆27Feb 27, 2026Updated 4 months ago
lwang114 / GraphUnsupASR
View on GitHub
☆10Apr 17, 2024Updated 2 years ago
jaehyeongAN / KoELECTRA-finetuned-sentiment-analysis
View on GitHub
Generalized Sentiment Classifier finetuned by KoELECTRA
☆11Nov 28, 2024Updated last year
prml-lab-speech-team / demo
View on GitHub
☆26Aug 8, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
jisang93 / VISinger
View on GitHub
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆20May 12, 2023Updated 3 years ago
georgid / AlignmentEvaluation
View on GitHub
Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…
☆18Oct 27, 2020Updated 5 years ago
liyongze / lstm_speaker_verification
View on GitHub
☆35Apr 8, 2019Updated 7 years ago
sigmedia / sp1ny
View on GitHub
☆10Aug 29, 2024Updated last year
wxqwinner / silero-vad-ncnn
View on GitHub
Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.
☆26Aug 21, 2024Updated last year
linjac / GenDARA
View on GitHub
☆13Jan 14, 2025Updated last year
chitralekha18 / lyrics-aligned-solo-singing-dataset
View on GitHub
☆15Sep 26, 2022Updated 3 years ago
vimalmanohar / old-kaldi-git
View on GitHub
This is not the official kaldi repository. It is better to fork https://github.com/kaldi-asr/kaldi or https://github.com/vimalmanohar/kal…
☆33Aug 6, 2015Updated 10 years ago
vtuber-plan / hifi-gan
View on GitHub
An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.
☆32Apr 10, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sarulab-speech / xvector_jtubespeech
View on GitHub
xvector model on jtubespeech
☆47Nov 5, 2023Updated 2 years ago
helloooideeeeea / RealTimeCutVADCXXLibrary
View on GitHub
C++ implementation of real-time Voice Activity Detection (VAD) using Silero models with ONNX Runtime and WebRTC Audio Processing. Provide…
☆14Feb 19, 2026Updated 5 months ago
atomicoo / Tacotron2-PyTorch
View on GitHub
PyTorch implementation of Tacotron-2. Tacotron-2 的 PyTorch 实现。
☆14May 17, 2021Updated 5 years ago
gerasimos / doc-rasa-on-m1
View on GitHub
Rasa on M1: installation guideline
☆14Jan 8, 2023Updated 3 years ago
kaistmm / voxceleb-disentangler
View on GitHub
[INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…
☆18Jul 23, 2024Updated 2 years ago
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
revsic / torch-whisper-guided-vc
View on GitHub
Torch implementation of Whisper-guided DDPM based Voice Conversion
☆49Mar 7, 2023Updated 3 years ago