shkim816/acnn_speaker_recog

acnn for text-independent speaker recognition

☆10

Alternatives and similar repositories for acnn_speaker_recog

Users who are interested in acnn_speaker_recog are comparing it to the repositories listed below.

shkim816 / temporal_dynamic_cnn
TDY-CNN for text-independent speaker verification
☆19 · Nov 7, 2022 · Updated 3 years ago
frednam93 / FilterAugSED
☆68 · Sep 13, 2024 · Updated last year
rishikksh20 / NU-Wave-pytorch
NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling
☆37 · May 25, 2021 · Updated 5 years ago
aispeech-lab / TinyWASE
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11 · Jun 28, 2021 · Updated 5 years ago
maum-ai / sane-tts
SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech
☆11 · Jun 30, 2023 · Updated 3 years ago
FantSun / Speechflow
Speechflow for emotion recognition related information decomposition
☆10 · Jul 27, 2021 · Updated 5 years ago
zds-potato / multilingual-phonetic-sv
☆10 · Dec 22, 2023 · Updated 2 years ago
maum-ai / wavegrad2
Unofficial Pytorch Implementation of WaveGrad2
☆111 · Aug 18, 2021 · Updated 4 years ago
revsic / torch-whisper-guided-vc
Torch implementation of Whisper-guided DDPM based Voice Conversion
☆49 · Mar 7, 2023 · Updated 3 years ago
Vaibhavs10 / dcase-2023-workshop
☆14 · Sep 20, 2023 · Updated 2 years ago
sholokhovalexey / online-speaker-clustering
[ICASSP'23] Online speaker clustering
☆18 · Feb 22, 2026 · Updated 5 months ago
CODEJIN / Speaker_Embedding_Torch
PyTorch based speaker embedding model
☆16 · Apr 13, 2024 · Updated 2 years ago
yuhangear / wenet-android
☆13 · Oct 27, 2021 · Updated 4 years ago
nishithbsk / ConflictPrediction
Predicting Political Instability and Social Conflicts Using Multimodal Data
☆10 · Jun 6, 2016 · Updated 10 years ago
thu-spmi / SPMILM
A SPMI Lab toolkit for language models.
☆11 · Apr 12, 2017 · Updated 9 years ago
ondrejklejch / acoustic_punctuation
NMT based punctuation prediction system using lexical and acoustic features.
☆14 · Mar 30, 2020 · Updated 6 years ago
ZeWang95 / ACDA
Adaptive Convolutions with Per-pixel Dynamic Filter Atom
☆27 · Sep 3, 2021 · Updated 4 years ago
junjun3518 / alias-free-torch
Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample
☆101 · Jul 26, 2022 · Updated 4 years ago
Ajyy / inverse_chinese_text_normalization
Applies inverse normalization to normalized Chinese text; specifically, it implements the inverse of chinese_text_normalization.
☆13 · Apr 7, 2021 · Updated 5 years ago
Audio-WestlakeU / RCT
This repo gives the code for the official implementation of RCT.
☆13 · Jun 28, 2022 · Updated 4 years ago
Derpimort / VGGVox-PyTorch
Implementing VGGVox for Speaker Identification on VoxCeleb1 dataset in PyTorch.
☆25 · Oct 15, 2020 · Updated 5 years ago
yoyolicoris / kazane
Simple sinc interpolation in PyTorch.
☆15 · Jul 8, 2023 · Updated 3 years ago
voidful / MMLM
Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra
☆16 · Dec 10, 2024 · Updated last year
alphacep / unimrcp-vosk-plugin
Open source cross-platform implementation of MRCP protocol
☆20 · Mar 3, 2022 · Updated 4 years ago
theolepage / ssl-for-slr
Collection of self-supervised models for speaker and language recognition tasks.
☆19 · Jan 18, 2022 · Updated 4 years ago
rhoposit / icassp2021
☆15 · May 8, 2021 · Updated 5 years ago
cvqluu / MTL-Speaker-Embeddings
Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…
☆26 · Oct 5, 2022 · Updated 3 years ago
khtee / text-classification-pytorch
Pytorch implementation of RNN, CNN, BiGRU and LSTM for text classification
☆10 · Apr 30, 2021 · Updated 5 years ago
dafyddg / RFA
Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…
☆17 · Apr 27, 2023 · Updated 3 years ago
JarbasAl / kaldi_spotter
wake word spotting with kaldi
☆19 · Dec 3, 2020 · Updated 5 years ago
qiujiali / lattice-rescore
☆16 · Jun 13, 2022 · Updated 4 years ago
zkzhang98 / CoinSeg
☆12 · Oct 24, 2023 · Updated 2 years ago
xf739645524 / omlsa_imcra_new_version
Complete implementation based on omlsa.m
☆14 · Nov 26, 2021 · Updated 4 years ago
facebookresearch / lst
Code for Latent Speech-Text Transformer (LST)
☆35 · Mar 12, 2026 · Updated 4 months ago
Lhx94As / PHO-LID
PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification
☆21 · Aug 24, 2023 · Updated 2 years ago
liyongze / lstm_speaker_verification
☆35 · Apr 8, 2019 · Updated 7 years ago
Labmem-Zhouyx / CDFSE_FastSpeech2
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…
☆86 · Dec 20, 2022 · Updated 3 years ago
VITA-Group / Data-Efficient-Scaling
[ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang
☆14 · Jan 4, 2024 · Updated 2 years ago
Verssae / flask-tacotron2-tts-web-app
flask+tornado based NVIDIA tacotron2+waveglow tts web app
☆28 · May 25, 2023 · Updated 3 years ago