mechanicalsea/sugar

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mechanicalsea/sugar)

mechanicalsea / sugar

Efficient Speech Processing Tookit for Automatic Speaker Recognition

☆17

Alternatives and similar repositories for sugar

Users that are interested in sugar are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

flashlight / sequence
View on GitHub
Sequence algorithms for use in Flashlight.
☆14Jan 12, 2026Updated 6 months ago
daanzu / kaldi_ag_training
View on GitHub
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…
☆21Jan 24, 2022Updated 4 years ago
marytts / pavoque-data
View on GitHub
PAVOQUE Corpus of Expressive Speech
☆12Aug 2, 2016Updated 9 years ago
seongmin-kye / meta-SR
View on GitHub
Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)
☆73Sep 16, 2020Updated 5 years ago
dmlguq456 / NeXt_TDNN_ASV
View on GitHub
Official repository of NeXt-TDNN for speaker verification
☆84Oct 10, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
nishithbsk / ConflictPrediction
View on GitHub
Predicting Political Instability and Social Conflicts Using Multimodal Data
☆10Jun 6, 2016Updated 10 years ago
salesforce / speech-datasets
View on GitHub
Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea…
☆15Jun 25, 2026Updated last month
smallflyingpig / learning-to-fool-the-speaker-recognition
View on GitHub
code for paper "learning to fool the speaker recognition"
☆10Jun 12, 2020Updated 6 years ago
mct10 / CoBERT
View on GitHub
Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
☆48Nov 8, 2023Updated 2 years ago
yuhogun0908 / AEC
View on GitHub
Acoustic Echo Cancellation
☆14May 29, 2022Updated 4 years ago
s-nlp / parallel_detoxification_dataset
View on GitHub
Data from "Crowdsourcing of Parallel Corpora: the Case of Style Transfer for Detoxification" paper
☆14Apr 3, 2025Updated last year
shkim816 / acnn_speaker_recog
View on GitHub
acnn for text-independent speaker recognition
☆10Feb 8, 2022Updated 4 years ago
mechanicalsea / lighthubert
View on GitHub
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
☆73Sep 26, 2022Updated 3 years ago
TaoRuijie / Loss-Gated-Learning
View on GitHub
ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'
☆92May 29, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
VoxBlink2 / ScriptsForVoxBlink2
View on GitHub
Official Repository For VoxBlink2
☆88Aug 13, 2024Updated last year
narVidhai / Speech-Transcription-Benchmarking
View on GitHub
Example python scripts to evaluate various ASR methods
☆11Dec 22, 2021Updated 4 years ago
UniversalDataTool / udt-format
View on GitHub
A simple universal data description format for datasets, tailored for interfacing with humans.
☆25Feb 16, 2021Updated 5 years ago
pjones / nix-hs
View on GitHub
Haskell + nixpkgs = nix-hs
☆24Jun 2, 2021Updated 5 years ago
JunhoKim94 / ASR_project
View on GitHub
This repository created for the NHN ASR hackathon competition.
☆11Sep 20, 2023Updated 2 years ago
lpeterse / haskell-ssh
View on GitHub
An SSH implemenation in pure Haskell
☆17Feb 14, 2022Updated 4 years ago
vadimkantorov / tfcheckpoint2pytorch
View on GitHub
Converts TensorFlow checkpoints (with index, meta and data files) to PyTorch, HDF5 and JSON
☆18Feb 26, 2021Updated 5 years ago
VITA-Group / AutoSpeech
View on GitHub
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …
☆206Dec 8, 2022Updated 3 years ago
Snowdar / asv-subtools
View on GitHub
An Open Source Tools for Speaker Recognition
☆638Aug 5, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
clianor / voice-speaker-tensorflow
View on GitHub
책 읽어주는 딥러닝을 보고 나도 만들고 싶어져서 공부하며 만드는 repository입니다.
☆10Dec 8, 2022Updated 3 years ago
TaoRuijie / AVCleanse
View on GitHub
ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'
☆44Oct 31, 2022Updated 3 years ago
xiaoxiaomiao323 / MSA
View on GitHub
☆16Feb 19, 2026Updated 5 months ago
burrmill / burrmill
View on GitHub
BurrMill core
☆22Nov 2, 2021Updated 4 years ago
WiraDKP / pytorch_gru_speaker_diarization
View on GitHub
Speaker Diarization using GRU in PyTorch
☆11Aug 29, 2020Updated 5 years ago
TaoRuijie / ECAPA-TDNN
View on GitHub
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
☆823Apr 11, 2024Updated 2 years ago
pengzhendong / speaker-diarization
View on GitHub
Offline Speaker Diarization with SenseVoice by Sherpa ONNX.
☆15Dec 23, 2024Updated last year
aix64-main / LLMs
View on GitHub
Transformers, LLM, Prompt Engineering, In-Context Learning, RAG, SFT, RLHF
☆10Nov 23, 2024Updated last year
kaiidams / voice100
View on GitHub
Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregr…
☆28Nov 23, 2023Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
WangHelin1997 / LibriLightMix-WHAMR
View on GitHub
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17Nov 7, 2024Updated last year
vlievin / gan-experiments-pytorch
View on GitHub
Experiments with GAN, WGAN, WGAN-GP, DC-GAN, cGAN, AC,GAN and pix2pix
☆10May 28, 2019Updated 7 years ago
Akella17 / speaker-embedding
View on GitHub
A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack
☆10Feb 19, 2018Updated 8 years ago
valiakon / MultimodalAnalysis_SpeakerDiarization
View on GitHub
The project tries to solve a speaker diarization problem using audio features, face recognition and video feature extraction from face im…
☆16Feb 10, 2019Updated 7 years ago
Taeu / HeLP-Challenge-Goldenpass
View on GitHub
☆11Mar 12, 2019Updated 7 years ago
bergey / ghc-passes-graph
View on GitHub
☆26Nov 19, 2020Updated 5 years ago
hechmik / voxceleb_enrichment_age_gender
View on GitHub
Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021
☆73Dec 18, 2021Updated 4 years ago