argmaxinc / SDBench

Open-source and reproducible benchmarks for Speaker Diarization

☆29

Alternatives and similar repositories for SDBench

Users who are interested in SDBench are comparing it to the libraries listed below.

indri-voice / audiotoken
Audio tokenization, in the fastest way possible!
☆52 Updated 10 months ago
fakerybakery / simpletts
A lightweight Python library for running TTS models with a unified API.
☆20 Updated 4 months ago
EndlessReform / smoltts
Open TTS models, built for streaming on the edge
☆43 Updated 4 months ago
knoriy / CLARA
☆62 Updated 11 months ago
Helw150 / levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
☆14 Updated last year
facebookresearch / MultiModalExplorer
Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…
☆27 Updated last year
clement-pages / gryannote
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
☆63 Updated last month
kyutai-labs / sphn
python bindings for symphonia/opus - read various audio formats from python and write opus files
☆64 Updated 2 months ago
thevoicecompany / gazelle-train
Joint speech-language model - respond directly to audio!
☆30 Updated last year
apple / ml-acn-embed
Acoustic Neighbor Embeddings
☆24 Updated 7 months ago
lucasnewman / vocos-mlx
Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX
☆21 Updated 8 months ago
efeslab / LiteASR
LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation
☆115 Updated last month
lucasnewman / nanospeech
A simple, hackable text-to-speech system in PyTorch and MLX
☆168 Updated 4 months ago
hlt-mt / mosel
Collection of Open Source Speech Data
☆159 Updated 8 months ago
thomwolf / sesame-explorations
☆29 Updated 2 months ago
kyutai-labs / dactory
☆41 Updated 2 months ago
JosefAlbers / e2tts-mlx
Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX
☆27 Updated 9 months ago
FL33TW00D / coremlprofiler
Profile your CoreML models directly from Python 🐍
☆28 Updated 9 months ago
Vaibhavs10 / translate-with-whisper
☆158 Updated 2 years ago
ariG23498 / quantized-diffusion-inference
Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs
☆38 Updated 8 months ago
huggingface / huggingface-inference-toolkit
Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.
☆82 Updated last week
nateraw / audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…
☆59 Updated last year
nivibilla / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆69 Updated last year
apple / pytorch-speech-features
☆85 Updated last year
plaggy / fast-whisper-server
ASR + diarization model server with speculative decoding
☆62 Updated last year
riccardomusmeci / mlx-image
mlx image models for Apple Silicon machines
☆82 Updated 3 months ago
erogol / BlaGPT
Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible…
☆67 Updated this week
sanchit-gandhi / seq2seq-speech
Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.
☆36 Updated 2 years ago
mogwai / nanodrz
Speaker Diarization with Transformers
☆68 Updated last month
lucidrains / light-recurrent-unit-pytorch
Implementation of a Light Recurrent Unit in Pytorch
☆48 Updated 9 months ago