argmaxinc / SDBench
Open-source and reproducible benchmarks for Speaker Diarization
★27 · Updated last week
Alternatives and similar repositories for SDBench
Users who are interested in SDBench compare it to the libraries listed below.
- A lightweight Python library for running TTS models with a unified API. ★20 · Updated 4 months ago
- Profile your CoreML models directly from Python ★28 · Updated 8 months ago
- Open TTS models, built for streaming on the edge ★43 · Updated 3 months ago
- Joint speech-language model - respond directly to audio! ★30 · Updated last year
- ★38 · Updated last month
- Audio tokenization, in the fastest way possible! ★52 · Updated 10 months ago
- ★62 · Updated 11 months ago
- ★29 · Updated last month
- ★26 · Updated 6 months ago
- python bindings for symphonia/opus - read various audio formats from python and write opus files ★64 · Updated last month
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ★78 · Updated 2 weeks ago
- A simple, hackable text-to-speech system in PyTorch and MLX ★166 · Updated 4 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll… ★27 · Updated last year
- Find out why your CoreML model isn't running on the Neural Engine! ★25 · Updated last year
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX ★27 · Updated 8 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ★21 · Updated 10 months ago
- mlx image models for Apple Silicon machines ★81 · Updated 2 months ago
- Proof of concept for running moshi/hibiki using webrtc ★19 · Updated 3 months ago
- ★47 · Updated 4 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ★62 · Updated 3 weeks ago
- Using modal.com to process FineWeb-edu data ★20 · Updated 2 months ago
- Acoustic Neighbor Embeddings ★24 · Updated 6 months ago
- Rust crate for some audio utilities ★24 · Updated 3 months ago
- Google TPU optimizations for transformers models ★113 · Updated 5 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications. ★47 · Updated last week
- ★15 · Updated 3 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models ★80 · Updated last month
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation ★110 · Updated last month
- Trying to build an all in one speech-text language model - a bit like GPT-4o ★22 · Updated last year
- ★11 · Updated 2 months ago