argmaxinc / SDBench
Open-source and reproducible benchmarks for Speaker Diarization
☆20 · Updated last week
Alternatives and similar repositories for SDBench:
Users who are interested in SDBench are comparing it to the libraries listed below:
- ☆62 · Updated 9 months ago
- Find out why your CoreML model isn't running on the Neural Engine! ☆25 · Updated 10 months ago
- Profile your CoreML models directly from Python 🐍 ☆27 · Updated 6 months ago
- Audio tokenization, in the fastest way possible! ☆51 · Updated 8 months ago
- Open TTS models, built for streaming on the edge ☆39 · Updated last month
- [WIP] Transformer to embed Danbooru labelsets ☆13 · Updated last year
- Video+code lecture on building nanoGPT from scratch ☆65 · Updated 10 months ago
- Rust crate for some audio utilities ☆22 · Updated last month
- Implementation of https://arxiv.org/pdf/2312.09299 ☆20 · Updated 9 months ago
- Cog wrapper for collabora/WhisperSpeech ☆24 · Updated last year
- 🧩 AI Components as Building Blocks. ☆20 · Updated last week
- ☆26 · Updated 4 months ago
- Python bindings for symphonia/opus: read various audio formats from Python and write opus files ☆58 · Updated this week
- Thin wrapper around GGML to make life easier ☆24 · Updated this week
- An open-source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆96 · Updated last month
- Aana SDK is a powerful framework for building AI-enabled multimodal applications. ☆47 · Updated this week
- Notebooks and scripts that showcase running quantized diffusion models on consumer GPUs ☆38 · Updated 5 months ago
- A collection of optimizers for MLX ☆35 · Updated this week
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆104 · Updated 2 weeks ago
- Joint speech-language model: respond directly to audio! ☆30 · Updated 11 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆21 · Updated 8 months ago
- MLX support for the Open Neural Network Exchange (ONNX) ☆48 · Updated last year
- A lightweight Python library for running TTS models with a unified API. ☆17 · Updated 2 months ago
- MLX image models for Apple Silicon machines ☆78 · Updated last week
- A simple, hackable text-to-speech system in PyTorch and MLX ☆153 · Updated 2 months ago
- Public reports detailing responses to sets of prompts by Large Language Models. ☆30 · Updated 3 months ago
- Focused on fast experimentation and simplicity ☆71 · Updated 4 months ago
- ☆63 · Updated 7 months ago
- ☆60 · Updated 5 months ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX ☆17 · Updated 5 months ago