kyutai-labs / sphnLinks
python bindings for symphonia/opus - read various audio formats from python and write opus files
☆64Updated last month
Alternatives and similar repositories for sphn
Users that are interested in sphn are comparing it to the libraries listed below
Sorting:
- Audio tokenization, in the fastest way possible!☆52Updated 9 months ago
- A small rust-based data loader☆24Updated 5 months ago
- Proof of concept for running moshi/hibiki using webrtc☆19Updated 3 months ago
- Rust crate for some audio utilities☆23Updated 2 months ago
- A simple, hackable text-to-speech system in PyTorch and MLX☆161Updated 3 months ago
- Open TTS models, built for streaming on the edge☆43Updated 2 months ago
- ☆62Updated 10 months ago
- Simple high-throughput inference library☆115Updated 3 weeks ago
- Collection of Open Source Speech Data☆157Updated 6 months ago
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆109Updated 2 weeks ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated this week
- ☆37Updated last month
- Joint speech-language model - respond directly to audio!☆30Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
- ☆103Updated last week
- Speaker Diarization with Transformers☆64Updated last year
- Experiments with BitNet inference on CPU☆55Updated last year
- Profile your CoreML models directly from Python 🐍☆27Updated 7 months ago
- Open-source and reproducible benchmarks for Speaker Diarization☆25Updated last month
- ☆26Updated 5 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆17Updated 6 months ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆15Updated 6 months ago
- ☆226Updated 2 months ago
- ☆84Updated last year
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…☆212Updated 2 weeks ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆98Updated 7 months ago
- ANE accelerated embedding models!☆17Updated 5 months ago
- Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fits. It handles checkpo…☆112Updated last year
- ☆48Updated 3 months ago
- NanoGPT (124M) quality in 2.67B tokens☆28Updated last month