kyutai-labs / sphn
Python bindings for symphonia/opus - read various audio formats from Python and write opus files
☆56 Updated 2 weeks ago
Alternatives and similar repositories for sphn
Users who are interested in sphn are comparing it to the libraries listed below:
- A small rust-based data loader ☆24 Updated 4 months ago
- Audio tokenization, in the fastest way possible! ☆50 Updated 7 months ago
- Proof of concept for running moshi/hibiki using webrtc ☆18 Updated last month
- Open TTS models, built for streaming on the edge ☆39 Updated 3 weeks ago
- Rust crate for some audio utilities ☆22 Updated last month
- A simple, hackable text-to-speech system in PyTorch and MLX ☆147 Updated last month
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆64 Updated this week
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling" ☆16 Updated 5 months ago
- ANE accelerated embedding models! ☆16 Updated 4 months ago
- ☆26 Updated 3 months ago
- ☆86 Updated this week
- Tokun to can tokens ☆16 Updated this week
- Experiments with BitNet inference on CPU ☆53 Updated last year
- ☆47 Updated 2 months ago
- Profile your CoreML models directly from Python 🐍 ☆27 Updated 5 months ago
- NanoGPT (124M) quality in 2.67B tokens ☆28 Updated last month
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation ☆94 Updated last week
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/… ☆26 Updated 11 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ☆67 Updated this week
- Implementation of a Light Recurrent Unit in Pytorch ☆47 Updated 6 months ago
- Blazing fast data loading with HuggingFace Dataset and Ray Data ☆16 Updated last year
- [WIP] Transformer to embed Danbooru labelsets ☆13 Updated last year
- Find out why your CoreML model isn't running on the Neural Engine! ☆25 Updated 9 months ago
- Speaker Diarization with Transformers ☆64 Updated 10 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆93 Updated last month
- Collection of Open Source Speech Data ☆153 Updated 5 months ago
- Joint speech-language model - respond directly to audio! ☆30 Updated 10 months ago
- Load compute kernels from the Hub ☆113 Updated this week
- ☆39 Updated 2 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ☆35 Updated 2 years ago