kyutai-labs / yomikomiLinks
A small rust-based data loader
☆34Updated 2 months ago
Alternatives and similar repositories for yomikomi
Users that are interested in yomikomi are comparing it to the libraries listed below
Sorting:
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆75Updated last week
- Rust crate for some audio utilities☆26Updated 10 months ago
- Proof of concept for running moshi/hibiki using webrtc☆19Updated 10 months ago
- ☆46Updated 3 months ago
- 🤗 Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime☆112Updated 3 weeks ago
- A Fish Speech implementation in Rust, with Candle.rs☆106Updated 7 months ago
- ☆20Updated 3 months ago
- Modular Rust transformer/LLM library using Candle☆37Updated last year
- Blazing fast data loading with HuggingFace Dataset and Ray Data☆16Updated 2 years ago
- Audio tokenization, in the fastest way possible!☆53Updated last year
- Graph model execution API for Candle☆17Updated 5 months ago
- Inference engine for GLiNER models, in Rust☆81Updated last week
- Open TTS models, built for streaming on the edge☆44Updated 10 months ago
- Joint speech-language model - respond directly to audio!☆30Updated last year
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆146Updated 8 months ago
- A simple, hackable text-to-speech system in PyTorch and MLX☆184Updated 5 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆69Updated 2 months ago
- PyLate efficient inference engine☆69Updated last week
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆138Updated 3 months ago
- Open-source reproducible benchmarks from Argmax☆77Updated this week
- Collection of Open Source Speech Data☆164Updated 3 months ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…☆228Updated 8 months ago
- ☆158Updated last month
- implement llava using candle☆15Updated last year
- Datamodels for hugging face tokenizers☆86Updated 2 weeks ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆68Updated last month
- ☆90Updated 6 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆89Updated last week
- Speaker Diarization with Transformers☆69Updated 7 months ago
- ☆67Updated last month