kyutai-labs/sphn

Python bindings for symphonia/opus: read various audio formats from Python and write Opus files

☆80

Alternatives and similar repositories for sphn

Users who are interested in sphn are comparing it to the libraries listed below.

kyutai-labs / yomikomi
A small rust-based data loader
☆37 · Jul 17, 2026 · Updated last week
LaurentMazare / glim
☆19 · Dec 31, 2025 · Updated 6 months ago
kyutai-labs / kaudio
Rust crate for some audio utilities
☆32 · Jun 17, 2026 · Updated last month
kyutai-labs / moshi-webrtc
Proof of concept for running moshi/hibiki using webrtc
☆21 · Feb 28, 2025 · Updated last year
p1an-lin-jung / wv_tts
☆19 · Mar 22, 2024 · Updated 2 years ago
kyutai-labs / jax-flash-attn3
JAX bindings for the flash-attention3 kernels
☆23 · Jan 2, 2026 · Updated 6 months ago
julien-c / trainer-proposal
☆13 · Mar 27, 2020 · Updated 6 years ago
huggingface / ember
ANE accelerated embedding models!
☆20 · Dec 11, 2024 · Updated last year
EricLBuehler / candle_graphs
Graph model execution API for Candle
☆18 · Jul 27, 2025 · Updated last year
kyutai-labs / moshi-finetune
☆475 · Oct 3, 2025 · Updated 9 months ago
gradium-ai / gradium-py
Python client for the Gradium Voice AI api.
☆32 · Updated this week
LaurentMazare / ug
Experimental compiler for deep learning models
☆75 · Sep 18, 2025 · Updated 10 months ago
thu-spmi / CTC-TTS
Code for CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment, Interspeech 2026.
☆20 · Jun 9, 2026 · Updated last month
alphacep / awesome-speech
Resources that make every language unique
☆32 · Jul 20, 2026 · Updated last week
kyutai-labs / moshivis
Kyutai with an "eye"
☆252 · Mar 26, 2025 · Updated last year
SpeechColab / PySpeechColab
A library of speech gadgets.
☆15 · Oct 15, 2022 · Updated 3 years ago
kyutai-labs / moshi-swift
☆141 · Jun 26, 2025 · Updated last year
NVIDIA / audio-intelligence
Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…
☆137 · Mar 3, 2026 · Updated 4 months ago
etzinis / biased_separation
Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation
☆14 · Nov 16, 2020 · Updated 5 years ago
LaurentMazare / syncarp
An async rpc implementation based on tokio and compatible with OCaml Async_rpc
☆11 · Feb 13, 2023 · Updated 3 years ago
declare-lab / HyperTTS
☆40 · Apr 15, 2024 · Updated 2 years ago
LaurentMazare / tboard-rs
Read and write tensorboard data using Rust
☆23 · Feb 4, 2024 · Updated 2 years ago
lucadellalib / focalcodec
A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation
☆173 · Nov 30, 2025 · Updated 7 months ago
JusperLee / Gull-Codec-Training
☆12 · Mar 11, 2025 · Updated last year
nvidia-riva / nemo2riva
NeMo -> Riva Conversion Tool
☆26 · Nov 17, 2025 · Updated 8 months ago
boris-kuz / jaxloudnorm
Jax implementation of a flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
☆13 · Jan 29, 2025 · Updated last year
zhenye234 / X-Codec-2.0
Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
☆361 · Jun 25, 2026 · Updated last month
dynilib / dynitag
Collaborative audio annotation tool
☆17 · Sep 16, 2022 · Updated 3 years ago
yukara-ikemiya / floss-torch
PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind
☆97 · Nov 24, 2025 · Updated 8 months ago
ShoukanLabs / VoPho
A collection of all our phonemizers for dataset construction and inference
☆30 · Feb 21, 2025 · Updated last year
areski / freeswitch_realtime
Push FreeSWITCH Realtime info to InfluxDB & PostgreSQL
☆15 · Jul 7, 2020 · Updated 6 years ago
facebookresearch / ears_dataset
Expressive Anechoic Recordings of Speech (EARS)
☆221 · Jun 25, 2024 · Updated 2 years ago
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26 · Aug 5, 2024 · Updated last year
LAION-AI / Desktop-BUD-E_V1.0
BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…
☆23 · Oct 10, 2024 · Updated last year
ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
☆16 · Mar 13, 2024 · Updated 2 years ago
LaurentMazare / gemm-metal
☆20 · Nov 19, 2024 · Updated last year
TUIlmenauAMS / FilterBanks_PythonKerasNeuralNetworkImplemention
Filter Bank Implementation as Convolutional Neural Network using Python Keras
☆17 · Dec 18, 2024 · Updated last year
sp-uhh / gen-se-demo
Diffusion-based Speech Enhancement: Demonstration of Performance and Generalization
☆14 · Dec 21, 2024 · Updated last year
dcaulley / av_diarization
AudioVisual Diarization - Supervised and Unsupervised
☆15 · Nov 22, 2022 · Updated 3 years ago