Berkeley-Speech-Group/sylber

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Berkeley-Speech-Group/sylber)

Berkeley-Speech-Group / sylber

Sylber: Syllabic Embedding Representation of Speech from Raw Audio

☆80

Alternatives and similar repositories for sylber

Users that are interested in sylber are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cheoljun95 / sdhubert
View on GitHub
☆27Dec 4, 2024Updated last year
Berkeley-Speech-Group / Speech-Articulatory-Coding
View on GitHub
☆65May 29, 2025Updated last year
AlanBaade / SyllableLM
View on GitHub
Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models
☆63Jul 1, 2025Updated last year
bovod-sjtu / HoliTok
View on GitHub
HoliTok:A Coutinuous Holistic Tokenization with Robust Dual Capabilities of Speech Generation and Understanding
☆39Jun 8, 2026Updated last month
zhai-lw / SQCodec
View on GitHub
A lightweight audio codec based on a single quantizer
☆72Aug 15, 2025Updated 11 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
WingZLeung / TTDS
View on GitHub
Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.
☆13Mar 15, 2025Updated last year
yangdongchao / ALMTokenizer
View on GitHub
The demo page for ALMTokenizer
☆59Apr 14, 2025Updated last year
hhguo / SoCodec
View on GitHub
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications
☆92Dec 20, 2024Updated last year
sony / soundctm
View on GitHub
Pytorch implementation of SoundCTM
☆101Mar 31, 2025Updated last year
mubtasimahasan / DM-Codec
View on GitHub
Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”
☆57Jun 1, 2025Updated last year
skinahan / DIVA_PyTorch
View on GitHub
Implementation of the DIVA model of speech acquisition and production using PyTorch
☆23Jan 18, 2023Updated 3 years ago
lwang114 / GraphUnsupASR
View on GitHub
☆10Apr 17, 2024Updated 2 years ago
ljuvela / SourceFilterNeuralFormants
View on GitHub
☆21Sep 20, 2024Updated last year
hyama5 / vae_align
View on GitHub
Alignment examples for Interspeech 2024
☆28Jul 5, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
andybi7676 / reborn-uasr
View on GitHub
REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR
☆15Dec 11, 2024Updated last year
yzGuu830 / efficient-speech-codec
View on GitHub
[EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers
☆126Mar 20, 2025Updated last year
ryota-komatsu / speaker_disentangled_hubert
View on GitHub
Official repository of the IEEE OJSP paper "Speaker-Disentangled Chunk-Wise Regression for Syllabic Tokenization"
☆46Updated this week
SonyResearch / VRVQ
View on GitHub
Variable Bitrate Residual Vector Quantization for Audio Coding
☆54May 1, 2025Updated last year
wavlab-speech / versa
View on GitHub
Versatile Evaluation of Speech and Audio
☆425Jul 21, 2026Updated last week
lonce / SPSI_Python
View on GitHub
Single Pass Spectrogram Inversion in a Jupyter Python notebook
☆34Aug 10, 2017Updated 8 years ago
JSALT-2022-SSL / superb-prosody
View on GitHub
☆31Jul 13, 2023Updated 3 years ago
exercise-book-yq / FreeCodec
View on GitHub
FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS
☆24Sep 9, 2024Updated last year
articulatory / articulatory
View on GitHub
Deep Articulatory Synthesis and Inversion
☆57Feb 14, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
Auroraaa86 / LCS-CTC
View on GitHub
For IEEE ASRU(2025)
☆15Jun 21, 2025Updated last year
zeyuxie29 / AudioTime
View on GitHub
☆39Jul 4, 2024Updated 2 years ago
kaistmm / fregrad
View on GitHub
[ICASSP 2024] Official code for FreGrad
☆35May 13, 2024Updated 2 years ago
Stability-AI / stable-codec
View on GitHub
A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.
☆437Jul 17, 2026Updated last week
Berkeley-Speech-Group / DysfluentWFST
View on GitHub
DysfluentWFST
☆19Nov 13, 2025Updated 8 months ago
ajd12342 / paraspeechcaps
View on GitHub
Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'
☆165Mar 26, 2026Updated 4 months ago
youngsheen / GPST
View on GitHub
[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer
☆70Nov 1, 2024Updated last year
ankitapasad / layerwise-analysis
View on GitHub
Layer-wise analysis of self-supervised pre-trained speech representations
☆135Oct 18, 2024Updated last year
aask1357 / hilcodec
View on GitHub
High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec
☆120Jun 23, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
facebookresearch / spidr
View on GitHub
This repository contains the training code from paper "SpidR Learning Fast and Stable Linguistic Units for Spoken Language Models Without…
☆57Updated this week
lucadellalib / focalcodec
View on GitHub
A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation
☆173Nov 30, 2025Updated 7 months ago
jasonppy / syllable-discovery
View on GitHub
Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model
☆35Aug 27, 2023Updated 2 years ago
iamycy / diffwave-sr
View on GitHub
☆87May 21, 2023Updated 3 years ago
X-LANCE / UniCATS-CTX-vec2wav
View on GitHub
[AAAI 2024] Code for CTX-vec2wav in UniCATS
☆130Jun 11, 2024Updated 2 years ago
Mddct / simple-tts
View on GitHub
（WIP）long form speech generatoins
☆30Apr 2, 2025Updated last year
apple / ml-omni-router-moe-asr
View on GitHub
☆18Oct 24, 2025Updated 9 months ago