Helw150 / levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
☆14 Updated 9 months ago
Alternatives and similar repositories for levanter:
Users who are interested in levanter are comparing it to the libraries listed below.
- Open TTS models, built for streaming on the edge ☆39 Updated 2 weeks ago
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model" ☆33 Updated last month
- GPT for FACodec ☆13 Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147. ☆17 Updated 4 months ago
- Collection of scripts from mHuBERT-147. ☆24 Updated 4 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated last year
- ☆35 Updated 11 months ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT" ☆37 Updated this week
- StyleTTS 2 Optimized Training Fork ☆26 Updated 2 months ago
- Interface Design for Self-Supervised Speech Models, accepted to Interspeech 2024 ☆16 Updated 4 months ago
- Audio tokenization, in the fastest way possible! ☆49 Updated 7 months ago
- ☆62 Updated 8 months ago
- Implementation of Google's USM speech model in PyTorch ☆30 Updated 2 months ago
- Dippy Synthetic Speech Subnet ☆16 Updated this week
- Pushing the Limits of Zero-shot End-to-End Speech Translation ☆25 Updated 3 months ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech ☆22 Updated 7 months ago
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral) ☆45 Updated 3 weeks ago
- ☆59 Updated last year
- My vocoder experiments ☆28 Updated 5 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition. ☆14 Updated 3 weeks ago
- ☆25 Updated 5 months ago
- A collection of all our phonemizers for dataset construction and inference ☆22 Updated last month
- GPT-style network for phonemization with durations of text ☆64 Updated last year
- AudioBERT 📢 : Audio Knowledge Augmented Language Model (ICASSP 2025) ☆40 Updated 2 months ago
- ☆39 Updated last month
- Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible… ☆20 Updated last week
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models ☆25 Updated 7 months ago
- ☆104 Updated this week
- ☆14 Updated last year
- ESLTTS dataset ☆16 Updated last month