SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on One GPU in a Day"
β229Mar 14, 2026Updated last week
Alternatives and similar repositories for slamkit
Users that are interested in slamkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official code for the SALMonπ£ benchmark (ICASSP 2025 - Oral)β49Aug 15, 2025Updated 7 months ago
- Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730β131Dec 8, 2023Updated 2 years ago
- Official implementation of "Dataset Size Recovery from LoRA Weights" paper.β34Jun 30, 2024Updated last year
- The official repo of the paper "StressTest: Can YOUR Speech LM Handle the Stress?"β20Jul 9, 2025Updated 8 months ago
- [AAAI 2025] Official Implementation for "Click2Mask: Local Editing with Dynamic Mask Generation" Paper.β20Jan 22, 2026Updated 2 months ago
- Official PyTorch Implementation for the "Unsupervised Model Tree Heritage Recovery" paper (ICLR 2025).β63Jul 1, 2025Updated 8 months ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11β¦β46Jul 2, 2024Updated last year
- β46Jul 7, 2025Updated 8 months ago
- Official PyTorch Implementation for the "Recovering the Pre-Fine-Tuning Weights of Generative Models" paper (ICML 2024).β85Apr 15, 2025Updated 11 months ago
- TEAL: New Selection Strategy for Small Buffers in Experience Replay Class Incremental Learningβ17Jan 21, 2025Updated last year
- Implementation of the paper Silent Killerβ25Mar 18, 2024Updated 2 years ago
- Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'β156Mar 24, 2025Updated last year
- Official PyTorch Implementation for the "Distilling Datasets Into Less Than One Image" paper.β39Jun 6, 2024Updated last year
- β20Mar 5, 2026Updated 2 weeks ago
- β19Jan 8, 2025Updated last year
- An official PyTorch implementation for CLIPPRβ30Jul 22, 2023Updated 2 years ago
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"β108Jun 12, 2025Updated 9 months ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Modelsβ61Jul 1, 2025Updated 8 months ago
- Real-time Speech-Text Foundation Model Toolkit (wip)β254Mar 26, 2025Updated 11 months ago
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Modelsβ35Oct 13, 2024Updated last year
- VoiceBench: Benchmarking LLM-Based Voice Assistantsβ340Mar 16, 2026Updated last week
- Annotatability, a method to identify meaningful patterns in single-cell genomics data through annotation-trainability analysis, which estβ¦β19Jun 23, 2025Updated 9 months ago
- The official repo of "WhiStress: Enriching Transcriptions with Sentence Stress Detection" (Interspeech 2025)β37Jul 24, 2025Updated 8 months ago
- β411Oct 3, 2025Updated 5 months ago
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024β16Nov 19, 2024Updated last year
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAMβ17Nov 7, 2024Updated last year
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language Mβ¦β20Jan 3, 2023Updated 3 years ago
- A spoken version of the textual story cloze benchmarkβ20Aug 6, 2023Updated 2 years ago
- Top papers related to LLM-based agent evaluationβ89Oct 21, 2025Updated 5 months ago
- An official implementation of ProbeGenβ13Oct 20, 2024Updated last year
- Low-latency ASR using SpeechBrain StreamingASR and torchaudio StreamReader.β18Apr 19, 2025Updated 11 months ago
- Official PyTorch Implementation for the "A Deep Inverse-Mapping Model for a Flapping Robotic Wing" Paper (ICLR 2025)β21Dec 16, 2025Updated 3 months ago
- Real-Time Deepfake Detection in the Real-Worldβ47Nov 30, 2024Updated last year
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995β79Dec 3, 2024Updated last year
- β38Apr 3, 2025Updated 11 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).β95Oct 8, 2025Updated 5 months ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterinβ¦β64May 19, 2023Updated 2 years ago
- We Speech Transcript based on LLM, in 300 lines of code.β185Jun 20, 2025Updated 9 months ago
- A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.β423Feb 12, 2026Updated last month