SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on One GPU in a Day"
β230Mar 14, 2026Updated last month
Alternatives and similar repositories for slamkit
Users that are interested in slamkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official code for the SALMonπ£ benchmark (ICASSP 2025 - Oral)β49Aug 15, 2025Updated 8 months ago
- Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730β131Dec 8, 2023Updated 2 years ago
- Official implementation of "Dataset Size Recovery from LoRA Weights" paper.β34Jun 30, 2024Updated last year
- The official repo of the paper "StressTest: Can YOUR Speech LM Handle the Stress?"β20Jul 9, 2025Updated 9 months ago
- [AAAI 2025] Official Implementation for "Click2Mask: Local Editing with Dynamic Mask Generation" Paper.β21Jan 22, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official PyTorch Implementation for the "Unsupervised Model Tree Heritage Recovery" paper (ICLR 2025).β63Jul 1, 2025Updated 10 months ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11β¦β46Jul 2, 2024Updated last year
- β47Jul 7, 2025Updated 9 months ago
- Official PyTorch Implementation for the "Recovering the Pre-Fine-Tuning Weights of Generative Models" paper (ICML 2024).β86Apr 15, 2025Updated last year
- TEAL: New Selection Strategy for Small Buffers in Experience Replay Class Incremental Learningβ17Jan 21, 2025Updated last year
- Implementation of the paper Silent Killerβ25Mar 18, 2024Updated 2 years ago
- Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'β160Mar 26, 2026Updated last month
- Official PyTorch Implementation for the "Distilling Datasets Into Less Than One Image" paper.β39Jun 6, 2024Updated last year
- β20Mar 5, 2026Updated last month
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- β19Jan 8, 2025Updated last year
- An official PyTorch implementation for CLIPPRβ31Jul 22, 2023Updated 2 years ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Modelsβ63Jul 1, 2025Updated 10 months ago
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Modelsβ35Oct 13, 2024Updated last year
- Real-time Speech-Text Foundation Model Toolkit (wip)β257Mar 26, 2025Updated last year
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"β109Jun 12, 2025Updated 10 months ago
- [TACL'26] VoiceBench: Benchmarking LLM-Based Voice Assistantsβ356Apr 28, 2026Updated last week
- Annotatability, a method to identify meaningful patterns in single-cell genomics data through annotation-trainability analysis, which estβ¦β19Jun 23, 2025Updated 10 months ago
- The official repo of "WhiStress: Enriching Transcriptions with Sentence Stress Detection" (Interspeech 2025)β37Jul 24, 2025Updated 9 months ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024β16Nov 19, 2024Updated last year
- β436Oct 3, 2025Updated 7 months ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAMβ17Nov 7, 2024Updated last year
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language Mβ¦β20Jan 3, 2023Updated 3 years ago
- An official implementation of ProbeGenβ13Oct 20, 2024Updated last year
- Low-latency ASR using SpeechBrain StreamingASR and torchaudio StreamReader.β18Apr 19, 2025Updated last year
- Official PyTorch Implementation for the "A Deep Inverse-Mapping Model for a Flapping Robotic Wing" Paper (ICLR 2025)β21Dec 16, 2025Updated 4 months ago
- Real-Time Deepfake Detection in the Real-Worldβ47Nov 30, 2024Updated last year
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995β79Dec 3, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A spoken version of the textual story cloze benchmarkβ22Aug 6, 2023Updated 2 years ago
- Top papers related to LLM-based agent evaluationβ90Oct 21, 2025Updated 6 months ago
- β40Apr 3, 2025Updated last year
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).β97Oct 8, 2025Updated 6 months ago
- We Speech Transcript based on LLM, in 300 lines of code.β185Jun 20, 2025Updated 10 months ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterinβ¦β65May 19, 2023Updated 2 years ago
- A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.β429Feb 12, 2026Updated 2 months ago