WWWWxp/Speech-Tokenizer-Papers

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WWWWxp/Speech-Tokenizer-Papers)

WWWWxp / Speech-Tokenizer-Papers

This repository collects papers related to Speech Tokenizer.

☆18

Alternatives and similar repositories for Speech-Tokenizer-Papers

Users that are interested in Speech-Tokenizer-Papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zjzser / TraceableSpeech
View on GitHub
TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking
☆21Apr 18, 2025Updated last year
FakeSoundData / FakeSound
View on GitHub
☆20Jul 19, 2024Updated 2 years ago
ItzJuny / AMSDF
View on GitHub
[T-IFS'24] Audio Multi-view Spoofing Detection Framework Based on Audio-Text-Emotion Correlations
☆31Jul 31, 2024Updated last year
xieyuankun / Codecfake
View on GitHub
This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".
☆76Dec 13, 2024Updated last year
ADDchallenge / CFAD
View on GitHub
CFAD: A Chinese Dataset for Fake Audio Detection
☆24Jul 3, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
xieyuankun / ALLM-ADD-FT-GRPO
View on GitHub
☆18Feb 6, 2026Updated 5 months ago
cyjie429 / RawBMamba
View on GitHub
☆31Jul 15, 2024Updated 2 years ago
xieyuankun / All-Type-ADD
View on GitHub
This is the repo of our work titled “Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception”
☆35Mar 31, 2026Updated 3 months ago
cyjie429 / RegO
View on GitHub
Region-Based Optimization in Continual Learning for Audio Deepfake Detection
☆14Dec 17, 2024Updated last year
Ming-er / Audio-Free-P-Tuning
View on GitHub
☆11Dec 28, 2023Updated 2 years ago
Cecile-hi / Regularized-Adaptive-Weight-Modification
View on GitHub
Continual Learning Method RAWM for ICML 2023
☆23Sep 26, 2024Updated last year
john852517791 / pytorch_lightning_FAD
View on GitHub
This is a general framework for fake audio detection using pytorch lightning
☆30Jul 24, 2025Updated last year
Yuto-Matsunaga / Prompt_Tuning_for_Audio_Deepfake_Detection
View on GitHub
☆13Nov 12, 2024Updated last year
y-ren16 / TiCodec
View on GitHub
☆81Aug 11, 2025Updated 11 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
xieyuankun / VITS-chinese-finetune
View on GitHub
语音合成VITS 纯中文微调
☆12Mar 15, 2023Updated 3 years ago
zjzser / WMCodec
View on GitHub
PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or…
☆18Jul 31, 2025Updated 11 months ago
kaistmm / AlignDiT
View on GitHub
[ACM MM 2025] AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation
☆24Oct 28, 2025Updated 9 months ago
HeCheng0625 / Diffusion-Speech-Tokenizer
View on GitHub
This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…
☆198Jan 25, 2026Updated 6 months ago
Liu-Tianchi / Nes2Net
View on GitHub
☆111Apr 4, 2026Updated 3 months ago
QiShanZhang / SLSforASVspoof-2021-DF
View on GitHub
Code for paper "Audio Deepfake Detection with Self-supervised XLS-R and SLS classifier
☆71Feb 7, 2025Updated last year
y-ren16 / OV-InstructTTS
View on GitHub
☆22Jan 27, 2026Updated 6 months ago
ucas-hao / qwen_audio_for_add
View on GitHub
[ACMMM2025] Official released code for ALLM4ADD
☆44Oct 30, 2025Updated 8 months ago
Ming-er / MGA-CLAP
View on GitHub
official implementation of MGA-CLAP (ACM MM 2024)
☆29Oct 25, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
roger-tseng / CodecFake
View on GitHub
A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024
☆22Jul 27, 2024Updated 2 years ago
ta012 / SSLAM
View on GitHub
[ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes
☆79Oct 8, 2025Updated 9 months ago
xieyuankun / FSD-Dataset
View on GitHub
This repository presents FSD dataset for song deepfake detection.
☆24Aug 18, 2025Updated 11 months ago
SVDDChallenge / CtrSVDD_Utils
View on GitHub
☆18Jan 10, 2024Updated 2 years ago
Ruiqi-Yan / URO-Bench
View on GitHub
Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models
☆55Sep 2, 2025Updated 10 months ago
xjchenGit / MTDVocaLiST
View on GitHub
Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).
☆29Apr 3, 2024Updated 2 years ago
chenjianyi / fastsag
View on GitHub
FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation
☆29Dec 19, 2024Updated last year
ZhikangNiu / A-DMA
View on GitHub
[INTERSPEECH 2025 Oral]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"
☆67Jun 16, 2025Updated last year
MuSAELab / AUDDT
View on GitHub
A toolkit for benchmarking on a wide variety of audio deepfake datasets.
☆35May 22, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
soham97 / ADIFF
View on GitHub
Explaining audio differences using language
☆16Feb 11, 2025Updated last year
piotrkawa / attack-agnostic-dataset
View on GitHub
Implementation of Attack Agnostic Dataset: Towards Generalization and Stabilization of Audio DeepFake Detection paper
☆62Aug 15, 2023Updated 2 years ago
KdaiP / DC-Speech-VAE
View on GitHub
5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs
☆57Nov 19, 2025Updated 8 months ago
youngsheen / GPST
View on GitHub
[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer
☆70Nov 1, 2024Updated last year
geomachine / geomachine
View on GitHub
Config files for my GitHub profile.
☆11Updated this week
asvspoof-challenge / asvspoof5
View on GitHub
☆75Sep 15, 2024Updated last year
apple-yinhan / Noise-robust-SED
View on GitHub
☆14Jan 2, 2025Updated last year