ga642381 / SpeechGen
《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》
☆74Updated last year
Related projects: ⓘ
- Official release of StyleTalk dataset.☆53Updated 2 months ago
- The open source code for LLM-Codec☆106Updated last month
- ☆62Updated 8 months ago
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.☆56Updated last year
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆32Updated last year
- multilingual speech aligner☆70Updated 10 months ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆41Updated last year
- [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS☆60Updated 6 months ago
- 《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm☆80Updated 11 months ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated 10 months ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆28Updated 8 months ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated last year
- ConMamba for Automatic Speech Recognition☆38Updated last month
- Speech samples and code of BEdit-TTS☆32Updated 11 months ago
- ☆22Updated 2 months ago
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)☆127Updated last year
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆33Updated last week
- E2E TTS using Conditional Flow Matching (Experimental*)☆65Updated 10 months ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆81Updated last year
- ☆21Updated 6 months ago
- **Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…☆97Updated last year
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆146Updated 2 months ago
- ☆60Updated 2 years ago
- ☆69Updated this week
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆70Updated last year
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆58Updated 5 months ago
- ☆52Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- TTS Text Analyzer☆31Updated last year
- ☆23Updated this week