openaudiolab / LLaST
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
☆25 Updated 10 months ago
Alternatives and similar repositories for LLaST
Users that are interested in LLaST are comparing it to the libraries listed below
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" ☆35 Updated last year
- Official release of StyleTalk dataset. ☆66 Updated 11 months ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation ☆25 Updated 6 months ago
- Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings) ☆28 Updated last year
- Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges. ☆45 Updated 2 weeks ago
- Code for ICML25 Paper "Overcoming Non-monotonicity in Transducer-based Streaming Generation" ☆11 Updated last month
- ☆25 Updated 2 years ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer ☆58 Updated 7 months ago
- Collection of scripts from mHuBERT-147. ☆27 Updated 7 months ago
- ☆36 Updated 2 years ago
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?". ☆24 Updated 11 months ago
- ☆13 Updated last year
- ☆18 Updated last year
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning ☆47 Updated last year
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization ☆92 Updated last month
- multilingual speech aligner ☆74 Updated last year
- An unofficial PyTorch implementation of Mix-Phoneme-Bert ☆39 Updated last year
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts". ☆13 Updated last year
- WavReward: Spoken Dialogue Models With Generalist Reward Evaluators ☆40 Updated last month
- ☆35 Updated last year
- ☆41 Updated 2 years ago
- A fast parallel implementation of RNN Transducer. ☆12 Updated 2 months ago
- "SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts" ☆73 Updated 2 years ago
- Repository containing the open source code of works published at the FBK MT unit. ☆46 Updated this week
- ☆32 Updated 11 months ago
- A spoken version of the textual story cloze benchmark ☆17 Updated last year
- Temporary anonymous version ☆22 Updated last year
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr… ☆19 Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated 2 years ago
- A TTS Trained on Universal Audio. ☆34 Updated 2 weeks ago