cwang621 / blsp
BLSP: Bootstrapping Langauge-Speech Pre-training via Behavior Alignment of Continuation Writing
☆42Updated 6 months ago
Related projects: ⓘ
- ☆26Updated 2 years ago
- Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)☆25Updated last year
- code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)☆61Updated 2 years ago
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆36Updated last year
- Code repository for the Cantonese In-car Audio-Visual Speech Recognition (CI-AVSR) dataset.☆37Updated 2 months ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated 10 months ago
- Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"☆11Updated 2 years ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆32Updated last year
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆23Updated last year
- 《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm☆80Updated 11 months ago
- AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension☆34Updated last month
- End-to-end Speech Translation☆35Updated 3 years ago
- The open source code for LLM-Codec☆106Updated last month
- End-to-end Speech Translation with Stacked Acoustic-and-Textual Encoding☆23Updated 3 years ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆21Updated 6 months ago
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆30Updated 3 years ago
- **Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…☆97Updated last year
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆40Updated last year
- ☆15Updated 2 years ago
- ☆130Updated 2 months ago
- SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words☆34Updated 2 months ago
- The project for speech translation☆11Updated 11 months ago
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆66Updated last year
- ☆69Updated this week
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.☆21Updated last year
- The dataset and baseline code for Text-to-Audio Grounding (TAG)☆37Updated last month
- A fast speech-to-any translation model that supports simultaneous decoding and offers 28× speedup.☆60Updated last month
- ☆15Updated 7 months ago
- End-to-End Speech Processing Toolkit☆11Updated last month
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)☆127Updated last year