wenet-e2e / llm-papersLinks
List of Large Lanugage Model Papers
☆59Updated 2 years ago
Alternatives and similar repositories for llm-papers
Users that are interested in llm-papers are comparing it to the libraries listed below
Sorting:
- Implementation of Google's USM speech model in Pytorch☆32Updated 2 weeks ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆77Updated 2 years ago
- ☆44Updated 2 years ago
- Official release of StyleTalk dataset.☆70Updated last year
- ☆104Updated 3 weeks ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆92Updated 8 months ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆38Updated last year
- Open Source Speech/Text Data on AI☆18Updated 3 years ago
- Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models☆42Updated 2 months ago
- We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction☆151Updated this week
- Automatic Speech Recognition at the University of Edinburgh.☆16Updated 4 years ago
- Decoders from Kaldi using OpenFst☆34Updated 2 months ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆80Updated last year
- Pre-trained grapheme-to-phoneme (G2P) models☆26Updated 4 years ago
- ☆25Updated 2 years ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆102Updated 5 months ago
- A ctc decoder for both online and offline asr model☆64Updated last year
- ☆29Updated 9 months ago
- video cut powered by AI☆25Updated 3 years ago
- ☆26Updated last month
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Updated 2 years ago
- ☆20Updated 2 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆73Updated 3 years ago
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…☆125Updated last year
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Updated last year
- A curated list of awesome papers on contextualizing E2E ASR outputs☆79Updated 2 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Updated 2 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆53Updated 2 months ago
- ☆14Updated last year
- ☆100Updated last month