wenet-e2e / llm-papers
List of Large Language Model Papers
☆59 Updated 2 years ago
Alternatives and similar repositories for llm-papers
Users who are interested in llm-papers are comparing it to the repositories listed below.
- Implementation of Google's USM speech model in PyTorch ☆31 Updated 3 weeks ago
- Official release of StyleTalk dataset. ☆69 Updated last year
- Open Source Speech/Text Data on AI ☆18 Updated 3 years ago
- Official Code for ParrotTTS ☆55 Updated 11 months ago
- "SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts" ☆76 Updated 2 years ago
- ☆25 Updated 2 years ago
- ☆44 Updated last year
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization ☆98 Updated 4 months ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva ☆92 Updated 7 months ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee… ☆79 Updated last year
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks ☆54 Updated 3 years ago
- ☆99 Updated this week
- Automatic Speech Recognition at the University of Edinburgh. ☆16 Updated 4 years ago
- Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models ☆37 Updated last month
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT ☆74 Updated 3 years ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer. ☆34 Updated last year
- We Speech Toolkit, LLM-based Speech Toolkit for Speech Understanding, Generation, and Interaction ☆52 Updated last week
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George… ☆19 Updated 11 months ago
- ☆33 Updated last year
- Decoders from Kaldi using OpenFst ☆32 Updated last month
- Video cutting powered by AI ☆25 Updated 2 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning ☆47 Updated last year
- (WIP) Long-form speech generation ☆31 Updated 6 months ago
- ☆44 Updated 4 years ago
- A fast parallel implementation of RNN Transducer. ☆12 Updated 5 months ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs ☆70 Updated 2 months ago
- LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances … ☆75 Updated 3 months ago
- ☆28 Updated 3 months ago
- A CTC decoder for both online and offline ASR models ☆64 Updated last year
- Pre-trained grapheme-to-phoneme (G2P) models ☆25 Updated 4 years ago