wenet-e2e / llm-papersLinks
List of Large Lanugage Model Papers
☆59Updated 2 years ago
Alternatives and similar repositories for llm-papers
Users that are interested in llm-papers are comparing it to the libraries listed below
Sorting:
- Implementation of Google's USM speech model in Pytorch☆33Updated last month
- Official release of StyleTalk dataset.☆70Updated last year
- ☆25Updated 2 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆92Updated 9 months ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆77Updated 2 years ago
- Open Source Speech/Text Data on AI☆18Updated 3 years ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆104Updated 6 months ago
- Official Code for ParrotTTS☆58Updated last year
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆81Updated last year
- Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models☆45Updated 3 months ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆38Updated last year
- ☆108Updated last month
- Automatic Speech Recognition at the University of Edinburgh.☆16Updated 4 years ago
- (WIP)long form speech generatoins☆31Updated 8 months ago
- ☆44Updated 2 years ago
- Decoders from Kaldi using OpenFst☆34Updated 3 months ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆41Updated last year
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆73Updated 3 years ago
- Pre-trained grapheme-to-phoneme (G2P) models☆26Updated 4 years ago
- We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction☆158Updated 2 weeks ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆75Updated 4 months ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Updated 2 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆77Updated 5 months ago
- ☆56Updated 2 years ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆66Updated last year
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Updated 2 years ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆33Updated last year
- ☆15Updated last year
- ☆44Updated 5 years ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆34Updated 6 months ago