wenet-e2e / llm-papers
List of Large Language Model Papers
☆59Updated 2 years ago
Alternatives and similar repositories for llm-papers
Users that are interested in llm-papers are comparing it to the libraries listed below
- Implementation of Google's USM speech model in PyTorch☆31Updated 2 weeks ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆74Updated 2 years ago
- Official Code for ParrotTTS☆53Updated 9 months ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Updated 5 months ago
- Automatic Speech Recognition at the University of Edinburgh.☆16Updated 4 years ago
- ☆29Updated 6 months ago
- ☆25Updated 2 years ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆77Updated last year
- ☆56Updated 2 years ago
- Decoders from Kaldi using OpenFst☆31Updated last month
- A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp/pp.☆138Updated 3 weeks ago
- (WIP) Long-form speech generation☆31Updated 4 months ago
- Official release of StyleTalk dataset.☆67Updated last year
- ☆44Updated last year
- ☆20Updated 2 years ago
- ☆27Updated last month
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆32Updated last year
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆39Updated last year
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆96Updated 2 months ago
- Towards Comprehensive Benchmark for End-to-End Spoken Dialogue Models☆32Updated 4 months ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆66Updated 2 weeks ago
- ☆25Updated 9 months ago
- ☆85Updated last year
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆94Updated 8 months ago
- ☆22Updated 9 months ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆74Updated 2 years ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆19Updated last year
- Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.☆128Updated last week