wenet-e2e / llm-papersLinks
List of Large Lanugage Model Papers
☆60Updated 2 years ago
Alternatives and similar repositories for llm-papers
Users that are interested in llm-papers are comparing it to the libraries listed below
Sorting:
- Implementation of Google's USM speech model in Pytorch☆34Updated last week
- Official Code for ParrotTTS☆58Updated last year
- ☆46Updated 2 years ago
- Official release of StyleTalk dataset.☆72Updated last year
- Open Source Speech/Text Data on AI☆19Updated 3 years ago
- ☆21Updated 2 years ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆38Updated last year
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆77Updated 2 years ago
- ☆25Updated 2 years ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆110Updated 8 months ago
- Decoders from Kaldi using OpenFst☆34Updated 5 months ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆74Updated 3 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Updated 11 months ago
- Automatic Speech Recognition at the University of Edinburgh.☆16Updated 4 years ago
- LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances …☆85Updated 7 months ago
- ☆29Updated 11 months ago
- faster inference☆28Updated last year
- video cut powered by AI☆24Updated 3 years ago
- (WIP)long form speech generatoins☆31Updated 9 months ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆81Updated last year
- Pre-trained grapheme-to-phoneme (G2P) models☆26Updated 4 years ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Updated 2 years ago
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…☆20Updated last year
- A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.☆76Updated last year
- ☆114Updated 3 months ago
- ☆15Updated last year
- Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models☆49Updated 4 months ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Updated 2 years ago
- An unofficial PyTorch implementation of VALL-E☆88Updated 5 months ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆79Updated 7 months ago