wenet-e2e / llm-papersLinks
List of Large Lanugage Model Papers
☆59Updated 2 years ago
Alternatives and similar repositories for llm-papers
Users that are interested in llm-papers are comparing it to the libraries listed below
Sorting:
- Implementation of Google's USM speech model in Pytorch☆32Updated 3 months ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Updated 4 months ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆73Updated 2 years ago
- Decoders from Kaldi using OpenFst☆30Updated 2 weeks ago
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks☆54Updated 3 years ago
- neural network based speaker embedder☆25Updated 2 years ago
- Awesome TTS☆59Updated 3 years ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- video cut powered by AI☆25Updated 2 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…☆19Updated 9 months ago
- ☆44Updated last year
- ☆25Updated 2 years ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆76Updated last year
- A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp/pp.☆100Updated this week
- ☆20Updated 2 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆42Updated 2 years ago
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM.☆37Updated last year
- An effort to track benchmarking results over widely-used datasets for ASR.☆46Updated 3 years ago
- Pre-trained grapheme-to-phoneme (G2P) models☆25Updated 3 years ago
- Official Code for ParrotTTS☆52Updated 9 months ago
- asr2k☆51Updated last year
- ☆28Updated 5 months ago
- ☆31Updated 3 months ago
- ☆33Updated 3 years ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Updated last year
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆92Updated last month
- Automatic Speech Recognition at the University of Edinburgh.☆16Updated 4 years ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- ☆29Updated last week