wenet-e2e / llm-papers
List of Large Lanugage Model Papers
☆55Updated last year
Related projects ⓘ
Alternatives and complementary repositories for llm-papers
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…☆61Updated 4 months ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆81Updated last week
- Implementation of Google's USM speech model in Pytorch☆25Updated last week
- one script for xls-r/xlsr/whisper fine-tuning☆39Updated last year
- Decoders from Kaldi using OpenFst☆26Updated 2 months ago
- video cut powered by AI☆25Updated 2 years ago
- Official Code for ParrotTTS☆43Updated last month
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆74Updated last year
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆26Updated last year
- Official release of StyleTalk dataset.☆57Updated 4 months ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆72Updated 5 months ago
- ☆22Updated 9 months ago
- ☆33Updated 2 years ago
- neural network based speaker embedder☆25Updated last year
- Prosodic Speech Segmentation with Transformers☆23Updated 8 months ago
- ConMamba for Automatic Speech Recognition☆44Updated 3 months ago
- Automatic Speech Recognition at the University of Edinburgh.☆17Updated 3 years ago
- ☆41Updated last year
- ☆20Updated 3 months ago
- ☆19Updated last year
- ☆31Updated 2 weeks ago
- ☆26Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆66Updated last year
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆44Updated last year
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆26Updated 8 months ago
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks☆50Updated 2 years ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆20Updated 3 months ago