wenet-e2e/llm-papers

wenet-e2e / llm-papers

List of Large Language Model Papers

☆59

Alternatives and similar repositories for llm-papers

Users who are interested in llm-papers are comparing it to the repositories listed below.

wenet-e2e / nn-singal-processing-papers
View on GitHub
List of NN-based signal processing papers
☆23Jun 5, 2023Updated 3 years ago
wenet-e2e / wesignal
View on GitHub
Production first, nn-based on-device signal processing toolkit.
☆63May 30, 2023Updated 3 years ago
danpovey / conditional-flow-matching
View on GitHub
☆29Aug 8, 2024Updated last year
EdVince / whisper-trtllm
View on GitHub
Whisper in TensorRT-LLM
☆17Sep 21, 2023Updated 2 years ago
ahmetaa / kaldi-jni
View on GitHub
Experiment with JNI access to some Kaldi functions.
☆12Dec 31, 2018Updated 7 years ago
MrSupW / ICMC-ASR_Baseline
View on GitHub
The baseline system for the ICASSP2024 ICMC-ASR Challenge.
☆57Dec 6, 2023Updated 2 years ago
pengzhendong / audiolab
View on GitHub
A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)
☆39Mar 31, 2026Updated 3 months ago
chmod740 / SuperShare
View on GitHub
An Android app, built on the Mina framework, for fast face-to-face image transfer
☆11Jan 24, 2017Updated 9 years ago
naxingyu / kaldi_cvte_model_test
View on GitHub
This repo augments the scripts in CVTE model (http://kaldi-asr.org/models/m2)
☆15May 30, 2019Updated 7 years ago
backspacetg / distilXLSR
View on GitHub
Models and code for the INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13Mar 30, 2025Updated last year
liyunlongaaa / AD-TUNING
View on GitHub
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11Feb 23, 2024Updated 2 years ago
trongthanhptnk / Dilated_Res_Attention_LSTM
View on GitHub
Simple implementations of dilated LSTM, residual LSTM, and attention LSTM (following the corresponding papers).
☆17Dec 26, 2019Updated 6 years ago
aispeech-lab / DiffuseNoiseGeneration
View on GitHub
☆25Nov 23, 2021Updated 4 years ago
tzyll / ChineseHP
View on GitHub
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16Jul 4, 2024Updated 2 years ago
zelaki / DisfluentFA
View on GitHub
A Weakly Supervised Forced Alignment for disfluent speech
☆15Nov 12, 2023Updated 2 years ago
wavlab-speech / cmu_multilingual_speech
View on GitHub
CMU multilingual speech repository
☆30Apr 15, 2022Updated 4 years ago
Mddct / WeUSM
View on GitHub
☆13Mar 30, 2023Updated 3 years ago
datemoon / tf-code-acoustics
View on GitHub
A code library for training acoustic models
☆27May 20, 2020Updated 6 years ago
datemoon / ASR-decoder
View on GitHub
An ASR decoder and graph-building project
☆33May 26, 2022Updated 4 years ago
wenet-e2e / wesr
View on GitHub
We Speech Transcript based on LLM, in 300 lines of code.
☆182Jun 20, 2025Updated last year
wenet-e2e / wecut
View on GitHub
video cut powered by AI
☆23Nov 15, 2022Updated 3 years ago
idiap / contextual-biasing-on-gpus
View on GitHub
Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…
☆21Sep 25, 2023Updated 2 years ago
xiangxyq / minimize-chain-decoder
View on GitHub
Minimize kaldi nnet3 chain decoder
☆45Jan 10, 2020Updated 6 years ago
usnistgov / F4DE
View on GitHub
Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluatio…
☆26Jul 6, 2017Updated 9 years ago
wangkenpu / WSJ2WAV
View on GitHub
Convert WSJ sphere format to waveform and do data simulation.
☆16Feb 20, 2020Updated 6 years ago
lovemefan / fsmn-vad
View on GitHub
An enterprise-grade Voice Activity Detector from modelscope and funasr.
☆139Apr 26, 2023Updated 3 years ago
Jackson-Kang / Pytorch-Diffusion-Model-Tutorial
View on GitHub
A simple tutorial of Diffusion Probabilistic Models
☆114Nov 30, 2024Updated last year
snsun / kaldi-decoder-code-reading
View on GitHub
☆33Oct 28, 2022Updated 3 years ago
mispchallenge / MISP-2023-Challenge-Baseline
View on GitHub
☆25Jan 2, 2024Updated 2 years ago
RicherMans / SAT
View on GitHub
Streaming Audiotransformers for online Audio tagging
☆57Jun 14, 2024Updated 2 years ago
k2-fsa / kaldifst
View on GitHub
Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files
☆56Apr 9, 2026Updated 3 months ago
fengpeng-yue / speech-to-speech-translation
View on GitHub
☆25Feb 12, 2023Updated 3 years ago
glecorve / rnnlm2wfst
View on GitHub
Conversion of recurrent neural network language models to weighted finite state transducers
☆58Jun 1, 2018Updated 8 years ago
JusperLee / speech-paper-daily-skill
View on GitHub
☆26Mar 31, 2026Updated 3 months ago
wenet-e2e / WeTextProcessing.deprecated
View on GitHub
☆61Jan 31, 2023Updated 3 years ago
winlinvip / srs-k2
View on GitHub
Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC
☆20Apr 16, 2023Updated 3 years ago
mr-rigden / pyloudness
View on GitHub
How loud is that file?
☆12Sep 3, 2019Updated 6 years ago
lifeiteng / VoiceBox
View on GitHub
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
☆29Aug 4, 2023Updated 2 years ago
timedomain-tech / ACE_phonemes
View on GitHub
A guide to grapheme-to-phoneme conversion and the phoneme list for the ACE singing voice synthesis engine
☆44Jan 17, 2025Updated last year