gengxuelong/wenet_LLM_from_ASLP

☆15

Alternatives and similar repositories for wenet_LLM_from_ASLP

Users who are interested in wenet_LLM_from_ASLP are comparing it to the repositories listed below.

ASLP-lab / C2SER
We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…
☆17 · Mar 3, 2025 · Updated last year
chenpk00 / IS2024_stream_decoder_only_asr
☆16 · Mar 12, 2024 · Updated 2 years ago
xkx-hub / KALL-E
[AAAI 2026 oral] KALL-E: Autoregressive Speech Synthesis with Next-Distribution Prediction
☆42 · Sep 25, 2025 · Updated 10 months ago
NingAnMe / Label-Smoothing-for-CrossEntropyLoss-PyTorch
Adds a label_smoothing argument to torch.nn.CrossEntropyLoss()
☆14 · Jan 13, 2021 · Updated 5 years ago
liutaocode / DiarizationVisualization
Visualization tools for audio-only and multi-modal speaker diarization datasets
☆13 · Oct 27, 2023 · Updated 2 years ago
lvrysis / Audio-DNN-Classification
Deep Neural Networks for audio classification
☆10 · Apr 11, 2024 · Updated 2 years ago
DeepPSP / cinc2022
Heart Murmur Detection from Phonocardiogram Recordings: The George B. Moody PhysioNet Challenge 2022
☆15 · Jan 6, 2026 · Updated 6 months ago
thuhcsi / Contextual-Biasing-Dataset
Open-source Mandarin biased-word dataset
☆14 · Sep 21, 2023 · Updated 2 years ago
xkx-hub / ISCSLP2024_CoVoC_baseline
☆13 · Jun 8, 2024 · Updated 2 years ago
pigzach / MagicSpeechASR
MagicSpeech competition recipe
☆18 · Jun 29, 2020 · Updated 6 years ago
ASLP-lab / LLaSE-G1
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
☆47 · Mar 10, 2025 · Updated last year
HLTCHKUST / ASCEND
ASCEND Chinese-English code-switching dataset
☆33 · Jul 12, 2022 · Updated 4 years ago
ASLP-lab / MINT-Bench
☆49 · May 2, 2026 · Updated 2 months ago
ASLP-lab / LLaSA_Plus
Llasa Speed Up
☆64 · Jan 18, 2026 · Updated 6 months ago
guozixunnicolas / DENT_DDSP
☆24 · Jun 30, 2023 · Updated 3 years ago
ASLP-lab / Smart-Glass-Challenge
☆18 · Jun 16, 2026 · Updated last month
wenet-e2e / wesr
We Speech Transcript based on LLM, in 300 lines of code.
☆182 · Jun 20, 2025 · Updated last year
diego-fustes / asr-rescoring
Rescoring methods for end-to-end Automatic Speech Recognition
☆27 · Sep 23, 2020 · Updated 5 years ago
kjw11 / Speaker-Aware-CTC
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆22 · May 26, 2025 · Updated last year
ayushkumartarun / deep-regression-unlearning
Official repo of the paper "Deep Regression Unlearning", accepted at ICML 2023
☆16 · Jun 14, 2023 · Updated 3 years ago
DanielLin94144 / Test-time-adaptation-ASR-SUTA
Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…
☆23 · Apr 1, 2022 · Updated 4 years ago
SXU-YaxinGuo / CRMU
Commonsense Reasoning and Moral Understanding Evaluation in Children's Stories (CRMU)
☆18 · Oct 22, 2024 · Updated last year
ASLP-lab / Automatic-Song-Aesthetics-Evaluation-Challenge
☆15 · Dec 14, 2025 · Updated 7 months ago
Jimmy-di / camouflage-poisoning
Camouflage poisoning via machine unlearning
☆19 · Jul 3, 2025 · Updated last year
mubingshen / MLC-SLM-Baseline
The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…
☆51 · May 14, 2025 · Updated last year
ASLP-lab / VoiceSculptor
An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.
☆250 · Feb 26, 2026 · Updated 5 months ago
X-LANCE / public_talks
Materials from public talks given by SJTU X-LANCE members
☆14 · Dec 3, 2022 · Updated 3 years ago
ASLP-lab / MSU-Bench
Open repository of "MSU-Bench: Towards Understanding the Conversational Multi-Speaker Scenarios"
☆20 · Jul 7, 2026 · Updated 3 weeks ago
ASLP-lab / SmartGlasses
This challenge focuses on evaluating speech recognition and semantic understanding capabilities of AI glasses in complex real-world envir…
☆18 · Jun 27, 2026 · Updated last month
Nathan-Roll1 / PSST
Prosodic Speech Segmentation with Transformers
☆28 · Feb 25, 2024 · Updated 2 years ago
JackSyu / Discriminative-Multi-modality-Speech-Recognition
TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"
☆26 · Apr 27, 2022 · Updated 4 years ago
kamilakesbi / DiarizersLM
☆15 · Jul 16, 2024 · Updated 2 years ago
MlWoo / sentence2pinyin
TTS front-end
☆11 · Dec 19, 2018 · Updated 7 years ago
maple-research-lab / RemeDi
Official inference implementation of the paper "DON'T SETTLE TOO EARLY: SELF-REFLECTIVE REMASKING FOR DIFFUSION LANGUAGE MODELS". [ICLR 2…
☆15 · Jan 28, 2026 · Updated 6 months ago
ASLP-lab / Hum-Dial
ICASSP2026 HumDial Challenge
☆51 · May 28, 2026 · Updated 2 months ago
kaihuhuang / Language-Group
☆11 · Dec 24, 2024 · Updated last year
xingchenwan / nasbowl
[ICLR '21] Interpretable Neural Architecture Search using Bayesian Optimisation with Weisfeiler-Lehman Kernel (NAS-BOWL)
☆23 · Dec 27, 2021 · Updated 4 years ago
ASLP-lab / FMSU-Bench
Towards Fine-Grained Multi-Dimensional Speech Understanding: Data Pipeline, Benchmark, and Model
☆25 · May 21, 2026 · Updated 2 months ago
vsingh-group / LCODEC-deep-unlearning
Code for CVPR22 paper "Deep Unlearning via Randomized Conditionally Independent Hessians"
☆25 · Jul 9, 2022 · Updated 4 years ago