qiuqiangkong/mini_llm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/qiuqiangkong/mini_llm)

qiuqiangkong / mini_llm

☆29

Alternatives and similar repositories for mini_llm

Users that are interested in mini_llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

qiuqiangkong / music_llm
View on GitHub
☆56Jul 13, 2025Updated last year
qiuqiangkong / audioflow
View on GitHub
☆130Updated this week
AudioFans / audidata
View on GitHub
☆21Apr 24, 2025Updated last year
zeyuxie29 / SemanticVocoder
View on GitHub
☆28Apr 6, 2026Updated 3 months ago
qiuqiangkong / mini_music_tagging
View on GitHub
☆13Jul 14, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
qiuqiangkong / materials_for_students
View on GitHub
☆16Aug 10, 2025Updated 11 months ago
zhaoyx239 / X-Translator
View on GitHub
☆26Jul 21, 2026Updated last week
xiquan-li / FineLAP
View on GitHub
[ACL 2026 Main] FineLAP: Taming Heterogeneous Supervision for Fine-grained Language-Audio Pre-training
☆36Apr 20, 2026Updated 3 months ago
Cr-Fish / WESR
View on GitHub
Official implementation of ACL'26 (findings) paper WESR (Word-level Event-Speech Recognition): A comprehensive benchmark and baseline for…
☆39Jan 30, 2026Updated 5 months ago
yongyizang / GSound-SIR
View on GitHub
A Python Room Spatial Impulse Response Ray-Tracing Toolkit
☆86Mar 4, 2026Updated 4 months ago
qiuqiangkong / music_source_separation
View on GitHub
☆60Jun 15, 2026Updated last month
wdqqdw / Echo
View on GitHub
Project page of "2026-ICLR Echo: Towards Advanced Audio Comprehension via Audio-Interleaved Reasoning"
☆16Mar 26, 2026Updated 4 months ago
juhayna-zh / AudioControlNet
View on GitHub
Official repository for the paper "Audio ControlNet for Fine-Grained Audio Generation and Editing".
☆77Feb 7, 2026Updated 5 months ago
Labbeti / aac-metrics
View on GitHub
Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.
☆75Mar 22, 2026Updated 4 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Xia-aaa / L3former
View on GitHub
☆14Jun 26, 2025Updated last year
smulelabs / windowed-roformer
View on GitHub
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆45Oct 30, 2025Updated 9 months ago
lonzi / mrflow_dpo
View on GitHub
☆22Jan 3, 2026Updated 6 months ago
JishengBai / ICME2024ASC
View on GitHub
baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift
☆18Mar 16, 2024Updated 2 years ago
dr-pato / SSGD
View on GitHub
Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"
☆15Dec 22, 2022Updated 3 years ago
juhayna-zh / Awesome-Music-Generation-Papers
View on GitHub
Curated list of groundbreaking music generation research.
☆21Apr 24, 2026Updated 3 months ago
xiquan-li / Resonate
View on GitHub
[INTERSPEECH 2026] Pre-training, SFT, DPO and GRPO for Text-to-Audio Generation
☆48Apr 17, 2026Updated 3 months ago
inclusionAI / AudioMCQ
View on GitHub
[ICLR 2026] AudioMCQ: A 571k audio multiple-choice question dataset for post-training Large Audio Language Models with dual CoT annotatio…
☆51Apr 21, 2026Updated 3 months ago
ZhikangNiu / Semantic-VAE
View on GitHub
[INTERSPEECH 2026 Oral]Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"
☆121Jun 21, 2026Updated last month
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
k2-fsa / Flow2GAN
View on GitHub
Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation
☆146Mar 8, 2026Updated 4 months ago
lizhaoqing / UNISON
View on GitHub
☆43Jun 3, 2026Updated last month
wsntxxn / UniFlow-Audio
View on GitHub
☆74Jul 17, 2026Updated last week
ddlBoJack / Awesome-Speech-Language-Model
View on GitHub
Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.
☆202Jun 7, 2026Updated last month
lmxue / Audio-FLAN
View on GitHub
Audio-FLAN
☆161Sep 23, 2025Updated 10 months ago
kandinskylab / kvae-audio
View on GitHub
KVAE-Audio: a continuous full-band audio waveform autoencoder
☆102Updated this week
sarulab-speech / SpatialCLAP
View on GitHub
☆19Oct 9, 2025Updated 9 months ago
IsaacYQH / WildFX
View on GitHub
Official implementation of WildFX Dataset Generating pipeline.
☆21Oct 21, 2025Updated 9 months ago
xiquan-li / TinyMU
View on GitHub
[ICASSP 2026] TinyMU: A Compact Audio Language Model for Music Understanding
☆37Apr 20, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yanghaha0908 / WavCube
View on GitHub
Official code for "WavCube: Unifying Speech Representation for Understanding and Generation via Semantic-Acoustic Joint Modeling"
☆62Jun 27, 2026Updated last month
ajd12342 / paraspeechcaps
View on GitHub
Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'
☆165Mar 26, 2026Updated 4 months ago
xiquan-li / MeanAudio
View on GitHub
[ACL 2026 Main] MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows
☆145Sep 2, 2025Updated 10 months ago
WikiChao / ZeroSep
View on GitHub
[NeurIPS 2025] Separate Anything in Audio with Zero Training
☆60Nov 3, 2025Updated 8 months ago
MTG / omar-rq
View on GitHub
Training, validation, and inference code for various SSL approaches and architectures.
☆87Apr 7, 2026Updated 3 months ago
violet-liang / soundfield-reconstruction-np
View on GitHub
Sound field reconstruction using neural processes with dynamic kernels
☆16Mar 25, 2025Updated last year
qiuqiangkong / audio_understanding
View on GitHub
☆131Feb 6, 2025Updated last year