ZihanZhaoSJTU/LibriSQA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZihanZhaoSJTU/LibriSQA)

ZihanZhaoSJTU / LibriSQA

☆39

Alternatives and similar repositories for LibriSQA

Users that are interested in LibriSQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BlueZeros / ReflecTool
View on GitHub
Benchmark, Toolbox, and Reflection-based Method for Clinical Agent
☆22Nov 6, 2024Updated last year
Jack-ZC8 / M3AV-dataset
View on GitHub
[ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
☆24May 29, 2025Updated last year
pixas / TAIA_LLM
View on GitHub
☆17Nov 1, 2024Updated last year
SJTU-OmniAgent / VocalNet
View on GitHub
☆123May 18, 2026Updated 2 months ago
MAGIC-AI4Med / EHR-R1
View on GitHub
☆37May 18, 2026Updated 2 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Splend1d / T5lephone
View on GitHub
Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
☆19Nov 29, 2022Updated 3 years ago
pixas / MedSSS
View on GitHub
Towards Medical Small Language Models with Self-Evolved \\ Slow Thinking
☆90Nov 11, 2025Updated 8 months ago
pixas / NoRM
View on GitHub
ICLR 2025
☆30May 21, 2025Updated last year
Jihunlee326 / Pytorch-GANs
View on GitHub
Pytorch implementation of Generative Adversarial Networks (GAN) for ULTRASOUND image.
☆13Sep 12, 2018Updated 7 years ago
tdlhl / RAD
View on GitHub
[NeurIPS 2025] This is the official repository for "RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis"
☆27Nov 21, 2025Updated 8 months ago
jisang93 / VISinger
View on GitHub
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆20May 12, 2023Updated 3 years ago
Speech-Lab-IITM / data2vec-aqc
View on GitHub
Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…
☆13Mar 18, 2024Updated 2 years ago
BlueZeros / AgentEHR
View on GitHub
Agentic System, Tool Use, Electronic Health Record, Large Language Models, Clinical Nature Language Processing
☆24Apr 13, 2026Updated 3 months ago
SonyCSLParis / ssl-singer-identity
View on GitHub
☆69Nov 6, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
nervjack2 / Speech2Unit
View on GitHub
☆13Sep 25, 2024Updated last year
MediaBrain-SJTU / GenMedicalEval
View on GitHub
☆86Jan 15, 2024Updated 2 years ago
atosystem / SSL_Interface
View on GitHub
Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024
☆16Nov 19, 2024Updated last year
huangruizhe / ConEC
View on GitHub
☆14Jun 17, 2024Updated 2 years ago
Bartelds / ctc-dro
View on GitHub
Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.
☆17May 16, 2025Updated last year
xfetus / midl2023
View on GitHub
Short paper to Medical Imaging with Deep Learning 2023 (#MIDL2023) > https://arxiv.org/abs/2304.03941
☆12Jul 17, 2023Updated 3 years ago
ituvisionlab / EdVAE
View on GitHub
Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"
☆14Sep 20, 2024Updated last year
VITA-MLLM / LUCY
View on GitHub
LUCY: Linguistic Understanding and Control Yielding Early Stage of Her
☆60Apr 14, 2025Updated last year
Aofei-Chang / MedHEval
View on GitHub
Repo for preprint 2025 "MedHEval: Benchmarking Hallucinations and Mitigation Strategies in Medical Large Vision-Language Models"
☆16Apr 23, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
monglechap / fluenttts
View on GitHub
FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS
☆20Nov 15, 2022Updated 3 years ago
ashi-ta / speechGLUE
View on GitHub
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
☆13Jun 2, 2023Updated 3 years ago
slSeanWU / beats-conformer-bart-audio-captioner
View on GitHub
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…
☆41Jan 6, 2024Updated 2 years ago
slp-rl / SpokenStoryCloze
View on GitHub
A spoken version of the textual story cloze benchmark
☆22Aug 6, 2023Updated 2 years ago
ictnlp / LLaMA-Omni2
View on GitHub
☆278May 19, 2025Updated last year
THU-KEG / Awesome_MOOCs
View on GitHub
This is a repo listing some must-read papers on *AI-driven MOOCs* or *Intelligent Education* published in recent years, mainly contribute…
☆18Jun 8, 2022Updated 4 years ago
Anuttacon / speech_drame
View on GitHub
☆33Nov 4, 2025Updated 8 months ago
skit-ai / slu-prosody
View on GitHub
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27May 17, 2023Updated 3 years ago
rithiksachdev / PostASR-Correction-SLT2024
View on GitHub
☆18Jul 22, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
MediaBrain-SJTU / MING
View on GitHub
明医 (MING)：中文医疗问诊大模型
☆1,162May 23, 2025Updated last year
jaehyeongAN / KoELECTRA-finetuned-sentiment-analysis
View on GitHub
Generalized Sentiment Classifier finetuned by KoELECTRA
☆11Nov 28, 2024Updated last year
shizhediao / T-DNA
View on GitHub
Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resourc…
☆19Jan 12, 2023Updated 3 years ago
asappresearch / simple-tts
View on GitHub
Contains the code associated with the ICLR submission for our text-to-speech diffusion model
☆57Oct 31, 2023Updated 2 years ago
AaronZ345 / TCSinger2
View on GitHub
PyTorch Implementation of TCSinger 2(ACL 2025): Customizable Multilingual Zero-shot Singing Voice Synthesis
☆182Apr 19, 2026Updated 3 months ago
ffaisal93 / SD-QA
View on GitHub
☆16Feb 10, 2026Updated 5 months ago
Hertin / WavPrompt
View on GitHub
☆37Jun 30, 2022Updated 4 years ago