AIDASLab/MathReader

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AIDASLab/MathReader)

AIDASLab / MathReader

Implementation of MathReader, Text-to-Speech for Mathematical Documents

☆33

Alternatives and similar repositories for MathReader

Users that are interested in MathReader are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AMAAI-Lab / DART
View on GitHub
Demo for DART, Audio Imagination workshop submission in NeurIPS 2024
☆16Apr 22, 2026Updated 3 months ago
apple / ml-omni-router-moe-asr
View on GitHub
☆18Oct 24, 2025Updated 9 months ago
yuriak / SpeechDialogueFactory
View on GitHub
☆40Apr 3, 2025Updated last year
jishengpeng / WavReward
View on GitHub
WavReward: Spoken Dialogue Models With Generalist Reward Evaluators
☆56May 15, 2025Updated last year
mbzuai-nlp / sttatts
View on GitHub
☆31Oct 29, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
samsad35 / code-ancogen
View on GitHub
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14Mar 11, 2025Updated last year
microsoft / NoAudioCaptioning
View on GitHub
Repository for "Training Audio Captioning Models without Audio"
☆10Sep 26, 2023Updated 2 years ago
Ereboas / TacoLM
View on GitHub
☆19May 2, 2024Updated 2 years ago
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
Coder-jzq / RADKA-CSS
View on GitHub
☆17Mar 25, 2025Updated last year
line / promptttspp
View on GitHub
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
☆86Oct 11, 2024Updated last year
vTAD2025-Challenge / vTAD
View on GitHub
☆17Oct 24, 2025Updated 9 months ago
krafton-ai / Raon-Speech
View on GitHub
Open-source speech AI models from KRAFTON, including Raon-Speech and Raon-SpeechChat for speech understanding, generation, and real-time …
☆75Apr 7, 2026Updated 3 months ago
zukijourney / api-docs
View on GitHub
Documentation of the ZukiJourney-API business.
☆12Sep 2, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
koth / EmotiVoice.cpp
View on GitHub
cpp inference for EmotiVoice
☆16Jan 1, 2024Updated 2 years ago
AIDASLab / Medic-AD
View on GitHub
[CVPR 2026 Oral] Official implementation for "MEDIC-AD: Towards Medical Vision-Language Model's Clinical Intelligence"
☆29Apr 9, 2026Updated 3 months ago
yjzxkxdn / Mini-DDSP
View on GitHub
☆16Mar 31, 2025Updated last year
davidmarttila / vocal-tract-grad
View on GitHub
Vocal Tract Area Estimation by Gradient Descent
☆39Jul 16, 2023Updated 3 years ago
NUS-HPC-AI-Lab / MoST
View on GitHub
MoST: Mixing Speech and Text with Modality-Aware Mixture of Experts
☆33Jan 15, 2026Updated 6 months ago
google-deepmind / librispeech-long
View on GitHub
LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation …
☆99Dec 28, 2024Updated last year
wsntxxn / UniFlow-Audio
View on GitHub
☆74Jul 17, 2026Updated last week
luotianze666 / WaveFM
View on GitHub
[NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching
☆133Apr 8, 2026Updated 3 months ago
AIDASLab / Dynin-Omni
View on GitHub
Dynin-Omni: Open-Sourced Omnimodal Unified Large Diffusion Language Model
☆49Apr 13, 2026Updated 3 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
0417keito / PromptTTS2
View on GitHub
[WIP] Unofficial Implementation of Microsoft's PromptTTS2
☆53Oct 31, 2023Updated 2 years ago
BiSinger-SVS / BiSinger
View on GitHub
Bilingual Singing Voice Synthesis
☆18Mar 25, 2024Updated 2 years ago
yongaifadian1 / MNV-17
View on GitHub
Qwen2.5-Omni fine-tuned on MNV-17 dataset for nonverbal vocalization recognition
☆31Nov 13, 2025Updated 8 months ago
MahtaFetrat / LLM-Powered-G2P
View on GitHub
Code and Resources for "LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study", introducing methods to leverage LLMs for G…
☆19May 21, 2025Updated last year
Jazzcharles / AuroLA
View on GitHub
☆28Feb 23, 2026Updated 5 months ago
krafton-ai / Raon-OpenTTS
View on GitHub
Open-source text-to-speech model from KRAFTON trained exclusively on public speech data, with curated datasets and reproducible training …
☆75May 21, 2026Updated 2 months ago
glory20h / FitHuBERT
View on GitHub
FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022)
☆19Nov 15, 2023Updated 2 years ago
yueyueL / XAIforAndroidMalware
View on GitHub
Explainable AI for Android Malware Detection: Towards Understanding Why the Models Perform So Well?
☆14Aug 24, 2022Updated 3 years ago
ga642381 / Taiwanese-Speech-Synthesis
View on GitHub
Taiwanese Speech Synthesis with Tacotron2
☆26Oct 2, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
seungheondoh / msu-benchmark
View on GitHub
music semantic understanding evaluation benchmark
☆24Aug 12, 2023Updated 2 years ago
MontrealCorpusTools / MFA-reorganization-scripts
View on GitHub
Collection of scripts and utilities for reorganizing corpora to use with the Montreal Forced Aligner
☆43Jun 22, 2021Updated 5 years ago
alsgur9368 / FM-Singer
View on GitHub
☆19Jan 19, 2026Updated 6 months ago
AndroidLearningTeam / Android-Plan
View on GitHub
当今海量的移动应用跟人们的生活、工作、学习、休闲、娱乐等方面密切相关，发挥着重要作用。多数APP在安装、更新时，都会向用户申请相关手机权限。多数终端用户缺乏鉴别APP所请求的权限是否合理的能力，并且APP安装使用过程中过度索要权限现象较为普遍，这就给用户数据安全、隐私信息泄…
☆13Feb 11, 2020Updated 6 years ago
asus4 / kokoro-tts-unity
View on GitHub
☆22May 4, 2025Updated last year
qiuqiao / DDSP-HiFiGAN
View on GitHub
基于PC-DDSP和nsf-HiFiGAN的声码器
☆19Jul 17, 2023Updated 3 years ago
cantabile-kwok / vec2wav2.0
View on GitHub
Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995
☆79Dec 3, 2024Updated last year