Kapjin/AGI_HER_TTS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Kapjin/AGI_HER_TTS)

Kapjin / AGI_HER_TTS

FastSpeech2, modified for training KSS Dataset. Modified from https://github.com/ming024/FastSpeech2

☆37

Alternatives and similar repositories for AGI_HER_TTS

Users that are interested in AGI_HER_TTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lcw2014 / AGI_HER_LLM
View on GitHub
AGI_HER_LLM
☆35Dec 19, 2025Updated 7 months ago
seongq / AGI_HER_SE
View on GitHub
☆24Dec 19, 2025Updated 7 months ago
kpaul073 / AGI_HER_SV
View on GitHub
Flow matching based speaker verification
☆24Dec 20, 2025Updated 7 months ago
seongq / AGI_HER_MER
View on GitHub
☆29Dec 19, 2025Updated 7 months ago
seas2nada / AGI_HER_ASR
View on GitHub
End-to-end ASR repository for AGI
☆20Dec 19, 2025Updated 7 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
CARNIVAL-IITP / Speaker_recognition
View on GitHub
☆18Nov 18, 2022Updated 3 years ago
CARNIVAL-IITP / Noise_suppression
View on GitHub
☆35Feb 14, 2025Updated last year
seongq / cascadingtwoflowmatching
View on GitHub
(Interspeech 2025, official code) Speech enhancement based on cascaded two flows
☆16Jun 18, 2026Updated last month
CARNIVAL-IITP / Beamformer
View on GitHub
☆35Feb 14, 2025Updated last year
SMART-TTS / SMART-NAR_Fast_TTS
View on GitHub
☆50Jul 6, 2023Updated 3 years ago
CARNIVAL-IITP / Packet_loss_concealment
View on GitHub
☆34Feb 14, 2025Updated last year
CARNIVAL-IITP / Automatic_gain_control
View on GitHub
☆43Feb 14, 2025Updated last year
CARNIVAL-IITP / Sound_source_localization
View on GitHub
☆36Feb 14, 2025Updated last year
SMART-TTS / SMART-Single_Emotional_TTS
View on GitHub
☆96Jul 6, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
adrianSRoman / DeepWaveDOA
View on GitHub
ICASSP 2024: Robust DOA estimation from deep acoustic imaging
☆25Apr 14, 2024Updated 2 years ago
SMART-TTS / SMART-Long_Sentence_TTS
View on GitHub
☆51Jan 6, 2022Updated 4 years ago
CARNIVAL-IITP / Bandwidth-extension
View on GitHub
☆31Feb 14, 2025Updated last year
SMART-TTS / SMART-Vocoder
View on GitHub
☆59Jan 6, 2022Updated 4 years ago
DavidDiazGuerra / icoCNN
View on GitHub
Pytorch implementation of the icosahedral CNNs
☆21Apr 24, 2023Updated 3 years ago
donghoney0416 / DeepASA
View on GitHub
Official page of "DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis"
☆26Apr 15, 2026Updated 3 months ago
SMART-TTS / SMART-G2P
View on GitHub
☆103Mar 24, 2023Updated 3 years ago
lucadellalib / ts-asr
View on GitHub
Target speaker automatic speech recognition (TS-ASR)
☆14Oct 14, 2023Updated 2 years ago
keonlee9420 / Deep-Learning-TTS-Template
View on GitHub
This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).
☆14Jun 15, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
seongq / flowmse
View on GitHub
(ICASSP 2025, official code)FlowSE: Flow Matching-based Speech Enhancement
☆108Jul 23, 2025Updated last year
dmlguq456 / TF_Restormer
View on GitHub
Official repository of TF-Restormer for speech restoration
☆15May 14, 2026Updated 2 months ago
CARNIVAL-IITP / Speech_source_separation
View on GitHub
☆23Feb 14, 2025Updated last year
sp-uhh / sgmse_crp
View on GitHub
☆32Jan 9, 2024Updated 2 years ago
CARNIVAL-IITP / Acoustic_echo_cancellation
View on GitHub
☆48Feb 14, 2025Updated last year
FuchenZhang / GS-MCC
View on GitHub
Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph Spectrum
☆37Dec 15, 2024Updated last year
dmlguq456 / NeXt_TDNN_ASV
View on GitHub
Official repository of NeXt-TDNN for speaker verification
☆85Oct 10, 2024Updated last year
michaelneri / audio-distance-estimation
View on GitHub
Official repository of the work "Speaker Distance Estimation in Enclosures from Single-Channel Audio" published to IEEE/ACM Transactions …
☆40Jun 29, 2026Updated last month
infected4098 / Wave-U-Mamba
View on GitHub
An official documentation of the paper <Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution>.
☆26Oct 29, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lijfrank / GraphSmile
View on GitHub
☆30Dec 13, 2025Updated 7 months ago
axeber01 / ngcc-seld
View on GitHub
Sound Event Localization and Detection using Neural Generalized Cross-Correlations
☆36Feb 11, 2025Updated last year
hyyan2k / LiSenNet
View on GitHub
This is the official implementation of the LiSenNet
☆163Nov 15, 2024Updated last year
Yhonatangayer / shroom
View on GitHub
Spherical Harmonics ROOM, an open-source Python library for room acoustics simulation using Ambisonics, https://arxiv.org/abs/2603.27342,…
☆19Jul 12, 2026Updated 2 weeks ago
upskyy / Paper-Review
View on GitHub
Paper Review about Speech Recognition · NLP
☆10Mar 25, 2021Updated 5 years ago
serkansulun / deep-music-enhancer
View on GitHub
Audio bandwidth enhancement with DNNs, addressing filter overfitting
☆41Oct 4, 2023Updated 2 years ago
Audio-WestlakeU / OnlineSSL_DPRTF_EG
View on GitHub
☆12Apr 1, 2020Updated 6 years ago