seongq/AGI_HER_SE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/seongq/AGI_HER_SE)

seongq / AGI_HER_SE

☆24

Alternatives and similar repositories for AGI_HER_SE

Users that are interested in AGI_HER_SE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kpaul073 / AGI_HER_SV
View on GitHub
Flow matching based speaker verification
☆24Dec 20, 2025Updated 6 months ago
lcw2014 / AGI_HER_LLM
View on GitHub
AGI_HER_LLM
☆35Dec 19, 2025Updated 6 months ago
seongq / AGI_HER_MER
View on GitHub
☆29Dec 19, 2025Updated 6 months ago
Kapjin / AGI_HER_TTS
View on GitHub
FastSpeech2, modified for training KSS Dataset. Modified from https://github.com/ming024/FastSpeech2
☆37Dec 19, 2025Updated 6 months ago
lucadellalib / ts-asr
View on GitHub
Target speaker automatic speech recognition (TS-ASR)
☆14Oct 14, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
FuchenZhang / GS-MCC
View on GitHub
Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph Spectrum
☆35Dec 15, 2024Updated last year
AmgadSalama / DOA
View on GitHub
Direction of arrival (DOA) estimation is a fundamental problem in array signal processing with applications spanning radar, sonar, wirele…
☆39Jun 5, 2026Updated last month
seongq / cascadingtwoflowmatching
View on GitHub
(Interspeech 2025, official code) Speech enhancement based on cascaded two flows
☆16Jun 18, 2026Updated 3 weeks ago
ASLP-lab / Easy-Turn
View on GitHub
Open-Source Turn-Taking Detection Model and Dataset for Full-Duplex Spoken Dialogue Systems
☆118Jan 25, 2026Updated 5 months ago
Audio-WestlakeU / Mel-McNet
View on GitHub
The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]
☆26May 14, 2026Updated last month
CARNIVAL-IITP / Noise_suppression
View on GitHub
☆35Feb 14, 2025Updated last year
adrianSRoman / DeepWaveDOA
View on GitHub
ICASSP 2024: Robust DOA estimation from deep acoustic imaging
☆24Apr 14, 2024Updated 2 years ago
naver-ai / RapFlow-TTS
View on GitHub
☆55Jul 16, 2025Updated 11 months ago
Joshua-1995 / LearnableUpsamplingLayer-Pytorch
View on GitHub
Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)
☆57Mar 12, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
dhuertas / http
View on GitHub
A custom, multi-threaded HTTP web server
☆35Feb 17, 2013Updated 13 years ago
CARNIVAL-IITP / Speaker_recognition
View on GitHub
☆18Nov 18, 2022Updated 3 years ago
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
cychomatica / AudioPure
View on GitHub
Defending against Adversarial Audio via Diffusion Model (ICLR 2023)
☆35Mar 2, 2023Updated 3 years ago
AmazingDay1 / TAME
View on GitHub
TAME: Temporal Audio-based Mamba for Enhanced Drone Trajectory Estimation and Classification
☆32Mar 12, 2025Updated last year
AmandineBtto / NeRAF
View on GitHub
[ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.
☆36Mar 11, 2026Updated 3 months ago
BingYang-20 / DP-RTF-Learning
View on GitHub
A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]
☆28Feb 11, 2023Updated 3 years ago
ydqmkkx / ShallowFlowMatching-TTS
View on GitHub
Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis
☆55Sep 20, 2025Updated 9 months ago
michaelneri / audio-distance-estimation
View on GitHub
Official repository of the work "Speaker Distance Estimation in Enclosures from Single-Channel Audio" published to IEEE/ACM Transactions …
☆40Jun 29, 2026Updated last week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
CARNIVAL-IITP / Speech_source_separation
View on GitHub
☆23Feb 14, 2025Updated last year
Jinbo-Hu / PSELDNets
View on GitHub
PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
☆46Sep 17, 2025Updated 9 months ago
flamed-tts / Flamed-TTS
View on GitHub
This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …
☆57Aug 9, 2025Updated 11 months ago
piotrkawa / specrnet
View on GitHub
Implementation of "SpecRNet: Towards Faster and More Accessible Audio DeepFake Detection" paper
☆39Mar 21, 2023Updated 3 years ago
eloffel / improved_embeddings
View on GitHub
Source Code for "Improved Embeddings for Learning Prerequisite Chains" (CPSC 490 - Senior Project)
☆11May 2, 2019Updated 7 years ago
miguelballesteros / LSTM-punctuation
View on GitHub
☆11Feb 17, 2017Updated 9 years ago
teddysum / korean_evaluation
View on GitHub
☆10Jun 5, 2025Updated last year
Tikai7 / DiTTO-TTS
View on GitHub
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors
☆39Feb 11, 2025Updated last year
koushikkonwar / Few-Shot-
View on GitHub
Few shot learning in NLP
☆11Oct 1, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
kakaoenterprise / OutFlip
View on GitHub
Implementation of the ACL Findings paper "OutFlip: Generating Examples for Unknown Intent Detection with Natural Language Attack"
☆10May 24, 2021Updated 5 years ago
kh-kim / deeplearning_with_pytorch
View on GitHub
☆12Mar 8, 2020Updated 6 years ago
Yoctol / text-normalizer
View on GitHub
Normalize text string
☆12Nov 6, 2018Updated 7 years ago
kartikgill / taco-box
View on GitHub
An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR
☆15Dec 4, 2021Updated 4 years ago
ZarahShibli / Arabic_Punctuation_Prediction
View on GitHub
Sequence to sequence model for Arabic punctuation prediction.
☆12Feb 13, 2020Updated 6 years ago
machinelearning-pangyo / Hands-On-MachineLearning
View on GitHub
Let's make good things!
☆13Aug 22, 2018Updated 7 years ago
hwRG / End-to-End-TTS-Fine-Tune
View on GitHub
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.
☆29Jul 30, 2023Updated 2 years ago