☆18Nov 18, 2022Updated 3 years ago
Alternatives and similar repositories for Speaker_recognition
Users that are interested in Speaker_recognition are comparing it to the libraries listed below
- ☆23Feb 14, 2025Updated last year
- ☆35Feb 14, 2025Updated last year
- ☆34Feb 14, 2025Updated last year
- ☆42Feb 14, 2025Updated last year
- ☆48Feb 14, 2025Updated last year
- ☆34Feb 14, 2025Updated last year
- ☆31Feb 14, 2025Updated last year
- ☆34Feb 14, 2025Updated last year
- Speaker designation module☆20Feb 14, 2025Updated last year
- ☆51Jan 6, 2022Updated 4 years ago
- ☆59Jan 6, 2022Updated 4 years ago
- ☆101Mar 24, 2023Updated 2 years ago
- Multi-speaker & Multi-style TTS☆28Jul 3, 2024Updated last year
- Direction of arrival (DOA) estimation is a fundamental problem in array signal processing with applications spanning radar, sonar, wirele…☆27Sep 1, 2025Updated 6 months ago
- ☆10Nov 16, 2024Updated last year
- ☆14Nov 26, 2024Updated last year
- Flow matching based speaker verification☆24Dec 20, 2025Updated 2 months ago
- [DATE 2023] Pipe-BD: Pipelined Parallel Blockwise Distillation☆12Jul 13, 2023Updated 2 years ago
- The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]☆23Jun 9, 2025Updated 9 months ago
- ☆22Jul 10, 2025Updated 8 months ago
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆18Dec 1, 2024Updated last year
- AEC Challenge☆14Nov 12, 2021Updated 4 years ago
- TAME: Temporal Audio-based Mamba for Enhanced Drone Trajectory Estimation and Classification☆27Mar 12, 2025Updated 11 months ago
- PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection☆35Sep 17, 2025Updated 5 months ago
- [ASRU 2025] Omni-R1: Do You Really Need Audio to Fine-Tune Your Audio LLM?☆43Nov 21, 2025Updated 3 months ago
- Official repository of the work "Speaker Distance Estimation in Enclosures from Single-Channel Audio" published to IEEE/ACM Transactions …☆30Nov 18, 2025Updated 3 months ago
- ☆24Feb 28, 2023Updated 3 years ago
- A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]☆27Feb 11, 2023Updated 3 years ago
- [ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.☆33Feb 6, 2026Updated last month
- Speed-optimized streaming neural speech enhancement network☆90Feb 27, 2026Updated last week
- ☆30Dec 10, 2022Updated 3 years ago
- Python Wrapper for RnNoise v0.2☆75Jan 14, 2026Updated last month
- Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark☆60Aug 29, 2024Updated last year
- Fully Quantized Neural Networks For Speech Enhancement☆63Feb 15, 2024Updated 2 years ago
- AudioBench: A Universal Benchmark for Audio Large Language Models☆295Jun 17, 2025Updated 8 months ago
- small audio language model for reasoning☆86Dec 4, 2025Updated 3 months ago
- LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances …☆85Jun 22, 2025Updated 8 months ago
- PyTorch implementation of RNN-Transducer (RNN-T).☆81May 6, 2021Updated 4 years ago
- Code for ICML 2024 paper (Oral) — Test-Time Model Adaptation with Only Forward Passes☆95Aug 22, 2024Updated last year