☆18Nov 18, 2022Updated 3 years ago
Alternatives and similar repositories for Speaker_recognition
Users that are interested in Speaker_recognition are comparing it to the libraries listed below
Sorting:
- ☆35Feb 14, 2025Updated last year
- ☆42Feb 14, 2025Updated last year
- ☆34Feb 14, 2025Updated last year
- ☆48Feb 14, 2025Updated last year
- ☆31Feb 14, 2025Updated last year
- ☆50Jul 6, 2023Updated 2 years ago
- ☆51Jan 6, 2022Updated 4 years ago
- ☆96Jul 6, 2023Updated 2 years ago
- ☆101Mar 24, 2023Updated 2 years ago
- Direction of arrival (DOA) estimation is a fundamental problem in array signal processing with applications spanning radar, sonar, wirele…☆27Sep 1, 2025Updated 6 months ago
- FastSpeech2, modified for training KSS Dataset. Modified from https://github.com/ming024/FastSpeech2☆38Dec 19, 2025Updated 2 months ago
- ☆10Nov 16, 2024Updated last year
- ☆14Nov 26, 2024Updated last year
- ☆25Dec 19, 2025Updated 2 months ago
- ICASSP 2024: Robust DOA estimation from deep acoustic imaging☆22Apr 14, 2024Updated last year
- The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]☆24Jun 9, 2025Updated 9 months ago
- AGI_HER_LLM☆36Dec 19, 2025Updated 2 months ago
- ☆22Jul 10, 2025Updated 8 months ago
- Python implementation of the paper "Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection"☆28Apr 26, 2024Updated last year
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆18Dec 1, 2024Updated last year
- AEC Challenge☆14Nov 12, 2021Updated 4 years ago
- PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection☆35Sep 17, 2025Updated 5 months ago
- Official repository of the work "Speaker Distance Estimation in Enclosures from Single-Channel Audio" published to IEEE/ACM Transactions …☆30Nov 18, 2025Updated 3 months ago
- ☆24Feb 28, 2023Updated 3 years ago
- pre-process script for timit data for dnn-aec works☆36Mar 3, 2022Updated 4 years ago
- Real Acoustic Fields An Audio-Visual Room Acoustics Dataset and Benchmark☆61Aug 29, 2024Updated last year
- AudioBench: A Universal Benchmark for Audio Large Language Models☆295Jun 17, 2025Updated 8 months ago
- PyTorch implementation of RNN-Transducer(RNN-T).☆81May 6, 2021Updated 4 years ago
- Official PyTorch implementation of "Paralinguistics-Aware Speech-Empowered LLMs for Natural Conversation" (NeurIPS 2024)☆94Dec 3, 2024Updated last year
- This is the official implementation of the LiSenNet☆150Nov 15, 2024Updated last year
- ☆136Oct 25, 2021Updated 4 years ago
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…☆153Apr 29, 2025Updated 10 months ago
- [NeurIPS 2025] Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix☆197Feb 25, 2026Updated last week
- ☆167Nov 28, 2024Updated last year
- Expressive Anechoic Recordings of Speech (EARS)☆210Jun 25, 2024Updated last year
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆212Sep 19, 2024Updated last year
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆194Jul 12, 2024Updated last year
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆251Dec 12, 2025Updated 2 months ago
- VoiceBench: Benchmarking LLM-Based Voice Assistants☆336Jan 29, 2026Updated last month