☆18Nov 18, 2022Updated 3 years ago
Alternatives and similar repositories for Speaker_recognition
Users that are interested in Speaker_recognition are comparing it to the libraries listed below.
- ☆35Feb 14, 2025Updated last year
- ☆48Feb 14, 2025Updated last year
- ☆34Feb 14, 2025Updated last year
- ☆31Feb 14, 2025Updated last year
- ☆34Feb 14, 2025Updated last year
- Speaker designation module☆20Feb 14, 2025Updated last year
- ☆50Jul 6, 2023Updated 2 years ago
- ☆59Jan 6, 2022Updated 4 years ago
- Multi-speaker & Multi-style TTS☆27Jul 3, 2024Updated last year
- Python implementation of the paper "Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection"☆29Apr 26, 2024Updated 2 years ago
- ☆10Nov 16, 2024Updated last year
- The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]☆25Jun 9, 2025Updated 11 months ago
- ☆14Nov 26, 2024Updated last year
- [DATE 2023] Pipe-BD: Pipelined Parallel Blockwise Distillation☆12Jul 13, 2023Updated 2 years ago
- ☆24Feb 28, 2023Updated 3 years ago
- ☆13Apr 16, 2018Updated 8 years ago
- Flow matching based speaker verification☆24Dec 20, 2025Updated 4 months ago
- An iOS augmented reality app with a map of Saint Petersburg and its 3D landmark models☆19Sep 14, 2018Updated 7 years ago
- AGI_HER_LLM☆36Dec 19, 2025Updated 4 months ago
- [ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.☆34Mar 11, 2026Updated last month
- A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]☆27Feb 11, 2023Updated 3 years ago
- Official repository of the work "Speaker Distance Estimation in Enclosures from Single-Channel Audio" published to IEEE/ACM Transactions …☆35Nov 18, 2025Updated 5 months ago
- Preprocessing script for TIMIT data for DNN-AEC work☆38Mar 3, 2022Updated 4 years ago
- ☆24Jul 10, 2025Updated 10 months ago
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆19Dec 1, 2024Updated last year
- This repository is based on [RGBD_ORB_SLAM2_RT](https://github.com/chaizheng2157/RGBD_ORB_SLAM2_RT). My main contribution is to use depth…☆17Jun 14, 2018Updated 7 years ago
- Text to emoji translation via DeepDreaming☆15Jun 24, 2021Updated 4 years ago
- My instructions for running ORB SLAM3 on Ubuntu 20.04 systems.☆17Oct 5, 2023Updated 2 years ago
- Unofficial LaTeX template for the bachelor's thesis of the Department of Electrical and Computer Engineering, Seoul National University☆19Jun 21, 2021Updated 4 years ago
- [ASRU 2025] Omni-R1: Do You Really Need Audio to Fine-Tune Your Audio LLM?☆46Nov 21, 2025Updated 5 months ago
- This is an implementation of YOLO using LSQ network quantization method.☆22Apr 13, 2022Updated 4 years ago
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆35Mar 14, 2025Updated last year
- Korean datasets available on Hugging Face☆36Oct 10, 2024Updated last year
- A simple Bidirectional Mamba☆32Jul 25, 2025Updated 9 months ago
- ☆31Dec 10, 2022Updated 3 years ago
- Speed-optimized streaming neural speech enhancement network☆112Apr 9, 2026Updated last month
- Python Wrapper for RnNoise v0.2☆77Jan 14, 2026Updated 3 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆63Feb 15, 2024Updated 2 years ago
- Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark☆61Aug 29, 2024Updated last year