shleee47/Sound-Source-Localization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shleee47/Sound-Source-Localization)

shleee47 / Sound-Source-Localization

Sound Source Localization for AI Grand Challenge 2021

☆21

Alternatives and similar repositories for Sound-Source-Localization

Users that are interested in Sound-Source-Localization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shleee47 / mpWAV-Sound-Source-Localization
View on GitHub
Sound Source Localization for AI Grand Challenge 2021
☆22Feb 8, 2022Updated 4 years ago
kooBH / drone-robust-gender-classification
View on GitHub
인명 구조용 드론을 위한 음성 화자 인지 기술
☆31Jan 31, 2023Updated 3 years ago
kooBH / PCM-A10-SSL
View on GitHub
Sound Source Localization for PCM-A10 Microphone
☆33Jan 31, 2023Updated 3 years ago
jmml-official / OCR_DB
View on GitHub
OCR DB including Korean
☆27Nov 11, 2021Updated 4 years ago
ncsoft / rotated-box-is-back
View on GitHub
Accurate Box Proposal Network for Scene Text Detection
☆30Feb 23, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ncsoft / drone-robust-gender-classification
View on GitHub
인명 구조용 드론을 위한 음성 화자 인지 기술
☆23Jan 10, 2023Updated 3 years ago
ncsoft / PCM-A10-SSL
View on GitHub
Sound Source Localization for PCM-A10 Microphone
☆24Jan 16, 2023Updated 3 years ago
IIP-Sogang / olkavs-avspeech
View on GitHub
The Introduction of the OLKAVS Dataset
☆39May 28, 2024Updated 2 years ago
uvify-public / rescue_drone_dataset
View on GitHub
☆27Jan 31, 2023Updated 3 years ago
yuhogun0908 / MISOnet
View on GitHub
Unofficial Multi-microphone complex spectral mapping for utterance-wise and continuous speech separation(MISO-BF-MISO)
☆52Jan 13, 2022Updated 4 years ago
ncsoft / rescue_drone_dataset
View on GitHub
인명 구조용 드론을 위한 음성 화자 인지 기술 데이터셋
☆24Jan 2, 2023Updated 3 years ago
ahstarwab / Violence_Detection
View on GitHub
Online and real-time violence recognition
☆16Jul 5, 2022Updated 4 years ago
dmlguq456 / NeXt_TDNN_ASV
View on GitHub
Official repository of NeXt-TDNN for speaker verification
☆84Oct 10, 2024Updated last year
jhCOR / EgoOrientBench
View on GitHub
The Official Code Repo for EgoOrientBench [CVPR25]
☆17Nov 24, 2025Updated 8 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
kooBH / STFT
View on GitHub
C++ STFT and others
☆42Feb 11, 2026Updated 5 months ago
JackSyu / Discriminative-Multi-modality-Speech-Recognition
View on GitHub
TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"
☆26Apr 27, 2022Updated 4 years ago
dmlguq456 / TF_Restormer
View on GitHub
Official repository of TF-Restormer for speech restoration
☆15May 14, 2026Updated 2 months ago
LeeYongHyeok / DCM_vgg_transformer
View on GitHub
Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…
☆14Jul 2, 2020Updated 6 years ago
JuanFMontesinos / VoViT
View on GitHub
VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer
☆35Mar 18, 2023Updated 3 years ago
MiviaLab / DENet
View on GitHub
This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.
☆42Jul 23, 2023Updated 3 years ago
seongq / AGI_HER_MER
View on GitHub
☆29Dec 19, 2025Updated 7 months ago
prajwalkr / vtp
View on GitHub
Official Implementation of Visual Transformer Pooling for Lip reading
☆41Aug 8, 2022Updated 3 years ago
FrePainter / code
View on GitHub
☆28Mar 28, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
upskyy / Automatic-Speech-Recognition-Models
View on GitHub
End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
☆10Jan 21, 2022Updated 4 years ago
gogyzzz / iip_sph_pp
View on GitHub
C library for speech pre-processing.
☆12Jun 7, 2019Updated 7 years ago
pljj315 / instant_id
View on GitHub
☆13Mar 22, 2024Updated 2 years ago
Sanyuan-Chen / CSS_with_EETransformer
View on GitHub
Code for the ICASSP-2021 paper: Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
☆12Sep 2, 2021Updated 4 years ago
Andong-Li-speech / TaEr
View on GitHub
This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…
☆14Nov 25, 2022Updated 3 years ago
BUTSpeechFIT / AMI-diarization-setup
View on GitHub
☆54Oct 17, 2023Updated 2 years ago
yuhogun0908 / AEC
View on GitHub
Acoustic Echo Cancellation
☆14May 29, 2022Updated 4 years ago
thomeou / SALSA-Lite
View on GitHub
This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.
☆15Dec 3, 2021Updated 4 years ago
khanld / Dynamic-Mixing
View on GitHub
Dynamic Mixing For Speech Processing (mix-on-the-fly)
☆22Jul 19, 2022Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
RookieJunChen / FullSubNet-plus
View on GitHub
The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
☆293Jul 26, 2025Updated 11 months ago
nrl-ai / tracking-pipeline
View on GitHub
Tracking pipeline with detection models and tracking algorithms
☆34Dec 23, 2023Updated 2 years ago
kevaldoshi17 / NVIDIA_AICITY
View on GitHub
Repository for NVIDIA AICITY Challenge
☆15Jul 29, 2021Updated 4 years ago
CPJKU / dcase2024_task1_baseline
View on GitHub
☆10Jun 6, 2024Updated 2 years ago
wangkenpu / WSJ2WAV
View on GitHub
Convert WSJ sphere format to waveform and do data simulation.
☆16Feb 20, 2020Updated 6 years ago
mechanicalsea / sugar
View on GitHub
Efficient Speech Processing Tookit for Automatic Speaker Recognition
☆17Feb 8, 2023Updated 3 years ago
frednam93 / FilterAugSED
View on GitHub
☆68Sep 13, 2024Updated last year