인명 구조용 드론을 위한 음성 화자 인지 기술
☆31Jan 31, 2023Updated 3 years ago
Alternatives and similar repositories for drone-robust-gender-classification
Users that are interested in drone-robust-gender-classification are comparing it to the libraries listed below
Sorting:
- Sound Source Localization for PCM-A10 Microphone☆33Jan 31, 2023Updated 3 years ago
- 인명 구조용 드론을 위한 음성 화자 인지 기술 데이터셋☆24Jan 2, 2023Updated 3 years ago
- Sound Source Localization for PCM-A10 Microphone☆24Jan 16, 2023Updated 3 years ago
- 인명 구조용 드론을 위한 음성 화자 인지 기술☆23Jan 10, 2023Updated 3 years ago
- ☆27Jan 31, 2023Updated 3 years ago
- Sound Source Localization for AI Grand Challenge 2021☆21Feb 7, 2022Updated 4 years ago
- Sound Source Localization for AI Grand Challenge 2021☆22Feb 8, 2022Updated 4 years ago
- The Introduction of the OLKAVS Dataset☆37May 28, 2024Updated last year
- Official repository of NeXt-TDNN for speaker verification☆80Oct 10, 2024Updated last year
- The Official Code Repo for EgoOrientBench [CVPR25]☆14Nov 24, 2025Updated 3 months ago
- Tracking pipeline with detection models and tracking algorithms☆34Dec 23, 2023Updated 2 years ago
- C++ STFT and others☆40Feb 11, 2026Updated 3 weeks ago
- Pytorch implementation of CycleGAN.☆42Sep 6, 2017Updated 8 years ago
- A Survey on video and language understanding.☆50Apr 21, 2023Updated 2 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆26Apr 27, 2022Updated 3 years ago
- PyTorch implementation of DCGAN☆52Aug 21, 2017Updated 8 years ago
- STT 한글 문장 인식기 출력 스크립트의 외자 오류율(CER), 단어 오류율(WER)을 계산하는 Python 함수 패키지☆68Jun 18, 2025Updated 8 months ago
- ☆33Feb 11, 2023Updated 3 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆42Jul 23, 2023Updated 2 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 10 months ago
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- ☆12Nov 30, 2022Updated 3 years ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- ☆13Oct 25, 2024Updated last year
- A fourier-based audio-synthesiser wrote in MATLAB as a university project.☆12Jan 19, 2019Updated 7 years ago
- Code for CLVision workshop (CVPR 2024) paper - Calibrating Higher-Order Statistics for Few-Shot Class-Incremental Learning with Pre-train…☆11Nov 12, 2024Updated last year
- Official Repository of Six Dragons Fly Again (ISMIR 2024)☆13Nov 13, 2025Updated 3 months ago
- [Journal of Artificial Intelligence Research] Source code for our paper "Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synth…☆12Jan 8, 2024Updated 2 years ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 5 months ago
- Bayesian Optimization Meets Self-Distillation, ICCV 2023☆10Aug 28, 2023Updated 2 years ago
- Official Implementation of Visual Transformer Pooling for Lip reading☆40Aug 8, 2022Updated 3 years ago
- VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai☆36Updated this week
- ☆16Jun 1, 2023Updated 2 years ago
- GAN-based naturalness-preserving image tone enhancement (PG 2019)☆11Dec 6, 2019Updated 6 years ago
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 5 years ago
- This is a simple paper trading cryptocurrency trading bot that's using Coingecko data to buy and sell coins based on price movement. It c…☆12Dec 12, 2024Updated last year
- Implementation for NeurIPS 2024 paper "SAFE: Slow and Fast Parameter-Efficient Tuning for Continual Learning with Pre-Trained Models" (ht…☆14Dec 23, 2024Updated last year
- Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch☆14Updated this week
- nocaps: novel object captioning at scale☆10May 23, 2019Updated 6 years ago