Dataset for speech and speaker recognition technology for rescue drones
☆24 Jan 2, 2023 Updated 3 years ago
Alternatives and similar repositories for rescue_drone_dataset
Users who are interested in rescue_drone_dataset are comparing it to the libraries listed below
- Speech and speaker recognition technology for rescue drones ☆23 Jan 10, 2023 Updated 3 years ago
- ☆27 Jan 31, 2023 Updated 3 years ago
- Sound Source Localization for PCM-A10 Microphone ☆33 Jan 31, 2023 Updated 3 years ago
- Speech and speaker recognition technology for rescue drones ☆31 Jan 31, 2023 Updated 3 years ago
- Accurate Box Proposal Network for Scene Text Detection ☆30 Feb 23, 2022 Updated 4 years ago
- OCR DB including Korean ☆27 Nov 11, 2021 Updated 4 years ago
- Diffusion-based Korean text-to-image generation model ☆12 Aug 16, 2023 Updated 2 years ago
- Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA) ☆13 Feb 27, 2021 Updated 5 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP] ☆25 Jul 5, 2022 Updated 3 years ago
- A survey on video and language understanding. ☆50 Apr 21, 2023 Updated 2 years ago
- [CVPRW'23 Best Paper Award] Zero-shot Unsupervised Transfer Instance Segmentation ☆24 Aug 22, 2023 Updated 2 years ago
- xor activation ☆26 Jan 6, 2020 Updated 6 years ago
- ☆33 Feb 11, 2023 Updated 3 years ago
- GANalyzer: Analysis and Manipulation of GANs Latent Space for Controllable Face Synthesis ☆40 Feb 15, 2024 Updated 2 years ago
- A simple tool for easily using Montreal Forced Aligner. Also provides alignments (TextGrid) retrieved from ESD. ☆45 May 25, 2023 Updated 2 years ago
- High speed low drag PHP using Docker ☆12 May 29, 2024 Updated last year
- Inverse DALL-E for Optical Character Recognition ☆38 Oct 14, 2022 Updated 3 years ago
- Code for CLVision workshop (CVPR 2024) paper - Calibrating Higher-Order Statistics for Few-Shot Class-Incremental Learning with Pre-train… ☆11 Nov 12, 2024 Updated last year
- Multi-layer perceptron, Autoencoder, and Restricted Boltzmann Machine ☆10 Sep 15, 2018 Updated 7 years ago
- Python scripts for WIDER FACE Evaluation ☆10 May 25, 2019 Updated 6 years ago
- Bayesian Optimization Meets Self-Distillation, ICCV 2023 ☆10 Aug 28, 2023 Updated 2 years ago
- ☆12 Nov 30, 2022 Updated 3 years ago
- Person of Interest. A flexible computer vision library on human analysis, such as person re-identification, human attribute, pose estima… ☆10 May 4, 2020 Updated 5 years ago
- Deeplabv3(+) for BDD100k drivable area ☆47 Jul 15, 2020 Updated 5 years ago
- Implementation for NeurIPS 2024 paper "SAFE: Slow and Fast Parameter-Efficient Tuning for Continual Learning with Pre-Trained Models" (ht… ☆14 Dec 23, 2024 Updated last year
- GAN-based naturalness-preserving image tone enhancement (PG 2019) ☆11 Dec 6, 2019 Updated 6 years ago
- Iconv lib with android build scripts. ☆14 Sep 1, 2012 Updated 13 years ago
- ☆13 Aug 13, 2023 Updated 2 years ago
- ☆10 Nov 10, 2022 Updated 3 years ago
- Interaction Compass: Multi-Label Zero-Shot Learning of Human-Object Interactions via Spatial Relations @ ICCV21 ☆13 Jul 15, 2022 Updated 3 years ago
- Scripts, Files, and Resources for Constructing a Large-scale Dataset of Blackbox Effects for Timbre Transfer ☆16 Feb 4, 2023 Updated 3 years ago
- ☆12 Apr 28, 2023 Updated 2 years ago
- Code for one-stage adaptive set-based HOI detector AS-Net. ☆52 May 8, 2021 Updated 4 years ago
- Music Demixing Challenge Submission Repo ☆15 Sep 8, 2023 Updated 2 years ago
- Simple tool for speech dataset augmentation for modeling various prosodies. ☆14 Jan 14, 2021 Updated 5 years ago
- Codebase for 'A Real-Time Lyrics Alignment System Using Chroma And Phonetic Features For Classical Vocal Performance', ICASSP 2024 ☆13 Oct 4, 2024 Updated last year
- This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch). ☆14 Jun 15, 2021 Updated 4 years ago
- CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction ☆18 Oct 20, 2025 Updated 4 months ago
- Dimensional Emotion Detection from Categorical Emotion Annotation ☆55 Sep 23, 2021 Updated 4 years ago