LCF2764/autoKWS2021_1st_solution

Auto-KWS 2021 Challenge 1st place solution.

☆11

Alternatives and similar repositories for autoKWS2021_1st_solution

Users interested in autoKWS2021_1st_solution are comparing it to the repositories listed below.

janson9192 / autokws2021
☆13 · Mar 25, 2021 · Updated 5 years ago
idiap / CNN_QbE_STD
Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"
☆32 · Sep 3, 2018 · Updated 7 years ago
VKW2021 / kaldi-baseline
kaldi cnn-tdnnf baseline
☆13 · Aug 31, 2021 · Updated 4 years ago
athena-team / athena-transform
☆21 · Jan 13, 2020 · Updated 6 years ago
roman-vygon / triplet_loss_kws
Learning Efficient Representations for Keyword Spotting with Triplet Loss
☆115 · Sep 14, 2022 · Updated 3 years ago
liuhao-lh / SMD
Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'
☆11 · Mar 22, 2023 · Updated 3 years ago
talhanai / kaldi-diar-latte
steps to perform text-based speaker diarization with kaldi toolkit
☆12 · Nov 2, 2018 · Updated 7 years ago
c4dm / dcase-few-shot-bioacoustic
☆61 · Jul 2, 2024 · Updated 2 years ago
gusrud1103 / LibriPhrase
Recipe for LibriPhrase
☆38 · Sep 2, 2023 · Updated 2 years ago
jtkim-kaist / end-point-detection
☆10 · Sep 19, 2018 · Updated 7 years ago
ishine / PnG-BERT
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
☆24 · Jan 29, 2022 · Updated 4 years ago
Slyne / ctc_decoder
A ctc decoder for both online and offline asr model
☆66 · Nov 18, 2023 · Updated 2 years ago
sonos / keyword-spotting-research-datasets
☆141 · Sep 23, 2020 · Updated 5 years ago
hwanyyy / preprocessing-of-speech
VAD + resampling | High resolution spectrogram
☆14 · Nov 29, 2022 · Updated 3 years ago
triplet02 / KoNPron
Convert Numerical Representations to Korean Pronunciation
☆14 · Apr 20, 2020 · Updated 6 years ago
PiSchool / spoken-language-id
Spoken Language Identification from Short Utterances
☆13 · Jul 6, 2022 · Updated 4 years ago
jongwook / crepe
☆12 · Jun 5, 2018 · Updated 8 years ago
xk-wang / MusicYOLO
MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.
☆11 · Jan 29, 2022 · Updated 4 years ago
iiscleap / NISP-Dataset
☆31 · Aug 9, 2022 · Updated 3 years ago
cadia-lvl / kaldi-speaker-diarization
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
☆17 · Aug 12, 2024 · Updated last year
kaihuhuang / Language-Group
☆11 · Dec 24, 2024 · Updated last year
DebabrataPal7 / DAFOSNET
Official Implementation of "Domain Adaptive Few-Shot Open-Set Learning" in IEEE/CVF International Conference on Computer Vision (ICCV'23)
☆18 · Dec 18, 2023 · Updated 2 years ago
janparkio / 3d-presentation-godotengine
An open source 3d slide presentation for the Godot Engine
☆11 · Aug 3, 2017 · Updated 8 years ago
Crystal-X-111 / python_project_to_so
Uses Cython to package all scripts of an entire Python project into a single .so and build it into a whl package, for Python project deployment and code encryption
☆14 · Jul 6, 2021 · Updated 5 years ago
yushuai / FTANet-melodic
This repository is the official implementation for the paper "Frequency-Temporal Attention Network for Singing Melody Extraction".
☆40 · Sep 16, 2022 · Updated 3 years ago
auspicious3000 / SpeechSplit-Demo
Unsupervised Speech Decomposition via Triple Information Bottleneck
☆14 · Apr 29, 2020 · Updated 6 years ago
lightjiang / AudioProcess
☆11 · Feb 3, 2018 · Updated 8 years ago
apoorvnandan / speech-recognition-primer
This repository contains code for a tutorial on end to end automatic speech recognition.
☆18 · Sep 10, 2019 · Updated 6 years ago
Arunprakaash / openvoice.streaming.server
FastAPI WebSocket server for the OpenVoice text-to-speech model.
☆12 · Jun 6, 2024 · Updated 2 years ago
MAS-KE / ICDM_2020_KGC
Consumer Event Cause Extraction Baseline Model
☆16 · Aug 3, 2020 · Updated 5 years ago
Confusezius / Characterizing_Generalization_in_DeepMetricLearning
Implementation and Benchmark Splits to study Out-of-Distribution Generalization in Deep Metric Learning.
☆25 · Oct 2, 2021 · Updated 4 years ago
Itachi6912110 / Hierarchical-Note-Segmentation
Realization for note segmentation by using hierarchical objective function
☆14 · Jun 26, 2019 · Updated 7 years ago
CODEJIN / PWGAN_for_HiFiSinger
☆11 · Mar 20, 2021 · Updated 5 years ago
vectominist / End-to-end-ASR-Pytorch-DLHLP
Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)
☆17 · Nov 22, 2020 · Updated 5 years ago
TaoRuijie / AVCleanse
ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'
☆44 · Oct 31, 2022 · Updated 3 years ago
dhfbk / Histo
☆15 · Jan 9, 2019 · Updated 7 years ago
fakufaku / auxiva-ipa
Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.
☆36 · Mar 22, 2021 · Updated 5 years ago
AI-confused / arxiv_auto_crawler
Automatic crawler for arXiv data
☆16 · Jan 24, 2022 · Updated 4 years ago
alaaNfissi / SigWavNet-Learning-Multiresolution-Signal-Wavelet-Network-for-Speech-Emotion-Recognition
This paper has been accepted for publication in IEEE Transactions on Affective Computing.
☆20 · Feb 27, 2025 · Updated last year