Auto-KWS 2021 Challenge 1st place solution.
☆11Jul 20, 2021Updated 4 years ago
Alternatives and similar repositories for autoKWS2021_1st_solution
Users that are interested in autoKWS2021_1st_solution are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Mar 25, 2021Updated 5 years ago
- Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"☆32Sep 3, 2018Updated 7 years ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- ☆21Jan 13, 2020Updated 6 years ago
- Learning Efficient Representations for Keyword Spotting with Triplet Loss☆113Sep 14, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆60Jul 2, 2024Updated last year
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Recipe for LibriPhrase☆35Sep 2, 2023Updated 2 years ago
- Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'☆12Mar 22, 2023Updated 3 years ago
- ☆10Sep 19, 2018Updated 7 years ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆24Jan 29, 2022Updated 4 years ago
- ☆135Sep 23, 2020Updated 5 years ago
- ☆11Dec 24, 2024Updated last year
- A ctc decoder for both online and offline asr model☆66Nov 18, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- VAD + resampling | High resolution spectrogram☆14Nov 29, 2022Updated 3 years ago
- Convert Numerical Representations to Korean Pronunciation☆14Apr 20, 2020Updated 5 years ago
- ☆12Jun 5, 2018Updated 7 years ago
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.☆11Jan 29, 2022Updated 4 years ago
- ☆31Aug 9, 2022Updated 3 years ago
- Official Implementation of "Domain Adaptive Few-Shot Open-Set Learning" in IEEE/CVF International Conference on Computer Vision (ICCV'23)☆16Dec 18, 2023Updated 2 years ago
- This repository creates speaker diarization recipes to be used within the egs folder of kaldi.☆17Aug 12, 2024Updated last year
- An open source 3d slide presentation for the Godot Engine☆11Aug 3, 2017Updated 8 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 利用cython将整个python工程所有脚本打包成一个so并编译成whl包,用于python工程部署和代码加密☆14Jul 6, 2021Updated 4 years ago
- Unsupervised Speech Decomposition via Triple Information Bottleneck☆14Apr 29, 2020Updated 5 years ago
- ☆11Feb 3, 2018Updated 8 years ago
- This repository contains code for a tutorial on end to end automatic speech recognition.☆17Sep 10, 2019Updated 6 years ago
- FastAPI WebSocket server for the OpenVoice text-to-speech model.☆12Jun 6, 2024Updated last year
- Realization for note segmentation by using hierarchical objective function☆14Jun 26, 2019Updated 6 years ago
- auto scrawl for arrive data☆16Jan 24, 2022Updated 4 years ago
- Utility to test how network losses affects speech quality in VoIP-based applications☆24Jul 18, 2013Updated 12 years ago
- Consumer Event Cause Extraction Baseline Model☆16Aug 3, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Implementation and Benchmark Splits to study Out-of-Distribution Generalization in Deep Metric Learning.☆25Oct 2, 2021Updated 4 years ago
- This repository is the offical implementation for the paper 《Frequency-Temporal Attention Network for Singing Melody Extraction》.☆40Sep 16, 2022Updated 3 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.☆35Mar 22, 2021Updated 5 years ago
- PyQt5实现的软键盘☆12Aug 18, 2020Updated 5 years ago
- ☆15Jan 9, 2019Updated 7 years ago
- A comfortable way to describe parameter interface and generate its underlying data structure☆14Oct 8, 2023Updated 2 years ago