3rd Grand Challenge track 3 DB developed by GIST
☆35Apr 9, 2021Updated 5 years ago
Alternatives and similar repositories for GC_track3_DB_GIST
Users that are interested in GC_track3_DB_GIST are comparing it to the libraries listed below.
- Problem Generator for Math Word Prediction☆16Nov 28, 2021Updated 4 years ago
- Grand Challenge 4 track 2 sourcecode developed by GIST☆13Mar 24, 2021Updated 5 years ago
- ☆20Mar 25, 2026Updated last month
- Deep learning based autism spectrum disorder detection from children's voices☆42Nov 5, 2025Updated 6 months ago
- 2020 AI Grand Challenge (3rd track) - public sample☆16Jan 20, 2021Updated 5 years ago
- Baseline of DCASE 2020 task 4☆44Oct 24, 2022Updated 3 years ago
- RASTA-PLP and MFCC tool based rasta-mat☆33Jul 6, 2022Updated 3 years ago
- Speech enhancement (Interspeech 2016, Ideal)☆19Jun 25, 2022Updated 3 years ago
- A package for crawling audio from YouTube☆42Aug 8, 2023Updated 2 years ago
- Music segmentation by ordinal linear discriminant analysis☆18Nov 10, 2017Updated 8 years ago
- Neural network train horn detection☆12Sep 21, 2020Updated 5 years ago
- ☆20Dec 20, 2017Updated 8 years ago
- ☆11Sep 4, 2023Updated 2 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆14Feb 5, 2025Updated last year
- ☆10Aug 29, 2024Updated last year
- ☆16Feb 19, 2026Updated 2 months ago
- ☆13Oct 25, 2024Updated last year
- The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…☆11Aug 27, 2023Updated 2 years ago
- Instructions for reproducing the research described in the paper "Tempo Estimation for Music Loops and a Simple Confidence Measure"☆14Nov 18, 2016Updated 9 years ago
- 2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning…☆24Aug 3, 2023Updated 2 years ago
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆18Jul 23, 2024Updated last year
- Accurate Box Proposal Network for Scene Text Detection☆30Feb 23, 2022Updated 4 years ago
- OCR DB including Korean☆27Nov 11, 2021Updated 4 years ago
- zero_shot_gradtts☆14Oct 23, 2023Updated 2 years ago
- A high performance, multithreaded WebSocket server written in Python.☆24Sep 30, 2020Updated 5 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated 11 months ago
- Code for paper "Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition"☆20May 24, 2023Updated 2 years ago
- Demo audio of VARA-TTS model☆20Jun 11, 2021Updated 4 years ago
- ☆13Sep 25, 2018Updated 7 years ago
- Error correction back-end for speaker diarization☆18Sep 26, 2023Updated 2 years ago
- The products,docs,resources,software,tools,scripts,models,demos☆16Jun 1, 2020Updated 5 years ago
- A python script to convert a mono/stereo audio track to an ambisonics track (not properly) by phase and pan.☆31Jun 8, 2019Updated 6 years ago
- ☆18Sep 19, 2023Updated 2 years ago
- Stream your webcam to multiple clients (e.g., VLC) at the same time☆12Dec 5, 2019Updated 6 years ago
- Track an object across a CCTV Network with non-overlapping camera views.☆81Apr 10, 2021Updated 5 years ago
- Sound Source Localization for PCM-A10 Microphone☆24Jan 16, 2023Updated 3 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- Tacotron, Korean, Wavenet-Vocoder, Korean TTS☆174Dec 26, 2022Updated 3 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆140Jan 6, 2025Updated last year