Nyan-SouthKorea / RealTime_STT_with_WhisperView external linksLinks
Real Time STT model with GPU by Whisper and VAD(Voice Activity Detector) model
☆15Jul 15, 2024Updated last year
Alternatives and similar repositories for RealTime_STT_with_Whisper
Users that are interested in RealTime_STT_with_Whisper are comparing it to the libraries listed below
Sorting:
- Ollama 기반의 int4 gguf 형식 sLLM을 multi-turn 형태로 대화할 수 있는 통합 모듈☆14Jul 25, 2024Updated last year
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated last year
- Best Cities To Live☆13Jan 26, 2023Updated 3 years ago
- Instance-Dependent Noisy Label Learning via Graphical Modelling (WACV 2023 Round 1)☆13Jul 30, 2023Updated 2 years ago
- ☆11Jul 3, 2023Updated 2 years ago
- Ubuntu Setup for Research in Computer Vision and Robotics☆11Aug 11, 2021Updated 4 years ago
- DICOM 공부 내용 정리☆10Mar 20, 2019Updated 6 years ago
- INFINEL: An efficient GPU-based processing method for unpredictable large output graph queries [PPoPP'24]☆10Jan 15, 2024Updated 2 years ago
- Deep Semi-Supervised Learning with Holistic methods for audio classification.☆11Dec 14, 2024Updated last year
- Go HTTP Middleware with dynamic CSP nonce and much more☆16Aug 28, 2018Updated 7 years ago
- mitum is general purpose blockchain factory.☆12Jul 19, 2023Updated 2 years ago
- ☆12Oct 19, 2025Updated 3 months ago
- attention으로 시계열 예측은 할 수 없을까☆10Apr 30, 2021Updated 4 years ago
- Hanyang University 2019 OSS Class (1) Github Page☆10Dec 23, 2019Updated 6 years ago
- ☆13Aug 10, 2024Updated last year
- A demo application that combines a human face and sunglasses images in real-time. This demo allows for yaw angle rotation.☆10Jun 21, 2022Updated 3 years ago
- C# .Net project to create RTSP live stream with Jsmpeg☆11Nov 15, 2017Updated 8 years ago
- ☆21Sep 27, 2024Updated last year
- TensorFlow implementation of Disentangled Generative Model (DGM) with MNIST dataset.☆12Nov 24, 2020Updated 5 years ago
- A proxy server enabling access to Groq API within Cursor IDE☆11Feb 27, 2024Updated last year
- Top 3 solution for CVPR24 SEGMENT ANYTHING IN MEDICAL IMAGES ON LAPTOP Challenge☆10Apr 8, 2025Updated 10 months ago
- discord.js로 만든 디스코드 음악봇입니다.☆15Jun 6, 2024Updated last year
- 한국 5개 지역의 사투리로 말하는 인공지능. Tacotron2 기반.☆13Sep 14, 2022Updated 3 years ago
- Detection of car numbers and their recognition☆11Oct 11, 2022Updated 3 years ago
- Data & Code for FEDD published @ MICCAI 23☆12Oct 11, 2023Updated 2 years ago
- Pixel VQ-VAEs for Improved Pixel Art Representation☆17Feb 11, 2023Updated 3 years ago
- Implementation of semi-supervised learning using PyTorch Lightning☆14Jul 25, 2024Updated last year
- Fast model deployment on Google Cloud Run☆16Feb 25, 2024Updated last year
- A scope non-breaking foreach loop for arrays, lists, maps, structs, grids, strings and number ranges☆14Dec 31, 2025Updated last month
- block tor users to access your website.☆16Jun 2, 2016Updated 9 years ago
- Official repository for "C-DARL: Contrastive diffusion adversarial representation learning for label-free blood vessel segmentation"☆19Nov 15, 2023Updated 2 years ago
- ☆14Dec 14, 2021Updated 4 years ago
- ☆13Jun 16, 2024Updated last year
- ☆17Jul 11, 2023Updated 2 years ago
- Praat-based tools for EGG analysis☆18Sep 21, 2023Updated 2 years ago
- Official implementation of DiffMix (MICCAI 2023)☆16Nov 27, 2024Updated last year
- Duke: Voice Controlled Robot Dog. A for fun project that I completed while recovering from knee surgery.☆21Oct 17, 2019Updated 6 years ago
- Multi-speaker Tacotron in TensorFlow.☆17Mar 9, 2022Updated 3 years ago
- 칸코레 뷰어 한글버전☆10Sep 6, 2019Updated 6 years ago