Transcribe desktop audio/computer audio in real-time and locally (Streaming ASR), using TorchAudio and Emformer-RNNT model for inference, PyAudio for reading stream, Tkinter for GUI.
☆14May 7, 2024Updated last year
Alternatives and similar repositories for desktop-live-caption
Users that are interested in desktop-live-caption are comparing it to the libraries listed below
Sorting:
- Download images and convert it to pdf (NSFW: A+)☆14Mar 29, 2025Updated 11 months ago
- ☆12Feb 15, 2022Updated 4 years ago
- Automatically test projects against the latest versions of Kotlin, Gradle, and each other☆26Updated this week
- ☆18Jun 13, 2025Updated 9 months ago
- ☆18Oct 9, 2025Updated 5 months ago
- Mail Library for Robot Framework☆29Dec 21, 2015Updated 10 years ago
- Notifies you about other people's commits to Subversion repositories☆10Aug 31, 2019Updated 6 years ago
- Svelte app to generate audiobooks using XTTS☆12Feb 13, 2024Updated 2 years ago
- FastAPI WebSocket server for the OpenVoice text-to-speech model.☆12Jun 6, 2024Updated last year
- ☆17Aug 25, 2023Updated 2 years ago
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain☆11Sep 4, 2023Updated 2 years ago
- A scalable solution that simplifies the integration of ComfyUI for developers☆11Jul 15, 2024Updated last year
- A CLI speech recognition tool, using OpenAI Whisper, supports audio file transcription and near-realtime microphone input.☆22Mar 14, 2026Updated last week
- An example to illustrate how libffi cast a closure to a pointer to function.☆16Jul 30, 2021Updated 4 years ago
- ☆10May 22, 2022Updated 3 years ago
- Tom Looman course files about professional game development in C++ and Unreal Engine☆14Aug 12, 2022Updated 3 years ago
- ☆13May 23, 2024Updated last year
- Data’ of “Development of a low-cost PV system using an improved INC algorithm and a PV panel Proteus model” research paper☆15May 9, 2023Updated 2 years ago
- ☆19Aug 25, 2024Updated last year
- Welcome to my project. OpenPyVision is a real time videoMixer based on opencv and pyqt6.☆14Aug 22, 2024Updated last year
- A tool for quick analysis of data from a serial device.☆15Nov 25, 2024Updated last year
- `.torrent`文件解析器☆11Mar 27, 2021Updated 4 years ago
- state-of-the-art models for diacritics restoration for Arabic language☆17Feb 23, 2025Updated last year
- 🧮 Command line's multi-platform interactive calculator, with bc-compatible syntax and high-precision arithmetic.☆13Oct 19, 2025Updated 5 months ago
- 阿里云第二届数据库大赛新手门槛队(季军)解决方案☆10Apr 19, 2021Updated 4 years ago
- Vietnamese Punctuation Prediction using Pretrained Language Models☆14May 8, 2022Updated 3 years ago
- voice recorded with whisper library to ask GPT API what you need and will speak to you with whisper API☆17May 6, 2023Updated 2 years ago
- ☆10May 28, 2018Updated 7 years ago
- Fast Punctuation Restoration using Transformer Models for Vietnamese☆11Jun 10, 2022Updated 3 years ago
- Generative Fast Fourier Transforms in C++ using template metaprogramming☆10Jun 16, 2016Updated 9 years ago
- C++各类基础知识整理--Astro WANG☆15Aug 28, 2025Updated 6 months ago
- VitalPBX - AI Agent with OpenAI ChatGPT, Whisper and Microsoft Azure AI Speech (TTS)☆20Jan 24, 2024Updated 2 years ago
- YukiChat is a web application that allows users to have a natural, oral conversation with OpenAI's GPT language model using text-to-speec…☆15Oct 9, 2023Updated 2 years ago
- ☆23Jan 24, 2026Updated last month
- ☆12May 21, 2020Updated 5 years ago
- Python script for creating 3x3 matrix DCTLs using source and target ColorChecker images☆19Jul 26, 2024Updated last year
- App for Brilliant Labs Frame to transcribe audio in real-time through the Frame microphone using the Google Cloud Speech API☆13Dec 21, 2024Updated last year
- Use this library to connect your iOS, WatchOS, or MacOS app to the Vuzix Z100™ smart glasses.☆14Mar 18, 2025Updated last year
- An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features incl…☆21May 24, 2024Updated last year