icynic/desktop-live-caption

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/icynic/desktop-live-caption)

icynic / desktop-live-caption

Transcribe desktop audio/computer audio in real-time and locally (Streaming ASR), using TorchAudio and Emformer-RNNT model for inference, PyAudio for reading stream, Tkinter for GUI.

☆14

Alternatives and similar repositories for desktop-live-caption

Users that are interested in desktop-live-caption are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aintp3d0 / myreadingmanga.info
View on GitHub
Download images and convert it to pdf (NSFW: A+)
☆14Mar 29, 2025Updated last year
ktsstudio / webinar-tgbot
View on GitHub
☆12Feb 15, 2022Updated 4 years ago
Tyler-KD / multi-model-AI-assistant-medical-bot
View on GitHub
RAG-enabled multi-agentic system for medical diagnosis and assistance
☆21Mar 8, 2026Updated 4 months ago
mydev-history / chatbot-editor
View on GitHub
☆18Oct 9, 2025Updated 9 months ago
kittysoftpaw0510 / Trading-with-AI-Agent
View on GitHub
☆17Apr 1, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SELMA-project / ml4audio
View on GitHub
audio, NLP, ML with huggingface, nvidia/nemo, speechbrain
☆11Sep 4, 2023Updated 2 years ago
cohen1280 / PrivateGPT
View on GitHub
☆17Aug 25, 2023Updated 2 years ago
lovelysystems / robotframework-imaplibrary
View on GitHub
Mail Library for Robot Framework
☆29Dec 21, 2015Updated 10 years ago
motahhir / MPPT-in-Proteus
View on GitHub
Data’ of “Development of a low-cost PV system using an improved INC algorithm and a PV panel Proteus model” research paper
☆16May 9, 2023Updated 3 years ago
JocysCom / SVNNotifier
View on GitHub
Notifies you about other people's commits to Subversion repositories
☆10Aug 31, 2019Updated 6 years ago
EnricoCecchini / Narrator-AI
View on GitHub
Svelte app to generate audiobooks using XTTS
☆12Feb 13, 2024Updated 2 years ago
Arunprakaash / openvoice.streaming.server
View on GitHub
FastAPI WebSocket server for the OpenVoice text-to-speech model.
☆12Jun 6, 2024Updated 2 years ago
JacobLinCool / whisper-cli
View on GitHub
A CLI speech recognition tool, using OpenAI Whisper, supports audio file transcription and near-realtime microphone input.
☆22Updated this week
wtdcode / closure2fp
View on GitHub
An example to illustrate how libffi cast a closure to a pointer to function.
☆16Jul 30, 2021Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
huangheyellowriver / ICAdemo
View on GitHub
☆10May 22, 2022Updated 4 years ago
CesiumBen / professional-game-development-cpp-unreal
View on GitHub
Tom Looman course files about professional game development in C++ and Unreal Engine
☆14Aug 12, 2022Updated 3 years ago
Jordain / Comfy_Image_Workshop
View on GitHub
A scalable solution that simplifies the integration of ComfyUI for developers
☆11Jul 15, 2024Updated 2 years ago
1-max-1 / WASMSerialTerminal
View on GitHub
A tool for quick analysis of data from a serial device.
☆15Nov 25, 2024Updated last year
CitizenOneX / frame_transcribe_googlespeech
View on GitHub
App for Brilliant Labs Frame to transcribe audio in real-time through the Frame microphone using the Google Cloud Speech API
☆13Dec 21, 2024Updated last year
riba2534 / bencode
View on GitHub
`.torrent`文件解析器
☆11Mar 27, 2021Updated 5 years ago
satiseason / Chatbot-with-text-voice-chatting
View on GitHub
Telegram bot is developed by AI techniques(Speech-to-Text, Text-to-Speech, Voice-cloning, AI-avatar-geneartor) and telegram bot developin…
☆16May 14, 2025Updated last year
DYYYYYYYF / DimensionEngine
View on GitHub
3D-Rendering of Vulkan
☆14Jul 10, 2026Updated last week
nnao45 / dntk
View on GitHub
🧮 Command line's multi-platform interactive calculator, with bc-compatible syntax and high-precision arithmetic.
☆13Oct 19, 2025Updated 9 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
JakeWharton / prerelease-testing
View on GitHub
Automatically test projects against the latest versions of Kotlin, Gradle, and each other
☆28Updated this week
Intersection98 / ComfyUI_MX_post_processing-nodes
View on GitHub
☆13May 23, 2024Updated 2 years ago
AlessioMichelassi / openPyVision_013
View on GitHub
Welcome to my project. OpenPyVision is a real time videoMixer based on opencv and pyqt6.
☆14Aug 22, 2024Updated last year
mkdev-me / voice-to-gpt-with-api
View on GitHub
voice recorded with whisper library to ask GPT API what you need and will speak to you with whisper API
☆17May 6, 2023Updated 3 years ago
tigert1998 / tair-last-jedi
View on GitHub
阿里云第二届数据库大赛新手门槛队（季军）解决方案
☆10Apr 19, 2021Updated 5 years ago
qqgeogor / Kuaishou-benchmark
View on GitHub
☆10May 28, 2018Updated 8 years ago
VitalPBX / vitalpbx_agent_ai
View on GitHub
VitalPBX - AI Agent with OpenAI ChatGPT, Whisper and Microsoft Azure AI Speech (TTS)
☆20Jan 24, 2024Updated 2 years ago
UBC-NLP / octopus
View on GitHub
Octopus is a neural machine generation toolkit for Arabic Natural Lnagauge Generation (NLG)
☆10Apr 29, 2024Updated 2 years ago
AstroWYH / Cpp-Basic-Notes
View on GitHub
C++各类基础知识整理--Astro WANG
☆16Aug 28, 2025Updated 10 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ELATTAR-Ayoub / YukiChat---Talking-ChatGPT
View on GitHub
YukiChat is a web application that allows users to have a natural, oral conversation with OpenAI's GPT language model using text-to-speec…
☆15Oct 9, 2023Updated 2 years ago
DaVinci-Code-ai / facemind
View on GitHub
☆18Jul 15, 2025Updated last year
scpp / gfft
View on GitHub
Generative Fast Fourier Transforms in C++ using template metaprogramming
☆10Jun 16, 2016Updated 10 years ago
rockerBOO / sd-tokenizer
View on GitHub
View the tokenisation of your words using the tokeniser for a Stable Diffusion model.
☆15Jun 9, 2026Updated last month
taytaybear / cascadia
View on GitHub
☆20Jan 14, 2025Updated last year
shadyabh / UDPM
View on GitHub
☆24Jan 24, 2026Updated 5 months ago
heraclex12 / vietpunc
View on GitHub
Vietnamese Punctuation Prediction using Pretrained Language Models
☆14May 8, 2022Updated 4 years ago