Demo Python script for interacting with a llama.cpp server using the Whisper API, a microphone, and a webcam.
☆47 · Nov 6, 2023 · Updated 2 years ago
Alternatives and similar repositories for vision-core-ai
Users that are interested in vision-core-ai are comparing it to the libraries listed below.
- llama.cpp with BakLLaVA model describes what it sees ☆379 · Nov 8, 2023 · Updated 2 years ago
- An experiment in trying out whisper.cpp for real-time speech-to-text ☆20 · Dec 25, 2022 · Updated 3 years ago
- Alpaca Core local daemon ☆24 · May 27, 2025 · Updated last year
- A simple "Be My Eyes" web app with a llama.cpp/llava backend ☆495 · Nov 28, 2023 · Updated 2 years ago
- ☆24 · Mar 13, 2020 · Updated 6 years ago
- The application performs real-time inference on audio from an ALSA capture device ☆39 · Jun 19, 2025 · Updated 11 months ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX ☆24 · Oct 30, 2024 · Updated last year
- The Codec 2 speech codec, compiled to WASM using Emscripten. ☆13 · Apr 27, 2023 · Updated 3 years ago
- GRDN.AI app for garden optimization ☆69 · Nov 21, 2025 · Updated 6 months ago
- YT2Brief: Transcribe and summarize YouTube videos using Langchain with the power of LLMs. ☆11 · Dec 21, 2023 · Updated 2 years ago
- ☆13 · Mar 10, 2025 · Updated last year
- Downsampling array of intervals ☆26 · Dec 11, 2019 · Updated 6 years ago
- Reinforcement Learning example in Nim, playing tic tac toe. Based on the original C version by the great Antirez ☆15 · Apr 2, 2025 · Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆14 · Nov 15, 2025 · Updated 6 months ago
- Official Repository of UltraVoice ☆62 · Oct 28, 2025 · Updated 7 months ago
- axseem's Linux Workstation Configuration | Mirror of https://codeberg.org/axseem/dots ☆20 · May 16, 2026 · Updated 3 weeks ago
- Drax: Speech Recognition with Discrete Flow Matching ☆75 · Oct 15, 2025 · Updated 7 months ago
- Decrypt multicast Verimatrix streams ☆13 · Apr 21, 2022 · Updated 4 years ago
- StrongSort-Pip: Packaged version of StrongSort ☆10 · Sep 3, 2022 · Updated 3 years ago
- ☆11 · May 27, 2023 · Updated 3 years ago
- A library of speech gadgets. ☆14 · Oct 15, 2022 · Updated 3 years ago
- Agent Skills for Meta Quest/Horizon OS VR Development ☆80 · May 19, 2026 · Updated 3 weeks ago
- ☆18 · Apr 10, 2023 · Updated 3 years ago
- Personal voice assistant, with voice interruption and Twilio support ☆18 · Feb 24, 2025 · Updated last year
- Using PDFPlumber for PDF data extraction ☆13 · May 31, 2017 · Updated 9 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Mar 30, 2021 · Updated 5 years ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml ☆313 · Apr 11, 2024 · Updated 2 years ago
- Simple example of autonomous research run in parallel from my Aetherius Ai Assistant project. Uses OpenAI's GPT-3.5, GPT-4, and Microsof… ☆15 · May 11, 2023 · Updated 3 years ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation ☆862 · Nov 16, 2024 · Updated last year
- ☆13 · Aug 24, 2023 · Updated 2 years ago
- Using OpenAI's Whisper via whisper.cpp with SFML ☆14 · Dec 2, 2025 · Updated 6 months ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset. ☆13 · Aug 15, 2021 · Updated 4 years ago
- An Android VoIP application using native SIP API & ConnectionService (CallKit in iOS) API ☆10 · Mar 13, 2020 · Updated 6 years ago
- LLM-based code completion engine ☆194 · Jan 23, 2025 · Updated last year
- HuggingFace hosted inference models plugin for Auto-GPT ☆20 · May 4, 2023 · Updated 3 years ago
- Text Classification Dataset for Turkish Language ☆10 · Nov 16, 2021 · Updated 4 years ago
- Web App to transcribe memos using Whisper AI. ☆18 · Oct 23, 2022 · Updated 3 years ago
- A Kotlin Multiplatform Project utilizing ggwave, a data-over-sound library. ☆20 · Nov 23, 2024 · Updated last year
- Anything Model Batch Downloader lets you batch download models from Civitai and Hugging Face using just the model URL. ☆14 · Mar 19, 2023 · Updated 3 years ago