ggerganov / ggwave
Tiny data-over-sound library
☆7,356 · Updated 2 months ago
Alternatives and similar repositories for ggwave
Users who are interested in ggwave compare it to the libraries listed below.
- Two conversational AI agents switching from English to sound-level protocol after confirming they are both AI agents ☆4,731 · Updated 3 months ago
- Lightpanda: the headless browser designed for AI and automation ☆10,325 · Updated last week
- Serverless, peer-to-peer, local file sharing through sound ☆2,291 · Updated 4 years ago
- Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres… ☆7,104 · Updated 8 months ago
- A fast multimodal LLM for real-time voice ☆4,258 · Updated 2 months ago
- A fast, local neural text to speech system ☆10,241 · Updated 2 months ago
- Local realtime voice AI ☆2,378 · Updated 8 months ago
- A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri… ☆8,932 · Updated 4 months ago
- Port of OpenAI's Whisper model in C/C++ ☆44,532 · Updated last week
- The python library for real-time communication ☆4,403 · Updated 2 months ago
- Fast and accurate automatic speech recognition (ASR) for edge devices ☆2,968 · Updated this week
- A generalist Python node editor ☆2,690 · Updated this week
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector ☆7,385 · Updated last week
- SoTA open-source TTS ☆14,677 · Updated last month
- Distribute and run LLMs with a single file. ☆23,402 · Updated 2 weeks ago
- Document to Markdown OCR library with Llama 3.2 vision ☆2,416 · Updated 10 months ago
- Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM! ☆8,617 · Updated this week
- A python program that turns an LLM, running on Ollama, into an automated researcher, which will with a single query determine focus areas… ☆2,948 · Updated 11 months ago
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording ☆15,979 · Updated 2 months ago
- Lightweight coding agent that runs in your terminal ☆2,146 · Updated 6 months ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi… ☆9,101 · Updated this week
- Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with… ☆5,162 · Updated last month
- 🍏 + 🎯 + 🐍 = Query Apple's FindMy Network with Python! ☆2,580 · Updated 2 weeks ago
- build-once run-anywhere c library ☆20,200 · Updated this week
- A vector search SQLite extension that runs anywhere! ☆6,412 · Updated 9 months ago
- A python client + documentation for the Colmi R02 smart ring ☆564 · Updated 7 months ago
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your … ☆4,536 · Updated this week
- State-of-the-art TTS model under 25MB 😻 ☆9,099 · Updated 3 months ago
- Silero Models: pre-trained text-to-speech models made embarrassingly simple ☆5,569 · Updated this week
- Towards Human-Sounding Speech ☆5,729 · Updated 6 months ago