akdeb / ElatoAILinks
Realtime AI Voice Agents with SoTA Multimodal AI models on Arduino ESP32 with >15 minutes uninterrupted conversations globally for AI toys, AI companions, AI devices and more
☆1,344Updated this week
Alternatives and similar repositories for ElatoAI
Users that are interested in ElatoAI are comparing it to the libraries listed below
Sorting:
- Local Video-LLM powered AI Baby Monitor☆465Updated 8 months ago
- Instructions on how to use the Realtime API on Microcontrollers and Embedded Platforms☆1,578Updated 10 months ago
- ☆887Updated 8 months ago
- 📚 discover story relationships☆347Updated 7 months ago
- A conversational, AI device + software framework for companionship, entertainment, education, healthcare, IoT applications, and DIY robot…☆542Updated 11 months ago
- A personalized language-learning tool that combines Duolingo-style lessons with your own curated vocabulary lists. Seamlessly add words …☆1,928Updated 5 months ago
- Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, …☆504Updated 2 weeks ago
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)☆681Updated 8 months ago
- Raspberry Pi Voice Assistant☆811Updated last year
- Ultra-lightweight AI Agent☆424Updated 5 months ago
- With one command, create a natural-sounding audiobook from a variety of input formats (epub, mobi, txt, PDF, HTML and more!)☆719Updated 2 months ago
- Browser automation system that uses AI-driven planning to navigate web pages and perform goals.☆852Updated 2 months ago
- Self-hosted voice chat with LLMs☆461Updated 10 months ago
- A self-hosted API that takes a URL and returns a file with browser screenshots.☆1,054Updated 10 months ago
- Fully open-source command-line AI assistant inspired by OpenAI Codex, supporting local language models.☆665Updated 6 months ago
- Open-source framework for developing real-time multimodal conversational AI agents.☆583Updated this week
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.☆541Updated 2 months ago
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web☆2,333Updated 7 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆886Updated last month
- Examples for Cerebrium Serverless GPUs☆515Updated 3 weeks ago
- ⚡A developer-oriented library of sleek, bubble-shaped skill icons designed for GitHub READMEs, portfolios, and resumes.☆513Updated 3 weeks ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆783Updated last year
- ☆106Updated last year
- High-accuracy PDF-to-Markdown OCR API using LLMs with vision capabilities. Features parallel processing, batching, and auto-retry logic f…☆879Updated last month
- MCP server for fetch web page content using Playwright headless browser.☆968Updated last week
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,…☆414Updated 5 months ago
- OpenCV+YOLO+LLAVA powered video surveillance system☆782Updated 3 months ago
- Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100% open source☆321Updated 7 months ago
- A cache for AI agents to learn and replay complex behaviors.☆756Updated 7 months ago
- A MCP server implementation for hyperbrowser☆714Updated 2 months ago