akdeb / ElatoAILinks
Realtime AI speech with OpenAI Realtime API and Gemini Live API on Arduino ESP32 with Secure Websockets and Deno edge functions with >15 minutes uninterrupted conversations globally for AI toys, AI companions, AI devices and more
โ1,044Updated 2 weeks ago
Alternatives and similar repositories for ElatoAI
Users that are interested in ElatoAI are comparing it to the libraries listed below
Sorting:
- ๐ discover story relationshipsโ335Updated this week
- โ845Updated last month
- Instructions on how to use the Realtime API on Microcontrollers and Embedded Platformsโ1,559Updated 3 months ago
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)โ653Updated last month
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps ๐ฃ๏ธ๐ฏโ852Updated 3 months ago
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the webโ2,284Updated 2 weeks ago
- Computer use SDK for building agents that learn from human screen recordings. Cross-platform (Windows/macOS/Linux), deterministic, and reโฆโ664Updated this week
- Fully open-source command-line AI assistant inspired by OpenAI Codex, supporting local language models.โ565Updated last month
- A self-hosted API that takes a URL and returns a file with browser screenshots.โ977Updated 3 months ago
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing anโฆโ858Updated 9 months ago
- Have a natural, spoken conversation with AI!โ2,609Updated last week
- A powerful document AI question-answering tool that connects to your local Ollama models. Create, manage, and interact with RAG systems fโฆโ1,051Updated last month
- Local Video-LLM powered AI Baby Monitorโ377Updated last month
- VSCode extension that demonstrates the use of large language models (LLMs) for active debugging of programsโ344Updated 4 months ago
- โ579Updated 2 weeks ago
- Self-hosted voice chat with LLMsโ432Updated 3 months ago
- With one command, create a natural-sounding audiobook from a variety of input formats (epub, mobi, txt, PDF, HTML and more!)โ659Updated 3 months ago
- A personalized language-learning tool that combines Duolingo-style lessons with your own curated vocabulary lists. Seamlessly add words โฆโ635Updated 4 months ago
- A conversational, AI device + software framework for companionship, entertainment, education, healthcare, IoT applications, and DIY robotโฆโ518Updated 4 months ago
- Examples for Cerebrium Serverless GPUsโ492Updated last week
- A cache for AI agents to learn and replay complex behaviors.โ670Updated last week
- first base model for full-duplex conversational audioโ1,749Updated 5 months ago
- MCP server for fetch web page content using Playwright headless browser.โ738Updated last week
- Browser automation system that uses AI-driven planning to navigate web pages and perform goals.โ774Updated 5 months ago
- Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, โฆโ463Updated this week
- directory for Awesome MCP Serversโ1,757Updated 3 months ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,โฆโ381Updated 3 weeks ago
- ๐ Modern open-source fitness coaching platform. Create workout plans, track progress, and access a comprehensive exercise database.โ3,170Updated this week
- Send Morse code via โฎ๏ธ โธ๏ธ โฏ๏ธโ412Updated 7 months ago
- Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100% open sourceโ183Updated 2 weeks ago