GrantCuster / gemini-spatial-exampleLinks
How to use bounding boxes with the Gemini API
☆104Updated last year
Alternatives and similar repositories for gemini-spatial-example
Users that are interested in gemini-spatial-example are comparing it to the libraries listed below
Sorting:
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 9 months ago
- Demo of AI chatbot that predicts user message to generate response quickly.☆104Updated last year
- OpenAI's Realtime API minus the enterprise bloat☆46Updated 8 months ago
- ☆47Updated last year
- converts url content into JSON with a simple prefix☆70Updated last year
- auto fine tune of models with synthetic data☆76Updated last year
- Build Web Datasets with Ease☆33Updated last year
- Like Claude Artifacts but lives in a single static HTML page which you can use with any language model of your choosing☆209Updated 5 months ago
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆131Updated 2 months ago
- A couple scripts to grab stats from email☆43Updated 10 months ago
- Fluid Database☆114Updated 10 months ago
- Create keyboard shortcuts for an LLM using OpenAI GPT, Ollama, HuggingFace with Automator on macOS.☆152Updated last year
- ☆42Updated last year
- ☆77Updated 7 months ago
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆109Updated 5 months ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆63Updated last year
- An automated tool for discovering insights from research papaer corpora☆138Updated last year
- Globot is an agent that controls your browser using playwright and GPT-4V.☆134Updated last year
- ☆80Updated last year
- ☆108Updated 6 months ago
- Opensource chat app that uses Exa's API for web search and OpenAI o3-mini☆45Updated 2 months ago
- ☆4Updated 11 months ago
- Record voice notes & transcribe, summarize, and get tasks☆43Updated last year
- Personal memory for AI☆58Updated last year
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more☆293Updated last year
- ActBot is a prototype for an injectable chatbot to give any website agentic capabilities☆58Updated last year
- A framework for orchestrating AI agents using a mermaid graph☆77Updated last year
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated last year
- The next evolution of Agents☆48Updated 2 weeks ago
- ☆172Updated 11 months ago