GrantCuster / gemini-spatial-exampleLinks
How to use bounding boxes with the Gemini API
☆106Updated last year
Alternatives and similar repositories for gemini-spatial-example
Users that are interested in gemini-spatial-example are comparing it to the libraries listed below
Sorting:
- Demo of AI chatbot that predicts user message to generate response quickly.☆104Updated last year
- Useful resources for LLM-based Diarization and Transcription.☆55Updated last year
- Like Claude Artifacts but lives in a single static HTML page which you can use with any language model of your choosing☆213Updated 9 months ago
- OpenAI's Realtime API minus the enterprise bloat☆48Updated last year
- ☆47Updated last year
- converts url content into JSON with a simple prefix☆71Updated last year
- auto fine tune of models with synthetic data☆76Updated last year
- A couple scripts to grab stats from email☆43Updated last year
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆137Updated 6 months ago
- Lightweight open-source perplexity☆62Updated last year
- ☆81Updated last year
- Chat with your git repo☆160Updated last year
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat☆59Updated last year
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated 2 years ago
- A framework for orchestrating AI agents using a mermaid graph☆77Updated last year
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆108Updated 9 months ago
- A simple wrapper for OpenAI to log input/outputs.☆106Updated 2 years ago
- Globot is an agent that controls your browser using playwright and GPT-4V.☆134Updated last year
- A framework to enable multimodal models to play games on a computer.☆97Updated last year
- Starter app for creating an AI task completion agent with gmail capabilities.☆27Updated last year
- ☆79Updated last year
- ☆42Updated last year
- Fluid Database☆113Updated last year
- Create keyboard shortcuts for an LLM using OpenAI GPT, Ollama, HuggingFace with Automator on macOS.☆152Updated last year
- ☆22Updated last year
- A simple demo application showcasing the power of Gemini 1.5 Pro's video understanding capabilities.☆29Updated last year
- An automated tool for discovering insights from research papaer corpora☆137Updated last year
- ☆107Updated 10 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆46Updated 5 months ago
- A spotify playlist agent using CrewAI☆82Updated last year