GrantCuster / gemini-spatial-example
How to use bounding boxes with the Gemini API
☆102 Updated 9 months ago
Alternatives and similar repositories for gemini-spatial-example:
Users who are interested in gemini-spatial-example are comparing it to the repositories listed below
- ☆21 Updated 9 months ago
- Useful resources for LLM-based Diarization and Transcription. ☆55 Updated 5 months ago
- Demo of AI chatbot that predicts user message to generate response quickly. ☆101 Updated last year
- OpenAI's Realtime API minus the enterprise bloat ☆44 Updated 4 months ago
- Converts URL content into JSON with a simple prefix ☆67 Updated 10 months ago
- Automatic fine-tuning of models with synthetic data ☆75 Updated last year
- Chat with your git repo ☆155 Updated last year
- ☆47 Updated 11 months ago
- The next evolution of Agents ☆48 Updated 2 weeks ago
- Transcribe and summarize videos using Whisper and LLMs on Apple's MLX framework ☆73 Updated last year
- An open-source Discord bot, created using LlamaIndex, that listens to your server conversations, continuously learns from them & answe… ☆75 Updated last year
- Automates the process of prompt engineering using Anthropic's Claude language model. ☆64 Updated last year
- A simple demo application showcasing the power of Gemini 1.5 Pro's video understanding capabilities. ☆29 Updated 10 months ago
- Build your Swarm of Internet Agents using MultiOn 🚀 ☆78 Updated last year
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php. ☆126 Updated 2 weeks ago
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit. ☆123 Updated 6 months ago
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd… ☆97 Updated 2 months ago
- ☆77 Updated last year
- A couple scripts to grab stats from email ☆42 Updated 6 months ago
- Fluid Database ☆114 Updated 6 months ago
- An AI-powered Snake game where Claude, an advanced language model, controls the serpent in real-time, showcasing intelligent decision-mak… ☆44 Updated 4 months ago
- Get a markdown version of any webpage with a keyboard shortcut. ☆61 Updated last month
- A Spotify playlist agent using CrewAI ☆81 Updated 10 months ago
- A memory manager essential for evolving AI to be more human-like, enabling dynamic, context-aware responses through structured memory han… ☆28 Updated 11 months ago
- An automated tool for discovering insights from research paper corpora ☆137 Updated 9 months ago
- For LLMs to better code with Jina API ☆139 Updated 2 weeks ago
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat ☆56 Updated 6 months ago
- Starter app for creating an AI task completion agent with Gmail capabilities. ☆27 Updated 9 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r… ☆60 Updated 8 months ago
- Like Claude Artifacts but lives in a single static HTML page which you can use with any language model of your choosing ☆204 Updated last month