GrantCuster / gemini-spatial-example
How to use bounding boxes with the Gemini API
☆102Updated 10 months ago
Alternatives and similar repositories for gemini-spatial-example
Users that are interested in gemini-spatial-example are comparing it to the libraries listed below
Sorting:
- Demo of AI chatbot that predicts user message to generate response quickly.☆102Updated last year
- auto fine tune of models with synthetic data☆75Updated last year
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 7 months ago
- OpenAI's Realtime API minus the enterprise bloat☆46Updated 5 months ago
- ☆47Updated last year
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat☆56Updated 7 months ago
- converts url content into JSON with a simple prefix☆68Updated last year
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆133Updated 8 months ago
- A couple scripts to grab stats from email☆42Updated 8 months ago
- ☆21Updated 11 months ago
- A simple wrapper for OpenAI to log input/outputs.☆104Updated last year
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆109Updated 2 months ago
- A spotify playlist agent using CrewAI☆81Updated 11 months ago
- Get a markdown version of any webpage with a keyboard shortcut.☆64Updated 3 months ago
- ☆75Updated 5 months ago
- Like Claude Artifacts but lives in a single static HTML page which you can use with any language model of your choosing☆206Updated 2 months ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated last year
- ☆28Updated 5 months ago
- The next evolution of Agents☆48Updated 3 weeks ago
- Build your Swarm of Internet Agents using MultiOn 🚀☆78Updated last year
- An automated tool for discovering insights from research papaer corpora☆138Updated 11 months ago
- ☆79Updated last year
- Record voice notes & transcribe, summarize, and get tasks☆42Updated last year
- Fluid Database☆114Updated 7 months ago
- Turns an Airtable base into a WebGL knowledge graph leveraging relational columns☆33Updated last year
- A simple demo application showcasing the power of Gemini 1.5 Pro's video understanding capabilities.☆29Updated 11 months ago
- Your automated SWE fleet to get your tickets from the Backlog to Prod!☆96Updated last year
- Create keyboard shortcuts for an LLM using OpenAI GPT, Ollama, HuggingFace with Automator on macOS.☆151Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆60Updated 10 months ago
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆74Updated last year