How to use bounding boxes with the Gemini API
☆106Jun 23, 2024Updated last year
Alternatives and similar repositories for gemini-spatial-example
Users that are interested in gemini-spatial-example are comparing it to the libraries listed below
Sorting:
- ☆22Jun 3, 2024Updated last year
- Vanilla-Python ergonomics on top of DSPy☆40Jun 3, 2025Updated 8 months ago
- ☆20Mar 3, 2024Updated last year
- A simple demo application showcasing the power of Gemini 1.5 Pro's video understanding capabilities.☆31May 24, 2024Updated last year
- Waffer-thin FlaskGPT on Vercel.☆12Jun 1, 2023Updated 2 years ago
- Wrangler Compatible Cloudflare Deployment API☆19Aug 25, 2025Updated 6 months ago
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆90Nov 26, 2024Updated last year
- ☆35Aug 16, 2024Updated last year
- Fast, 100% local web page summarization with Microsoft Phi-3☆33Apr 29, 2024Updated last year
- Embed anything.☆27May 24, 2024Updated last year
- ☆20Apr 24, 2025Updated 10 months ago
- Repo for Paper "OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft"☆24Feb 5, 2026Updated 3 weeks ago
- doug is an ai experiment with openai, llama-cpp-python, and langchain☆16Sep 2, 2025Updated 5 months ago
- Codebase exploration with AI research agents☆19Feb 25, 2025Updated last year
- ☆13Nov 3, 2023Updated 2 years ago
- ☆71Mar 18, 2024Updated last year
- ☆14Sep 16, 2024Updated last year
- ☆13Sep 4, 2024Updated last year
- A real-time voice AI agent built with Groq API that enables natural voice conversations with configurable AI models, voices, and system p…☆30Sep 18, 2025Updated 5 months ago
- This repository is an implementation of converting sketches into lively videos using Google's Veo 3 model.☆76Jun 30, 2025Updated 8 months ago
- ☆39Oct 8, 2024Updated last year
- An AI-powered Snake game where Claude, an advanced language model, controls the serpent in real-time, showcasing intelligent decision-mak…☆47Nov 6, 2024Updated last year
- VSCode extension for ZenML☆21Updated this week
- A chrome extension can help user to search on muiltiple AI search engine by oneclick☆17Mar 22, 2024Updated last year
- ☆19Aug 15, 2024Updated last year
- ☆23Mar 31, 2025Updated 11 months ago
- Fast inference of Instruct tuned LLaMa on your personal devices.☆23Mar 16, 2023Updated 2 years ago
- An automated tool for discovering insights from research papaer corpora☆137Jun 8, 2024Updated last year
- Model Context Protocol Servers (Browserbase Version)☆49Nov 26, 2024Updated last year
- A collection of tools for your LLMs that run on Modal☆23Feb 28, 2025Updated last year
- ☆21Oct 8, 2024Updated last year
- How to build a real-time transcription solution☆24Mar 5, 2025Updated 11 months ago
- Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to e…☆21May 2, 2024Updated last year
- ☆282Jun 4, 2024Updated last year
- ☆28May 27, 2024Updated last year
- ☆22Oct 19, 2024Updated last year
- A Cloudflare Workers-based API for extracting and converting web page content to Markdown using DOM-Distiller and Turndown.☆29Updated this week
- ☆33Sep 17, 2024Updated last year
- ☆50Updated this week