GrantCuster / gemini-spatial-example
How to use bounding boxes with the Gemini API
☆102Updated 10 months ago
Alternatives and similar repositories for gemini-spatial-example:
Users that are interested in gemini-spatial-example are comparing it to the libraries listed below
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 6 months ago
- ☆21Updated 10 months ago
- Demo of AI chatbot that predicts user message to generate response quickly.☆101Updated last year
- A couple scripts to grab stats from email☆42Updated 7 months ago
- Record voice notes & transcribe, summarize, and get tasks☆41Updated last year
- converts url content into JSON with a simple prefix☆68Updated 11 months ago
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆127Updated 7 months ago
- ☆47Updated last year
- A Python package to dynamically load functions for OpenAI Assistant☆54Updated last year
- OpenAI's Realtime API minus the enterprise bloat☆45Updated 5 months ago
- Fluid Database☆114Updated 7 months ago
- Build your Swarm of Internet Agents using MultiOn 🚀☆78Updated last year
- ☆75Updated 4 months ago
- The Moshi speech-to-speech model, deployed to Modal with a realtime CLI chat☆56Updated 7 months ago
- The end of Screenshot 2023-12-20-21.11.59.png☆94Updated last year
- The next evolution of Agents☆48Updated last week
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆46Updated last month
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆109Updated 2 months ago
- Anthropic Computer Use with Modal Sandboxes☆31Updated 6 months ago
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆74Updated last year
- Chat with your git repo☆154Updated last year
- Gradio UI for a Cog API☆67Updated last year
- ☆77Updated last year
- ☆29Updated 4 months ago
- Get a markdown version of any webpage with a keyboard shortcut.☆63Updated 2 months ago
- auto fine tune of models with synthetic data☆75Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- Globot is an agent that controls your browser using playwright and GPT-4V.☆134Updated last year
- An AI-powered Snake game where Claude, an advanced language model, controls the serpent in real-time, showcasing intelligent decision-mak…☆44Updated 5 months ago
- NotebookLlama powered by Groq - Create podcasts on any topic lightning fast☆70Updated 5 months ago