GrantCuster / gemini-json-bounding-box-exampleLinks
☆21Updated last year
Alternatives and similar repositories for gemini-json-bounding-box-example
Users that are interested in gemini-json-bounding-box-example are comparing it to the libraries listed below
Sorting:
- How to use bounding boxes with the Gemini API☆103Updated last year
- Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.☆23Updated 8 months ago
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- Gradio UI for a Cog API☆69Updated last year
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆75Updated last year
- Jockey is a conversational video agent.☆82Updated last month
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆42Updated 5 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆18Updated last month
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆80Updated 10 months ago
- Opensource chat app that uses Exa's API for web search and OpenAI o3-mini☆44Updated last month
- A Python package to dynamically load functions for OpenAI Assistant☆54Updated last year
- A couple scripts to grab stats from email☆43Updated 10 months ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated 10 months ago
- ☆75Updated 7 months ago
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 9 months ago
- For LLMs to better code with Jina API☆158Updated 3 weeks ago
- auto fine tune of models with synthetic data☆76Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆63Updated last year
- Simple Graph Memory for AI applications☆88Updated 2 months ago
- A function to do all☆36Updated last year
- Create keyboard shortcuts for an LLM using OpenAI GPT, Ollama, HuggingFace with Automator on macOS.☆152Updated last year
- Community ComfyUI workflows running on fal.ai☆58Updated 10 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆64Updated 9 months ago
- InterfaceAgent: a versatile framework designed to create system and interface agents capable of managing mobile and desktop applications …☆113Updated last year
- Chrome extension that interacts with content using Groq☆40Updated 6 months ago
- ☆28Updated 7 months ago
- ☆95Updated 7 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 5 months ago
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.re…☆46Updated 8 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆100Updated 6 months ago