NSTiwari / PaliGemma-Android-HF
This repository is an implementation of inferring the PaliGemma Vision Language Model on Android using Hugging Face-Gradio Client API for tasks such as zero-shot object detection, image captioning and visual question-answering.
☆19Updated 7 months ago
Alternatives and similar repositories for PaliGemma-Android-HF
Users that are interested in PaliGemma-Android-HF are comparing it to the libraries listed below
Sorting:
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆12Updated 2 months ago
- An Android app running inference on Meta's Segment-Anything (SAM) and SAM v2☆37Updated 3 months ago
- ☆21Updated 6 months ago
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆21Updated last week
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆46Updated last year
- ☆16Updated last year
- ☆16Updated last year
- ☆30Updated last year
- ☆29Updated last year
- ☆13Updated last year
- Code examples showing how to use Gemini, Gemma, Imagen, and more.☆39Updated last month
- ☆28Updated last year
- Run AuraFlow on Replicate☆14Updated 10 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 11 months ago
- ☆29Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 2 months ago
- Streamlit application that helps users analyze RFP's using the latest Gemini 2.0 Flash Experimental LLM.☆13Updated 4 months ago
- A discord bot to stay up to date with Hugging Face Daily Papers.☆14Updated last year
- Roboflow Workflows on ComfyUI☆33Updated 7 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 10 months ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆33Updated 4 months ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆23Updated last year
- ☆32Updated last year
- Community ComfyUI workflows running on fal.ai☆57Updated 8 months ago
- BH hackathon☆14Updated last year
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.☆34Updated last year
- ☆11Updated last year
- ☆14Updated last year
- ☆24Updated last year
- Synthetic text dataset generation☆9Updated this week