NSTiwari / PaliGemma-Android-HF
This repository implements inference with the PaliGemma vision-language model on Android, using the Hugging Face Gradio Client API, for tasks such as zero-shot object detection, image captioning, and visual question answering.
☆20 · Updated 5 months ago
Alternatives and similar repositories for PaliGemma-Android-HF:
Users who are interested in PaliGemma-Android-HF are comparing it to the repositories listed below:
- ☆21 · Updated 4 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆80 · Updated 9 months ago
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str… ☆11 · Updated 2 weeks ago
- An Android app running inference on Meta's Segment-Anything (SAM) and SAM v2 ☆34 · Updated last month
- ☆16 · Updated last year
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ… ☆33 · Updated 2 months ago
- Uses a Gradio interface to stream coding-related responses from local and cloud-based large language models. Pulls context from GitHub Re… ☆20 · Updated 2 weeks ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… ☆23 · Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them. ☆45 · Updated last year
- ☆16 · Updated last year
- RAG example using DSPy, Gradio, FastAPI ☆75 · Updated 11 months ago
- This repository includes the code to download the curated HuggingFace papers into a single Markdown-formatted file ☆14 · Updated 8 months ago
- ☆14 · Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆17 · Updated 5 months ago
- ☆30 · Updated last year
- BH hackathon ☆14 · Updated 11 months ago
- Community ComfyUI workflows running on fal.ai ☆57 · Updated 6 months ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel… ☆13 · Updated last year
- EdgeSAM model for use with Autodistill. ☆26 · Updated 9 months ago
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX. ☆41 · Updated 6 months ago
- RAG chatbot boilerplate based on React and TypeScript ☆33 · Updated 11 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 · Updated 8 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 · Updated last year
- This repository stores the source code for the Mistral Hackathon 2024 in Paris ☆16 · Updated 7 months ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine ☆21 · Updated 3 months ago
- ☆17 · Updated 3 months ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and SigLIP. ☆17 · Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0 ☆24 · Updated last month
- ☆13 · Updated last year
- Roboflow Workflows on ComfyUI ☆32 · Updated 6 months ago