NSTiwari / PaliGemma-Android-HF
This repository is an implementation of inferring the PaliGemma Vision Language Model on Android using Hugging Face-Gradio Client API for tasks such as zero-shot object detection, image captioning and visual question-answering.
☆19Updated last month
Related projects ⓘ
Alternatives and complementary repositories for PaliGemma-Android-HF
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆14Updated 2 months ago
- Streamlit app presented to the Streamlit LLMs Hackathon September 23☆15Updated 5 months ago
- ☆16Updated 9 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆77Updated 5 months ago
- ☆16Updated this week
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆30Updated last month
- Gradio based tool to run opensource LLM models directly from Huggingface☆87Updated 4 months ago
- ☆28Updated 7 months ago
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated 4 months ago
- GGUF Quantization of any LLM.☆29Updated 8 months ago
- ☆30Updated 10 months ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆22Updated 10 months ago
- ☆29Updated 11 months ago
- BH hackathon☆14Updated 7 months ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆45Updated 7 months ago
- Testing the different LLM and RAG Tests while I learn along the way☆17Updated last month
- An Android app running inference on Meta's Segment-Anything (SAM) and SAM v2☆20Updated 2 months ago
- A library for evaluating Retrieval-Augmented Generation (RAG) systems (The traditional ways).☆23Updated 3 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆19Updated 3 weeks ago
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu…☆32Updated 10 months ago
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll…☆15Updated 6 months ago
- ☆17Updated 6 months ago
- ☆14Updated 8 months ago
- ☆44Updated 3 months ago
- Starter app for creating an AI task completion agent with gmail capabilities.☆26Updated 4 months ago
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on…☆38Updated 8 months ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 2 months ago
- ☆1Updated 3 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 8 months ago
- Routing on Random Forest (RoRF)☆82Updated last month