NSTiwari / PaliGemma-Android-HFLinks
This repository is an implementation of inferring the PaliGemma Vision Language Model on Android using Hugging Face-Gradio Client API for tasks such as zero-shot object detection, image captioning and visual question-answering.
☆20Updated last year
Alternatives and similar repositories for PaliGemma-Android-HF
Users that are interested in PaliGemma-Android-HF are comparing it to the libraries listed below
Sorting:
- ☆17Updated last year
- ☆29Updated last year
- ☆31Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface☆96Updated last year
- ☆21Updated last year
- ☆16Updated last year
- This repository is an implementation of quantizing and converting the Llama3-8B-Instruct model weights and deploying it on Android for on…☆73Updated last year
- BH hackathon☆14Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆24Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated 6 months ago
- An Android app running inference on Meta's Segment-Anything (SAM) and SAM v2☆54Updated 9 months ago
- ☆13Updated last year
- ☆40Updated last year
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.☆36Updated 2 years ago
- Streamlit app presented to the Streamlit LLMs Hackathon September 23☆16Updated last year
- ☆28Updated last year
- GRDN.AI app for garden optimization☆70Updated last week
- All the world is a play, we are but actors in it.☆50Updated 4 months ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆34Updated 10 months ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 9 months ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆49Updated last year
- Cerule - A Tiny Mighty Vision Model☆68Updated 2 weeks ago
- Community ComfyUI workflows running on fal.ai☆57Updated last year
- Finetune any model on HF in less than 30 seconds☆56Updated last month
- ☆15Updated 2 years ago
- ML model (or several models) to describe the contents of the UI screen☆93Updated 3 months ago
- EdgeSAM model for use with Autodistill.☆29Updated last year
- ☆28Updated last year