NSTiwari / PaliGemma-Android-HF
This repository implements inference with the PaliGemma vision-language model on Android, using the Hugging Face Gradio Client API, for tasks such as zero-shot object detection, image captioning, and visual question answering.
☆20 Updated last year
Alternatives and similar repositories for PaliGemma-Android-HF
Users who are interested in PaliGemma-Android-HF are comparing it to the repositories listed below.
- Gradio-based tool to run open-source LLM models directly from Hugging Face ☆95 Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆84 Updated last year
- ☆29 Updated last year
- ML model (or several models) to describe the contents of the UI screen ☆89 Updated 2 months ago
- A custom RAG pipeline for multi-document QA from PDF/DOCX documents, in Android ☆142 Updated last week
- This repository is an implementation of quantizing and converting the Llama3-8B-Instruct model weights and deploying it on Android for on… ☆71 Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆103 Updated 9 months ago
- ☆21 Updated 11 months ago
- ☆28 Updated last year
- ☆86 Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0 ☆24 Updated 7 months ago
- ☆16 Updated last year
- Cerule - A Tiny Mighty Vision Model ☆67 Updated last year
- The next evolution of Agents ☆47 Updated last week
- All the world is a play, we are but actors in it. ☆50 Updated 2 months ago
- Video+code lecture on building nanoGPT from scratch ☆68 Updated last year
- ☆24 Updated last year
- BH hackathon ☆13 Updated last year
- An Android app running inference on Meta's Segment-Anything (SAM) and SAM v2 ☆48 Updated 8 months ago
- 100% Local Document deep search with LLMs ☆26 Updated last year
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around… ☆53 Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆34 Updated last year
- entropix style sampling + GUI ☆27 Updated 11 months ago
- Orpheus Server with streaming support (TTFB ~160ms) ☆17 Updated 3 weeks ago
- ☆95 Updated 9 months ago
- ☆14 Updated last year
- ☆26 Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆22 Updated last year
- ☆78 Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 Updated 8 months ago