NSTiwari / PaliGemma-Android-HFLinks
This repository is an implementation of inferring the PaliGemma Vision Language Model on Android using Hugging Face-Gradio Client API for tasks such as zero-shot object detection, image captioning and visual question-answering.
☆20Updated last year
Alternatives and similar repositories for PaliGemma-Android-HF
Users that are interested in PaliGemma-Android-HF are comparing it to the libraries listed below
Sorting:
- Gradio based tool to run opensource LLM models directly from Huggingface☆96Updated last year
- ☆17Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆85Updated last year
- ☆21Updated last year
- All the world is a play, we are but actors in it.☆49Updated 6 months ago
- This repository is an implementation of quantizing and converting the Llama3-8B-Instruct model weights and deploying it on Android for on…☆74Updated last year
- An Android app running inference on Meta's Segment-Anything (SAM) and SAM v2☆57Updated last year
- BH hackathon☆14Updated last year
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Updated last year
- This repository stores the source code for the Mistral Hackathon 2024 in Paris☆16Updated last year
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Updated last year
- ☆29Updated 2 years ago
- ☆31Updated 2 years ago
- ☆52Updated 2 years ago
- A custom RAG pipeline for multi-document QA from PDF/DOCX documents, in Android☆172Updated 3 weeks ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 11 months ago
- run ollama & gguf easily with a single command☆52Updated last year
- Cerule - A Tiny Mighty Vision Model☆68Updated 2 months ago
- ☆51Updated last year
- Ollama models of NousResearch/Hermes-2-Pro-Mistral-7B-GGUF☆34Updated last year
- Video+code lecture on building nanoGPT from scratch☆68Updated last year
- ☆82Updated last year
- ☆78Updated 2 years ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Updated 4 months ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- 100% Local Document deep search with LLMs☆26Updated last year
- ☆27Updated last year
- Run Ollama LLM models in Google Colab for free☆37Updated last year
- Run AuraFlow on Replicate☆14Updated last year