NSTiwari / PaliGemma-Android-HFLinks
This repository is an implementation of inferring the PaliGemma Vision Language Model on Android using Hugging Face-Gradio Client API for tasks such as zero-shot object detection, image captioning and visual question-answering.
☆20Updated last year
Alternatives and similar repositories for PaliGemma-Android-HF
Users that are interested in PaliGemma-Android-HF are comparing it to the libraries listed below
Sorting:
- ☆17Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- ☆31Updated 2 years ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆96Updated last year
- ☆22Updated last year
- All the world is a play, we are but actors in it.☆50Updated 4 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 9 months ago
- ☆25Updated last year
- ☆29Updated 2 years ago
- This repository stores the source code for the Mistral Hackathon 2024 in Paris☆16Updated last year
- ☆86Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆24Updated last year
- BH hackathon☆14Updated last year
- The next evolution of Agents☆48Updated 2 weeks ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆49Updated last year
- ☆13Updated last year
- ☆117Updated last year
- An Android app running inference on Meta's Segment-Anything (SAM) and SAM v2☆55Updated 10 months ago
- LoRA Explorer model to explore Flux.1[Schnell] with LoRAs☆31Updated last year
- ☆69Updated 8 months ago
- Cerule - A Tiny Mighty Vision Model☆68Updated last month
- run ollama & gguf easily with a single command☆52Updated last year
- ☆28Updated last year
- Gradio UI for a Cog API☆72Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated last year
- ☆27Updated last year
- Generate Stunning Images and Craft Visual Stories for your Brand☆19Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated 11 months ago
- ☆11Updated last year