NSTiwari / PaliGemma-Android-HFLinks
This repository is an implementation of inferring the PaliGemma Vision Language Model on Android using Hugging Face-Gradio Client API for tasks such as zero-shot object detection, image captioning and visual question-answering.
☆19Updated 8 months ago
Alternatives and similar repositories for PaliGemma-Android-HF
Users that are interested in PaliGemma-Android-HF are comparing it to the libraries listed below
Sorting:
- ☆21Updated 7 months ago
- This repository is an implementation of converting sketches into lively videos using Google's Veo 3 model.☆39Updated this week
- ☆16Updated last year
- Run AuraFlow on Replicate☆14Updated 11 months ago
- This repository is an implementation of quantizing and converting the Llama3-8B-Instruct model weights and deploying it on Android for on…☆70Updated last year
- ☆11Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated last month
- ☆29Updated last year
- BH hackathon☆14Updated last year
- Generate Stunning Images and Craft Visual Stories for your Brand☆18Updated 8 months ago
- Community ComfyUI workflows running on fal.ai☆57Updated 9 months ago
- ☆13Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆47Updated last year
- ☆12Updated last year
- ☆1Updated 11 months ago
- Code examples showing how to use Gemini, Gemma, Imagen, and more.☆42Updated 2 months ago
- Content Recommendation is an open source platform that makes use of vector similarity search to provide highly relevant content recommend…☆16Updated last month
- Seamless Voice Interactions with LLMs☆12Updated last year
- A collection of pre-build wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and othersz!☆34Updated this week
- ☆31Updated last year
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated 8 months ago
- The very first artist assistant☆22Updated last year
- ☆12Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆23Updated last year
- we generate captions to the images which are given by user(user input) using prompt engineering and Generative AI☆10Updated 10 months ago
- AI Search engine☆12Updated 4 months ago
- ☆29Updated last year
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆33Updated 5 months ago
- ☆19Updated 9 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆17Updated 2 weeks ago