bhimrazy / chat-with-phi-3-visionLinks
Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which include - synthetic data and filtered publicly available websites - with a focus on very high-quality, reasoning dense data both on text and vision.
☆34Updated 11 months ago
Alternatives and similar repositories for chat-with-phi-3-vision
Users that are interested in chat-with-phi-3-vision are comparing it to the libraries listed below
Sorting:
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- ☆114Updated last year
- Agentic RAG to help you build a startup🚀☆55Updated 8 months ago
- A new novel multi-modality (Vision) RAG architecture☆33Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆45Updated last year
- ☆22Updated last year
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆97Updated 11 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated 11 months ago
- ☆56Updated last year
- ☆101Updated last year
- Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.☆58Updated last year
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit☆58Updated last year
- ☆182Updated 9 months ago
- Multimodal AI agent with Llama 3.2: A Streamlit app that processes text, images, PDFs, and PPTs, integrating NIM microservices, Milvus, a…☆133Updated last year
- Simple examples using Argilla tools to build AI☆56Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆123Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface☆96Updated last year
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Updated 11 months ago
- An agent to generate stunning images :)☆23Updated 6 months ago
- Notebooks for fine tuning pali gemma☆117Updated 7 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated 2 months ago
- ☆125Updated 9 months ago
- Own your AI, search the web with it🌐😎☆92Updated 10 months ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Updated last year
- Awesome LLM application repo☆87Updated 9 months ago
- ☆43Updated last week
- Structured pruning and bias visualization for Large Language Models. Tools for LLM optimization and fairness analysis.☆25Updated last week
- Fine tune Gemma 3 on an object detection task☆91Updated 4 months ago
- ☆26Updated last year
- This repository contains a Multimodal Retrieval-Augmented Generation (RAG) Pipeline that integrates images, audio, and text for advanced …☆24Updated 10 months ago