bhimrazy / chat-with-phi-3-vision
Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which include - synthetic data and filtered publicly available websites - with a focus on very high-quality, reasoning dense data both on text and vision.
☆33Updated 2 months ago
Alternatives and similar repositories for chat-with-phi-3-vision:
Users that are interested in chat-with-phi-3-vision are comparing it to the libraries listed below
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 9 months ago
- ☆26Updated 9 months ago
- Groq-Whisper Fast Transcription App built using Groq API and Streamlit.☆22Updated 5 months ago
- ☆18Updated 4 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 8 months ago
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆29Updated last year
- A new novel multi-modality (Vision) RAG architecture☆23Updated 5 months ago
- Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.☆58Updated 9 months ago
- High level tool use for LLMs☆34Updated 7 months ago
- Rag Chatbot React And Tyepscript base boilerplate☆32Updated 11 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 6 months ago
- ☆29Updated last year
- TinyClick: Single-Turn Agent for Empowering GUI Automation☆30Updated 5 months ago
- ☆56Updated 3 months ago
- Own your AI, search the web with it🌐😎☆82Updated 2 months ago
- Deep Research through Multi-Agents, using GraphRAG☆62Updated 4 months ago
- Text generation in Python, as easy as possible☆55Updated this week
- Awesome LLM application repo☆67Updated last week
- ☆106Updated 3 months ago
- OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆68Updated last week
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆31Updated 8 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆104Updated 3 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆20Updated last week
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit☆56Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆86Updated 2 months ago
- ☆45Updated 11 months ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Updated last year
- RAG example using DSPy, Gradio, FastAPI☆76Updated 11 months ago
- Fine Tuning Multimodal LLM "Idefics 9B" on Pokemon Go Dataset available on Hugging Face.☆19Updated last year