bhimrazy / chat-with-phi-3-vision
Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which include - synthetic data and filtered publicly available websites - with a focus on very high-quality, reasoning dense data both on text and vision.
β33Updated 4 months ago
Alternatives and similar repositories for chat-with-phi-3-vision:
Users that are interested in chat-with-phi-3-vision are comparing it to the libraries listed below
- β21Updated 6 months ago
- A new novel multi-modality (Vision) RAG architectureβ27Updated 7 months ago
- Agentic RAG to help you build a startupπβ41Updated last month
- Deep Research through Multi-Agents, using GraphRAGβ67Updated 5 months ago
- Gradio based tool to run opensource LLM models directly from Huggingfaceβ91Updated 10 months ago
- Own your AI, search the web with itππβ85Updated 3 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Reβ¦β21Updated last month
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA onβ¦β44Updated last year
- β56Updated 5 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.β47Updated 8 months ago
- This template demonstrates how to create a collaborative team of AI agents that work together to process, analyze, and generate insights β¦β31Updated 3 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioβ¦β80Updated 11 months ago
- OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generationβ72Updated last month
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlitβ56Updated last year
- Tutorial for DSPyβ23Updated last year
- β27Updated 10 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.β65Updated 7 months ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.β17Updated last year
- β111Updated 5 months ago
- Synthetic Data Generation using LLM via Argilla, Distilabel, ChatGPT, etc.β26Updated 11 months ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.β45Updated last year
- β37Updated last month
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) withβ¦β23Updated last year
- Simple examples using Argilla tools to build AIβ52Updated 5 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the creβ¦β19Updated 6 months ago
- Groq-Whisper Fast Transcription App built using Groq API and Streamlit.β24Updated 7 months ago
- β1Updated 9 months ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale β¦β19Updated 2 weeks ago
- RAG example using DSPy, Gradio, FastAPIβ79Updated last year
- Langchain Usecasesβ16Updated 11 months ago