bhimrazy / chat-with-phi-3-visionLinks
Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which include - synthetic data and filtered publicly available websites - with a focus on very high-quality, reasoning dense data both on text and vision.
☆33Updated 5 months ago
Alternatives and similar repositories for chat-with-phi-3-vision
Users that are interested in chat-with-phi-3-vision are comparing it to the libraries listed below
Sorting:
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆81Updated last year
- Agentic RAG to help you build a startup🚀☆44Updated 2 months ago
- A new novel multi-modality (Vision) RAG architecture☆28Updated 9 months ago
- ☆21Updated 7 months ago
- ☆113Updated 7 months ago
- Own your AI, search the web with it🌐😎☆86Updated 5 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 9 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 4 months ago
- Tutorial for DSPy☆23Updated last year
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆34Updated 11 months ago
- ☆56Updated 7 months ago
- Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.☆58Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated last month
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated this week
- Streamlit app presented to the Streamlit LLMs Hackathon September 23☆16Updated last year
- RAGLight is a lightweight and modular Python library for implementing Retrieval-Augmented Generation (RAG), Agentic RAG and RAT (Retrieva…☆25Updated 3 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆73Updated 9 months ago
- Deep Research through Multi-Agents, using GraphRAG☆75Updated 7 months ago
- High level tool use for LLMs☆34Updated 11 months ago
- A clean Gradio theme with dark and light variants.☆35Updated last year
- OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆77Updated 3 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆20Updated 8 months ago
- ☆28Updated last year
- Synthetic Data Generation using LLM via Argilla, Distilabel, ChatGPT, etc.☆30Updated last year
- Turn topics, links, and files into AI-generated research notebooks — summarize, explore, and ask anything.☆123Updated 3 weeks ago
- Simple examples using Argilla tools to build AI☆53Updated 7 months ago
- This is the offical page of WikiAutoGen, ICCV2025☆15Updated this week
- Gradio based tool to run opensource LLM models directly from Huggingface☆93Updated last year
- Groq-Whisper Fast Transcription App built using Groq API and Streamlit.☆23Updated 9 months ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Updated last year