bhimrazy / chat-with-qwen2-vl
Chat with Qwen2-VL. Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
☆8Updated last month
Related projects ⓘ
Alternatives and complementary repositories for chat-with-qwen2-vl
- Tool to take your ML model from local to production with one-line of code.☆23Updated 9 months ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆23Updated 10 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated last week
- Build Agentic workflows with function calling☆20Updated last week
- ChatBot App built using LangChain and Lightning AI☆17Updated last year
- ☆12Updated last week
- Github repo for Peifeng's internship project☆12Updated last year
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆11Updated 9 months ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆31Updated last month
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆26Updated 5 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 5 months ago
- a simple create-llama template using llama-index v0.10 and integrated with Ollama☆9Updated 5 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆19Updated 4 months ago
- Tools for merging pretrained large language models.☆19Updated 5 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆11Updated this week
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆42Updated 3 months ago
- Notebooks using the Neural Magic libraries 📓☆41Updated 3 months ago
- Ultra-minimal autoregressive diffusion model for image generation☆15Updated last month
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆19Updated 3 weeks ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆36Updated 9 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆14Updated 8 months ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆19Updated 7 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆20Updated last week
- BH hackathon☆14Updated 7 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆77Updated 5 months ago
- Visual RAG using less than 300 lines of code.☆23Updated 8 months ago
- ☆24Updated last year
- ☆20Updated 9 months ago
- ☆14Updated 5 months ago
- Tutorial for DSPy☆21Updated 6 months ago