bhimrazy / chat-with-qwen2-vl
Chat with Qwen2-VL. Qwen2-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.
☆9 Updated 2 months ago
Related projects
Alternatives and complementary repositories for chat-with-qwen2-vl
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities"; they haven't rel… ☆11 Updated 9 months ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images. ☆23 Updated 10 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆23 Updated last week
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 Updated 8 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll… ☆27 Updated 6 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆20 Updated 5 months ago
- Visual RAG using less than 300 lines of code. ☆23 Updated 8 months ago
- Tool to take your ML model from local to production with one line of code. ☆23 Updated 10 months ago
- Build agentic workflows with function calling ☆20 Updated this week
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs ☆36 Updated 3 weeks ago
- ☆58 Updated 8 months ago
- ☆14 Updated last year
- Implementation of https://arxiv.org/pdf/2312.09299 ☆19 Updated 4 months ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ… ☆32 Updated 2 months ago
- ☆16 Updated 9 months ago
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍 ☆21 Updated this week
- Notebooks using the Neural Magic libraries 📓 ☆41 Updated 3 months ago
- A ⚡️ Lightning.ai ⚡️ app demo for voice-based web search using OpenAI's Whisper and DuckDuckGo ☆26 Updated 2 years ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant … ☆14 Updated 8 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0 ☆24 Updated 2 weeks ago
- arXiv 23 "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" ☆13 Updated 9 months ago
- ☆12 Updated 2 months ago
- OpenMindedChatbot is a proof of concept that leverages the power of open-source Large Language Models (LLMs) with Function Calling capabil… ☆28 Updated 11 months ago
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce… ☆12 Updated last week
- ☆20 Updated 9 months ago
- Luann allows you to create an LLM agent, which has a complete memory module (long-term memory, short-term memory) and knowledge module (Variou… ☆16 Updated this week
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆11 Updated last month
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks ☆15 Updated last week
- 💙 Unstructured Data Connectors for Haystack 2.0 ☆16 Updated last year