A real-time swarf detection and analysis system based on YOLO and Qwen-vl-max, providing efficient video stream processing and intelligent analysis capabilities.
☆42 · Aug 5, 2025 · Updated 8 months ago
Alternatives and similar repositories for realtime_vlm_system
Users that are interested in realtime_vlm_system are comparing it to the libraries listed below.
- An agent built on the InternLM Chat 7B large-model base that can call MMYOLO tools to complete visual tasks on images ☆11 · Oct 30, 2024 · Updated last year
- Grounded-SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and … ☆12 · Sep 5, 2023 · Updated 2 years ago
- Loosely-coupled GNSS/INS integrated navigation system ☆11 · Jan 15, 2020 · Updated 6 years ago
- BP neural network + Kalman filter fusion prediction model (predicts point coordinates) ☆17 · Apr 21, 2018 · Updated 7 years ago
- ☆12 · Dec 6, 2023 · Updated 2 years ago
- Multivariate LSTM on PyTorch to predict stock market prices ☆12 · May 29, 2021 · Updated 4 years ago
- ☆12 · Sep 15, 2024 · Updated last year
- Attempt to predict the stock price with a BP neural network ☆14 · Mar 27, 2017 · Updated 9 years ago
- Maaagic UI is an open-source UI framework designed to empower developers with seamless integration and advanced features of AI applicatio… ☆14 · Jun 18, 2024 · Updated last year
- Industrial Defect Diffusion Model (NOT JUST INDUSTRIAL DEFECT~), supports DDPM, DDIM and multi-GPU distributed training. Distributed training, generative models, diffusion models ☆16 · Nov 10, 2023 · Updated 2 years ago
- Loosely Coupled INS/GNSS Error State Extended Kalman Filter in Python/C++ ☆22 · Jun 7, 2020 · Updated 5 years ago
- ☆15 · Apr 28, 2023 · Updated 2 years ago
- A desktop application template built with Electron and Vue3 that runs cross-platform (Windows/macOS/Linux). The template includes core features such as login, a navigation bar, extension windows, minimize, and SSO login integration, and is suited to quickly bootstrapping desktop application development. ☆22 · Dec 3, 2024 · Updated last year
- ☆25 · Dec 10, 2024 · Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval ☆16 · Jul 6, 2024 · Updated last year
- AutoQuant is an out-of-the-box quantitative investment platform. ☆20 · Aug 1, 2023 · Updated 2 years ago
- A low-code, customizable (colors, fonts, modules) Chinese LaTeX template (for personal use, continuously updated) ☆29 · Feb 13, 2026 · Updated 2 months ago
- Entropy Based Sampling and Parallel CoT Decoding ☆17 · Oct 9, 2024 · Updated last year
- Using a BP neural network (BPNN) to predict power load on a small dataset, through a simple example. ☆27 · May 4, 2020 · Updated 5 years ago
- RAG Agent for the ARC AGI Challenge ☆20 · Jul 1, 2024 · Updated last year
- ☆18 · Mar 1, 2024 · Updated 2 years ago
- Who's Waldo? Linking People Across Text and Images. ICCV 2021. ☆13 · May 17, 2023 · Updated 2 years ago
- Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models ☆37 · Sep 19, 2023 · Updated 2 years ago
- An NL2SQL plugin based on FocusSearch keyword parsing, offering greater accuracy, higher speed, and more reliability! ☆39 · Apr 14, 2025 · Updated last year
- A simple exam generator and grader written in Python with OpenCV ☆14 · Jan 14, 2026 · Updated 3 months ago
- Building a multi-agent RAG system with advanced RAG methods ☆12 · Jan 12, 2025 · Updated last year
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021) ☆20 · Dec 4, 2021 · Updated 4 years ago
- ☆34 · May 29, 2025 · Updated 10 months ago
- IEEE 802.11n CSI and camera synchronization toolkit. ☆16 · Mar 17, 2026 · Updated 3 weeks ago
- Recognition of prohibited mobile phone use in industrial settings, based on PaddleX object detection. ☆11 · Jun 11, 2022 · Updated 3 years ago
- ☆28 · Mar 30, 2026 · Updated 2 weeks ago
- Code for "MVOC: a training-free multiple video object composition method with diffusion models" ☆23 · Jul 3, 2024 · Updated last year
- PyTorch code for AWRaCLe: All-Weather Image Restoration using Visual In-Context Learning ☆24 · Mar 22, 2025 · Updated last year
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p… ☆14 · Feb 5, 2024 · Updated 2 years ago
- Lion: Kindling Vision Intelligence within Large Language Models ☆51 · Jan 25, 2024 · Updated 2 years ago
- Evaluate robustness of adaptation methods on large vision-language models ☆19 · Aug 23, 2023 · Updated 2 years ago
- [CVPR 2025] Few-shot Recognition via Stage-Wise Retrieval-Augmented Finetuning ☆29 · Mar 15, 2026 · Updated 3 weeks ago
- More reliable Video Understanding Evaluation ☆15 · Sep 23, 2025 · Updated 6 months ago
- ☆29 · Jan 3, 2025 · Updated last year