A real-time swarf detection and analysis system based on YOLO and Qwen-vl-max, providing efficient video stream processing and intelligent analysis capabilities.
☆45Aug 5, 2025Updated 9 months ago
Alternatives and similar repositories for realtime_vlm_system
Users who are interested in realtime_vlm_system are comparing it to the libraries listed below.
- Builds an Agent on the InternLm chat 7B large-model base that can call MMYOLO tools to complete visual tasks on images☆11Oct 30, 2024Updated last year
- A simplified version of MPN☆13May 21, 2021Updated 5 years ago
- [ICIP2021] The official PyTorch implementation of MASK GUIDED ATTENTION FOR FINE-GRAINED PATCHY IMAGE CLASSIFICATION☆11Nov 3, 2021Updated 4 years ago
- Image similarity estimation using a Siamese Network with a triplet loss☆11Jul 27, 2023Updated 2 years ago
- ICCV 2019 Workshop & Challenge on Computer Vision for Wildlife Conservation (CVWC).☆16Aug 27, 2019Updated 6 years ago
- Pose-guided Inter- and Intra-part Relational Transformer for Occluded Person Re-Identification ACM MM 2021☆17Sep 14, 2021Updated 4 years ago
- ☆12Sep 15, 2024Updated last year
- Industrial Defect Diffusion Model (NOT JUST INDUSTRIAL DEFECT~), supports DDPM, DDIM and multi-GPU distributed training. Distributed training, generative models, diffusion models☆17Nov 10, 2023Updated 2 years ago
- Implementation for WACV2021 paper "Enhancing Diversity in Teacher-Student Networks via Asymmetric branches for Unsupervised Person Re-ide…☆18Jun 7, 2021Updated 4 years ago
- This code is for the Tiger Re-ID in the Wild track CVWC2019 (Detection part)☆20Aug 27, 2019Updated 6 years ago
- ☆16Sep 27, 2023Updated 2 years ago
- ☆15Apr 28, 2023Updated 3 years ago
- Python library for building and sharing dataframe-agnostic, sklearn-style transformers and ml models for data science competitions.☆27Mar 10, 2026Updated 2 months ago
- Automatic livestock body measurement based on keypoint detection with multiple depth cameras☆28Oct 11, 2021Updated 4 years ago
- Code for CVPR 2019 paper “Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training”☆22Apr 9, 2021Updated 5 years ago
- AutoQuant is an out-of-the-box quantitative investment platform.☆20Aug 1, 2023Updated 2 years ago
- Champion solution for the JDATA2019 Snow Leopard Recognition Challenge☆23Feb 24, 2020Updated 6 years ago
- Official Implementation of "Vision-based Behavioral Recognition of Novelty Preference in Pigs"☆22Jul 1, 2021Updated 4 years ago
- FlappyBird Reinforcement Learning based on Pygame, OpenCV, Tensorflow☆14Mar 16, 2020Updated 6 years ago
- a collection of datasets for the re-identification of animal individuals☆46Mar 4, 2026Updated 2 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Oct 9, 2024Updated last year
- Using a BP neural network (BPNN) to predict power load on a small dataset, through a simple example.☆27May 4, 2020Updated 6 years ago
- Code of StyleCrafter on SDXL☆20Jun 25, 2024Updated last year
- [NeurIPS 2024 Spotlight] CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning.☆14Dec 12, 2024Updated last year
- Using HRNet for CVWC 2019 Tiger Pose Track Challenge.☆27Oct 12, 2021Updated 4 years ago
- RAG Agent for the ARC AGI Challenge☆20Jul 1, 2024Updated last year
- Who's Waldo? Linking People Across Text and Images. ICCV 2021.☆13May 17, 2023Updated 3 years ago
- ☆18Mar 1, 2024Updated 2 years ago
- Foxconn - metal parts - automated dimension measurement - computer vision☆32Mar 24, 2023Updated 3 years ago
- Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models☆37Sep 19, 2023Updated 2 years ago
- code for CVPR2019 paper VPM (Perceive where to focus: Learning visibility-aware part-level features for partial person re-identification)☆31Aug 20, 2020Updated 5 years ago
- A simple exam generator and grader written in Python with OpenCV☆14Jan 14, 2026Updated 4 months ago
- ☆67Dec 5, 2025Updated 5 months ago
- Vision-Language Pretraining & Efficient Transformer Papers.☆15Nov 30, 2021Updated 4 years ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆26May 31, 2025Updated 11 months ago
- Building a multi-agent RAG system with advanced RAG methods☆13Jan 12, 2025Updated last year
- ☆13Feb 5, 2025Updated last year
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)☆20Dec 4, 2021Updated 4 years ago
- ☆37May 29, 2025Updated 11 months ago