A real-time swarf detection and analysis system based on YOLO and Qwen-vl-max, providing efficient video stream processing and intelligent analysis capabilities.
☆41Aug 5, 2025Updated 7 months ago
Alternatives and similar repositories for realtime_vlm_system
Users that are interested in realtime_vlm_system are comparing it to the libraries listed below.
- An Agent built on the InternLM chat 7B large-model base that can call MMYOLO tools to complete visual tasks within images☆11Oct 30, 2024Updated last year
- Human Parsing Based Alignment with Multi-task Learning for Occluded Person Re-identification, ICME 2020 Oral☆12Sep 2, 2020Updated 5 years ago
- Official PyTorch implementation of FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion☆13Oct 25, 2022Updated 3 years ago
- Image similarity estimation using a Siamese Network with a triplet loss☆11Jul 27, 2023Updated 2 years ago
- ☆12Dec 6, 2023Updated 2 years ago
- ICCV 2019 Workshop & Challenge on Computer Vision for Wildlife Conservation (CVWC).☆15Aug 27, 2019Updated 6 years ago
- Pose-guided Inter- and Intra-part Relational Transformer for Occluded Person Re-Identification ACM MM 2021☆17Sep 14, 2021Updated 4 years ago
- ☆12Sep 15, 2024Updated last year
- Maaagic UI is an open-source UI framework designed to empower developers with seamless integration and advanced features of AI applicatio…☆14Jun 18, 2024Updated last year
- Industrial Defect Diffusion Model (NOT JUST INDUSTRIAL DEFECT~), supports DDPM, DDIM and multi-GPU distributed training. Distributed training, generative models, diffusion models☆16Nov 10, 2023Updated 2 years ago
- This code is for the Tiger Re-ID in the Wild track CVWC2019 (Detection part)☆20Aug 27, 2019Updated 6 years ago
- ☆15Apr 28, 2023Updated 2 years ago
- ☆24Dec 10, 2024Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- Champion solution for the JDATA2019 Snow Leopard Recognition Challenge☆23Feb 24, 2020Updated 6 years ago
- Official Implementation of "Vision-based Behavioral Recognition of Novelty Preference in Pigs"☆21Jul 1, 2021Updated 4 years ago
- Created with Stability AIʼs Stable Video Diffusion XT 1.1 Image-to-Video latent diffusion model (SVD XT 1.1)☆17Apr 12, 2024Updated last year
- FlappyBird Reinforcement Learning based on Pygame, OpenCV, Tensorflow☆13Mar 16, 2020Updated 6 years ago
- a collection of datasets for the re-identification of animal individuals☆39Mar 4, 2026Updated 2 weeks ago
- Official implementation of the paper "Leveraging Latent Diffusion Models for Training-Free In-Distribution Data Augmentation for Surface …☆26Apr 4, 2025Updated 11 months ago
- [NeurIPS 2024 Spotlight] CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning.☆14Dec 12, 2024Updated last year
- RAG Agent for the ARC AGI Challenge☆20Jul 1, 2024Updated last year
- A Chrome DevTools Extension for OpenSumi.☆14Apr 22, 2024Updated last year
- Who's Waldo? Linking People Across Text and Images. ICCV 2021.☆13May 17, 2023Updated 2 years ago
- ☆18Mar 1, 2024Updated 2 years ago
- The download methods of Vision-language Continual Pretraining Dataset P9D.☆12Jan 3, 2025Updated last year
- A Distributed web crawler system. Support for templated spider development.☆15Apr 7, 2017Updated 8 years ago
- Chrome extension to sync the clipboard between computers☆27Jul 26, 2014Updated 11 years ago
- ☆19Dec 13, 2023Updated 2 years ago
- A NL2SQL plugin based on FocusSearch keyword parsing, offering greater accuracy, higher speed, and more reliability!☆38Apr 14, 2025Updated 11 months ago
- ☆63Dec 5, 2025Updated 3 months ago
- code for CVPR2019 paper VPM (Perceive where to focus: Learning visibility-aware part-level features for partial person re-identification)☆31Aug 20, 2020Updated 5 years ago
- Cross-platform clipboard (copy and paste) sync tool.☆23Jan 9, 2023Updated 3 years ago
- A simple exam generator and grader written in Python with OpenCV☆14Jan 14, 2026Updated 2 months ago
- [NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning☆102Sep 19, 2025Updated 6 months ago
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)☆20Dec 4, 2021Updated 4 years ago
- Recognition of unauthorized mobile phone use in industrial scenarios, based on PaddleX object detection.☆11Jun 11, 2022Updated 3 years ago
- ☆27Jan 5, 2026Updated 2 months ago
- python client of https://cgit.sukimashita.com/usbmuxd.git☆10Jul 23, 2016Updated 9 years ago