A real-time swarf detection and analysis system based on YOLO and Qwen-vl-max, providing efficient video stream processing and intelligent analysis capabilities.
☆48Aug 5, 2025Updated 10 months ago
Alternatives and similar repositories for realtime_vlm_system
Users that are interested in realtime_vlm_system are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于InternLm chat 7B大模型基座,构建一个Agent ,可以调用 MMYOLO 工具来完成图像内视觉任务☆11Oct 30, 2024Updated last year
- Human Parsing Based Alignment with Multi-task Learning for Occluded Person Re-identification, ICME 2020 Oral☆12Sep 2, 2020Updated 5 years ago
- Scripts, data and researches related to cow weight and breed prediction☆13Jun 2, 2026Updated last month
- a new framework for animal behavior automated recognition and measurement☆16May 23, 2025Updated last year
- Official PyTorch implementation of FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion☆13Oct 25, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Image similarity estimation using a Siamese Network with a triplet loss☆11Jul 27, 2023Updated 2 years ago
- TOP4 solution of 2019 iQIYI Celebrity Video Identification Challenge☆20Nov 1, 2019Updated 6 years ago
- ICCV 2019 Workshop & Challenge on Computer Vision for Wildlife Conservation (CVWC).☆16Aug 27, 2019Updated 6 years ago
- UR3 simulation with matlab☆17Dec 5, 2021Updated 4 years ago
- Pose-guided Inter- and Intra-part Relational Transformer for Occluded Person Re-Identification ACM MM 2021☆17Sep 14, 2021Updated 4 years ago
- ☆32May 27, 2025Updated last year
- The third place of the 2019 iQIYI Celebrity Video Identification Challenge☆17Nov 17, 2019Updated 6 years ago
- 基于micropython的esp32s3+豆包语音智能体实时语音对话智能助手☆29Jun 29, 2025Updated last year
- Industrial Defect Diffusion Model (NOT JUST INDUSTRIAL DEFECT~), support DDPM, DDIM and multi-GPU distributed training. 分布式训练,生成模型,扩散模型☆17Nov 10, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation for WACV2021 paper "Enhancing Diversity in Teacher-Student Networks via Asymmetric branches for Unsupervised Person Re-ide…☆19Jun 7, 2021Updated 5 years ago
- ☆16Sep 27, 2023Updated 2 years ago
- ☆15Apr 28, 2023Updated 3 years ago
- Automatic livestock body measurement based on keypoint detection with multiple depth cameras☆28Oct 11, 2021Updated 4 years ago
- Code for CVPR 2019 paper “Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training”☆22Apr 9, 2021Updated 5 years ago
- ✍️ Yet another WYSIWYG Markdown editor written in TypeScript and Tauri. (🤗WIP)☆29May 25, 2025Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- ☆22Aug 11, 2025Updated 10 months ago
- JDATA2019 雪豹识别挑战赛冠军方案☆23Feb 24, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 一个低代码、可定制(颜色、字体、模块)的LaTeX中文模板(自用/持续更新)☆43Feb 13, 2026Updated 4 months ago
- Official Implementation of "Vision-based Behavioral Recognition of Novelty Preference in Pigs"☆22Jul 1, 2021Updated 5 years ago
- FlappyBird Reinforcement Learning based on Pygame, OpenCV, Tensorflow☆14Mar 16, 2020Updated 6 years ago
- a collection of datasets for the re-identification of animal individuals☆47May 26, 2026Updated last month
- Official implementation of the paper "Leveraging Latent Diffusion Models for Training-Free In-Distribution Data Augmentation for Surface …☆30Apr 4, 2025Updated last year
- [NeurIPS 2024 Spotlight] CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning.☆14Dec 12, 2024Updated last year
- RAG Agent for the ARC AGI Challenge☆20Jul 1, 2024Updated 2 years ago
- Who's Waldo? Linking People Across Text and Images. ICCV 2021.☆13May 17, 2023Updated 3 years ago
- ☆18Jan 18, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆17Jan 11, 2022Updated 4 years ago
- The download methods of Vision-language Continual Pretraining Dataset P9D.☆12Jan 3, 2025Updated last year
- A Distributed web crawler system. Support for templated spider development.☆15Apr 7, 2017Updated 9 years ago
- 基于micropython的xiaozhi☆40Apr 19, 2025Updated last year
- Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models☆37Sep 19, 2023Updated 2 years ago
- A NL2SQL plugin based on FocusSearch keyword parsing, offering greater accuracy, higher speed, and more reliability!☆39Apr 14, 2025Updated last year
- code for CVPR2019 paper VPM (Perceive where to focus: Learning visibility-aware part-level features for partial person re-identification)☆32Aug 20, 2020Updated 5 years ago