基于InternLm chat 7B大模型基座,构建一个Agent ,可以调用 MMYOLO 工具来完成图像内视觉任务
☆11Oct 30, 2024Updated last year
Alternatives and similar repositories for VisionAgent
Users that are interested in VisionAgent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DB-based Optical Chemical Structure Recognition☆12Sep 12, 2022Updated 3 years ago
- baseline method for CROCS 2024☆10Jan 24, 2024Updated 2 years ago
- A real-time swarf detection and analysis system based on YOLO and Qwen-vl-max, providing efficient video stream processing and intelligen…☆41Aug 5, 2025Updated 7 months ago
- some code for use k210 by Maixpy better☆15May 2, 2022Updated 3 years ago
- 抖音小视频解析API。可获取无水印链接。☆15May 23, 2024Updated last year
- 中文表格OCR识别系统,支持导出excel或者word表格☆15Sep 17, 2023Updated 2 years ago
- Highly interactive graph data visualization☆15Oct 13, 2021Updated 4 years ago
- Image similarity estimation using a Siamese Network with a triplet loss☆11Jul 27, 2023Updated 2 years ago
- ☆12Dec 6, 2023Updated 2 years ago
- ☆11Aug 20, 2025Updated 7 months ago
- Milvus的中文文档教程☆15Jul 21, 2024Updated last year
- something for paper agent☆11Dec 18, 2024Updated last year
- 基于改进YOLOv7和CRNN的管道裂缝检测系统(源码&教程)☆23Dec 4, 2023Updated 2 years ago
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- [ICCVW2025] V-RoAst: A New Dataset for Visual Road Assessment☆11Dec 17, 2025Updated 3 months ago
- Extract the outline of the table from the paper form obtained from the photo and recognize the text content in the outline. 从拍照得到的纸质表格中检测…☆21Oct 12, 2021Updated 4 years ago
- [CVPR 26] MarkushGrapher: End-to-end Multimodal Recognition of Chemical Structures☆38Updated this week
- HW accelerated h264 encoding on the Allwinner V3s w/ mainline Linux☆24Jul 27, 2023Updated 2 years ago
- Defect-GLM:A Large Visual-Language Model for Industrial Defect Monitoring|首个用于工业缺陷监测的开源大规模视觉语言模型☆103Sep 21, 2024Updated last year
- ☆14Jan 14, 2020Updated 6 years ago
- ☆36Feb 11, 2026Updated last month
- Industrial Defect Diffusion Model (NOT JUST INDUSTRIAL DEFECT~), support DDPM, DDIM and multi-GPU distributed training. 分布式训练,生成模型,扩散模型☆16Nov 10, 2023Updated 2 years ago
- XML Editor is an online web-based tool, designed to create, view, format, edit, save and share xml file. This tool provides multiple feat…☆16Oct 23, 2021Updated 4 years ago
- [AAAI2024] An official pytorch implement of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- ☆10Feb 17, 2024Updated 2 years ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 6 months ago
- ☆20Mar 17, 2026Updated last week
- 表格检测和表结构识别☆24Dec 5, 2022Updated 3 years ago
- Bird's Eye View Calibration Toolkit☆17Jun 21, 2025Updated 9 months ago
- Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of …☆13Jun 24, 2024Updated last year
- Official Repo for FoodieQA paper (EMNLP 2024)☆20Jun 26, 2025Updated 8 months ago
- Concept-based generative models☆13Dec 13, 2024Updated last year
- This repository contains all the code and data used in our article titled “Estimating international trade status of countries from global…☆10Jul 6, 2023Updated 2 years ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆16Oct 27, 2024Updated last year
- Rank9 IJCAI-18 阿里妈妈搜索广告转化预测 第一赛季☆10Aug 22, 2018Updated 7 years ago
- A basic set of akka helper java classes and examples such as map reduce☆30Sep 21, 2012Updated 13 years ago
- ☆24Dec 10, 2024Updated last year
- ☆16Mar 26, 2025Updated 11 months ago
- 微信AI内容创作智能体,可自动完成信息爬取、内容整理、排版及草稿推送。涵盖Kaggle竞赛、HuggingFace论文以及ProductHunt产品资讯。☆16Aug 3, 2025Updated 7 months ago