An Agent built on the InternLM chat 7B base model that can call MMYOLO tools to complete visual tasks within images
☆11Oct 30, 2024Updated last year
Alternatives and similar repositories for VisionAgent
Users that are interested in VisionAgent are comparing it to the libraries listed below.
- DB-based Optical Chemical Structure Recognition☆13Sep 12, 2022Updated 3 years ago
- baseline method for CROCS 2024☆10Jan 24, 2024Updated 2 years ago
- A real-time swarf detection and analysis system based on YOLO and Qwen-vl-max, providing efficient video stream processing and intelligen…☆45Aug 5, 2025Updated 9 months ago
- Some code for making better use of the K210 with MaixPy☆15May 2, 2022Updated 4 years ago
- Douyin short-video parsing API. Can retrieve watermark-free links.☆18May 23, 2024Updated 2 years ago
- Highly interactive graph data visualization☆15Oct 13, 2021Updated 4 years ago
- Chinese table OCR recognition system, with export to Excel or Word tables☆16Sep 17, 2023Updated 2 years ago
- Image similarity estimation using a Siamese Network with a triplet loss☆11Jul 27, 2023Updated 2 years ago
- ☆12Dec 6, 2023Updated 2 years ago
- Chinese documentation and tutorials for Milvus☆15Jul 21, 2024Updated last year
- ☆11Aug 20, 2025Updated 9 months ago
- something for paper agent☆11Dec 18, 2024Updated last year
- Pipeline crack detection system based on improved YOLOv7 and CRNN (source code & tutorial)☆24Dec 4, 2023Updated 2 years ago
- A simple API version of funasr speech-to-text (funasr + fastapi), easy to deploy on a server☆13Aug 10, 2024Updated last year
- [ICCVW2025] V-RoAst: Visual Road Assessment. Can VLM be a Road Safety Assessor Using the iRAP Standard?☆12Dec 17, 2025Updated 5 months ago
- Extract the outline of the table from the paper form obtained from the photo and recognize the text content in the outline. From paper forms obtained by photographing, detect…☆21Oct 12, 2021Updated 4 years ago
- Official code for ''RAG Meets Temporal Graphs: Time-Sensitive Modeling and Retrieval for Evolving Knowledge''.☆32Feb 25, 2026Updated 2 months ago
- [CVPR 26] MarkushGrapher-2: End-to-end Multimodal Recognition of Chemical Structures☆58Apr 24, 2026Updated last month
- HW accelerated h264 encoding on the Allwinner V3s w/ mainline Linux☆24Jul 27, 2023Updated 2 years ago
- Defect-GLM: A Large Visual-Language Model for Industrial Defect Monitoring | The first open-source large-scale vision-language model for industrial defect monitoring☆107Sep 21, 2024Updated last year
- ☆14Jan 14, 2020Updated 6 years ago
- ☆38Feb 11, 2026Updated 3 months ago
- Industrial Defect Diffusion Model (NOT JUST INDUSTRIAL DEFECT~), supports DDPM, DDIM and multi-GPU distributed training. Distributed training, generative models, diffusion models☆17Nov 10, 2023Updated 2 years ago
- XML Editor is an online web-based tool, designed to create, view, format, edit, save and share xml file. This tool provides multiple feat…☆16Oct 23, 2021Updated 4 years ago
- [AAAI2024] An official PyTorch implementation of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 8 months ago
- ☆10Feb 17, 2024Updated 2 years ago
- ☆23Mar 17, 2026Updated 2 months ago
- Table detection and table structure recognition☆24Dec 5, 2022Updated 3 years ago
- Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of …☆13Jun 24, 2024Updated last year
- Bird's Eye View Calibration Toolkit☆19Jun 21, 2025Updated 11 months ago
- ☆10Nov 1, 2021Updated 4 years ago
- Official Repo for FoodieQA paper (EMNLP 2024)☆20Jun 26, 2025Updated 10 months ago
- This repository contains all the code and data used in our article titled “Estimating international trade status of countries from global…☆10Jul 6, 2023Updated 2 years ago
- Concept-based generative models☆12Dec 13, 2024Updated last year
- Rank 9, IJCAI-18 Alimama search ad conversion prediction, Season 1☆10Aug 22, 2018Updated 7 years ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆16Oct 27, 2024Updated last year
- A basic set of akka helper java classes and examples such as map reduce☆31Sep 21, 2012Updated 13 years ago
- ☆26Dec 10, 2024Updated last year