szhowardhuang/VisionAgent

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/szhowardhuang/VisionAgent)

szhowardhuang / VisionAgent

基于InternLm chat 7B大模型基座，构建一个Agent ，可以调用 MMYOLO 工具来完成图像内视觉任务

☆11

Alternatives and similar repositories for VisionAgent

Users that are interested in VisionAgent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

duaibeom / chemOCR
View on GitHub
DB-based Optical Chemical Structure Recognition
☆13Sep 12, 2022Updated 3 years ago
crocs-ifly-ustc / CROCS-Baseline
View on GitHub
baseline method for CROCS 2024
☆10Jan 24, 2024Updated 2 years ago
DS4SD / MarkushGenerator
View on GitHub
[CVPR 25] MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures
☆15Mar 22, 2026Updated 4 months ago
manho30 / douyinapi
View on GitHub
抖音小视频解析API。可获取无水印链接。
☆20May 23, 2024Updated 2 years ago
USTHzhanglu / Maixpy
View on GitHub
some code for use k210 by Maixpy better
☆15May 2, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
JZPeterPan / DAS-Medical-Red-Teaming-Agents
View on GitHub
☆19Aug 17, 2025Updated 11 months ago
caigouShaw / table_rec_system
View on GitHub
中文表格OCR识别系统，支持导出excel或者word表格
☆16Sep 17, 2023Updated 2 years ago
elcaiseri / Siamese-Network
View on GitHub
Image similarity estimation using a Siamese Network with a triplet loss
☆11Jul 27, 2023Updated 2 years ago
liteli1987gmail / milvus_docs
View on GitHub
Milvus的中文文档教程
☆15Jul 21, 2024Updated 2 years ago
qunshansj / Enhanced-YOLO-CRNN-Pipeline-Crack-Detection
View on GitHub
基于改进YOLOv7和CRNN的管道裂缝检测系统（源码＆教程）
☆25Dec 4, 2023Updated 2 years ago
ozzyou / RP-FEM
View on GitHub
☆12Dec 6, 2023Updated 2 years ago
KMnO4-zx / paper-agent
View on GitHub
something for paper agent
☆11Dec 18, 2024Updated last year
SpeechEE / SpeechEE
View on GitHub
☆11Aug 20, 2025Updated 11 months ago
lizhaokun / Table-Extraction-and-Chinese-OCR
View on GitHub
Extract the outline of the table from the paper form obtained from the photo and recognize the text content in the outline. 从拍照得到的纸质表格中检测…
☆21Oct 12, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lalitbhagtani / web-xml-editor
View on GitHub
XML Editor is an online web-based tool, designed to create, view, format, edit, save and share xml file. This tool provides multiple feat…
☆16Oct 23, 2021Updated 4 years ago
JIN-strong / Table-OCR-based-on-DeepLearning
View on GitHub
表格检测和表结构识别
☆24Dec 5, 2022Updated 3 years ago
jundaychan / funasr-fastapi
View on GitHub
funasr语音转文字的简单api版本，funasr+fastapi，方便部署在服务器上
☆13Aug 10, 2024Updated last year
Unturned3 / h264enc_demo
View on GitHub
HW accelerated h264 encoding on the Allwinner V3s w/ mainline Linux
☆26Jul 27, 2023Updated 2 years ago
PongNJ / V-RoAst
View on GitHub
[ICCVW2025] V-RoAst: Visual Road Assessment. Can VLM be a Road Safety Assessor Using the iRAP Standard?
☆13Dec 17, 2025Updated 7 months ago
Monash-Civil-CV-Team / FCS-Net
View on GitHub
☆10Nov 1, 2021Updated 4 years ago
BryanGao-1216 / TheSecondYou
View on GitHub
微信恋爱陪伴场景下的中文对话机器人与强化学习流水线，开箱即用地提供数据处理、微调、PPO、Agent 工具链，并预置本地 vLLM 推理接口。帮助打造第二个你
☆16Dec 17, 2025Updated 7 months ago
WH-HuanWang / Defect-GLM
View on GitHub
Defect-GLM：A Large Visual-Language Model for Industrial Defect Monitoring|首个用于工业缺陷监测的开源大规模视觉语言模型
☆108Sep 21, 2024Updated last year
yuanzhongqiao / Industrial-Defect-Diffusion-Model
View on GitHub
Industrial Defect Diffusion Model (NOT JUST INDUSTRIAL DEFECT~), support DDPM, DDIM and multi-GPU distributed training. 分布式训练，生成模型，扩散模型
☆17Nov 10, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
vaadin / Graph-Explorer
View on GitHub
Highly interactive graph data visualization
☆15Oct 13, 2021Updated 4 years ago
HarrisSK / Knowledge-augmented-LLMs-for-construction-contract-risk-identification
View on GitHub
☆10Feb 17, 2024Updated 2 years ago
PaddlePaddle / tape
View on GitHub
☆14Jan 14, 2020Updated 6 years ago
taolinzhang / 3DVLP
View on GitHub
[AAAI2024] An official pytorch implement of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…
☆13Dec 8, 2024Updated last year
TU-Berlin-DIMA / Condor
View on GitHub
Condor allows for the specification of synopsis-based streaming jobs on top of general dataflow systems. Condor provides a collection of …
☆13Jun 24, 2024Updated 2 years ago
prescient-design / CBGM
View on GitHub
Concept-based generative models
☆12Dec 13, 2024Updated last year
Network-Maritime-Complexity / GLSN-and-international-trade
View on GitHub
This repository contains all the code and data used in our article titled “Estimating international trade status of countries from global…
☆10Jul 6, 2023Updated 3 years ago
tusharsircar95 / Data-Science-Hackathons-Analytics-Vidhya
View on GitHub
Sharing my solutions to data science hackathons conducted by Analytics Vidhya
☆11Apr 29, 2018Updated 8 years ago
rtll666 / realtime_vlm_system
View on GitHub
A real-time swarf detection and analysis system based on YOLO and Qwen-vl-max, providing efficient video stream processing and intelligen…
☆49Aug 5, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yuxiaowww / IJCAI-18-TIANCHI
View on GitHub
Rank9 IJCAI-18 阿里妈妈搜索广告转化预测第一赛季
☆10Aug 22, 2018Updated 7 years ago
wenqiglantz / rag-notebook-to-microservices
View on GitHub
The Journey of RAG: From Notebook to Microservices
☆28Feb 22, 2024Updated 2 years ago
zhanggefan / mmdet-yolov4
View on GitHub
☆18May 27, 2021Updated 5 years ago
ludc506 / InternVL-X
View on GitHub
☆16Mar 26, 2025Updated last year
kingsaction / GraphAnalysis
View on GitHub
大规模图数据交互式可视化分析平台
☆14Apr 23, 2018Updated 8 years ago
XieZilongAI / E2E-AFG
View on GitHub
An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation
☆16Oct 27, 2024Updated last year
valorheart-20 / TransReID
View on GitHub
☆15Jan 21, 2026Updated 6 months ago