视觉信息抽取任务中,使用OCR识别结果规范多模态大模型的回答
☆44Dec 31, 2024Updated last year
Alternatives and similar repositories for guidance-ocr
Users that are interested in guidance-ocr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 中文文档理解多模态语言模型,支持多模态文档信息抽取,文档embedding☆12Jun 26, 2022Updated 3 years ago
- ☆19Dec 6, 2023Updated 2 years ago
- The Web Metadata Extraction Toolkit is designed to streamline the process of extracting, cleaning, and analyzing metadata from websites. …☆17Jul 8, 2024Updated last year
- Code for "DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets", accepted at Neurips 2023 (Main confer…☆27Mar 29, 2024Updated 2 years ago
- 陆续开源医疗行业的深度学习模型及数据集☆13Dec 30, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- using lear to do ner extraction☆29Mar 13, 2022Updated 4 years ago
- DocILE: Document Information Localization and Extraction Benchmark☆145May 15, 2024Updated last year
- chinese few-shot ner☆16Aug 28, 2022Updated 3 years ago
- ☆12Oct 12, 2023Updated 2 years ago
- 中文关键词提取☆14Aug 7, 2023Updated 2 years ago
- ☆37Apr 1, 2026Updated 2 weeks ago
- 前端一键集成WPS加载项☆11Nov 9, 2022Updated 3 years ago
- Script to import data from the The Movie Database to PostgreSQL (Dataset URL: https://www.kaggle.com/rounakbanik/the-movies-dataset☆11Mar 20, 2020Updated 6 years ago
- DocBank 文档图像增强数据集,此数据集用于文档图像增强,具体任务包括以下内容:Seal detection & Removal 印章检测 & 移除 ;Watermark detection & Removal 水印检测 & 移除;Document deblurrin…☆47Oct 22, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆40Jun 4, 2025Updated 10 months ago
- wordmaker是一个自动批量生成word的GUI工具,根据自定义模板生成批量的Word文档,支持WPS.☆16Jun 6, 2023Updated 2 years ago
- A Django App to test and play around with Langchain features☆11Feb 28, 2024Updated 2 years ago
- Dive into LLM Agents☆18Jun 1, 2024Updated last year
- 🎉🎨 This repository contains a reading list of papers on Embodied AI, including LLM/MLLM/VLA.☆13Aug 18, 2025Updated 8 months ago
- ☆16Jun 19, 2022Updated 3 years ago
- 一个基于多模态大模型的图表解析器☆44Mar 28, 2025Updated last year
- code for the SIGGRAPH 2025 paper "Computational Modeling of Gothic Microarchitecture"☆14Apr 25, 2025Updated 11 months ago
- Minimal AG-UI Starter Stack w/ LangGraph & 🪁 CopilotKit☆32Oct 24, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Source code of SAP ABAP☆17Jun 9, 2024Updated last year
- ☆18Oct 20, 2023Updated 2 years ago
- 使用LnagChain+FastAPI+Vue,搭建一個可以上傳並讀取PDF回答問題的LineBot。☆17Updated this week
- [ICLR 2025] Let Your Features Tell The Differences: Understanding Graph Convolution By Feature Splitting☆14Nov 24, 2025Updated 4 months ago
- 2023年iThome鐵人賽「AI & Data」組佳作【30天內成為NLP大師:掌握關鍵工具和技巧】完整程式碼,該文章會從零開始教你該如何微調大型語言模型☆18Nov 21, 2024Updated last year
- go语言版本智能客服项目,速度比python快200%,非常稳定可用,智能客服,搭建本地知识库,智能问答。 Go language version smart customer service project, 200% faster than pyth…☆16Apr 30, 2024Updated last year
- Fast pdf translate是一款pdf翻译软件,基于MinerU实现pdf转markdown的功能,接着对markdown进行分割, 送给大模型翻译,最后组装翻译结果并由pypandoc生成结果pdf。☆42Mar 23, 2025Updated last year
- 根据docx模板,一键批量填充字段并合成新的word文档☆18Nov 28, 2023Updated 2 years ago
- ☆11Nov 5, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- "FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding", NeurIPS 2023 Datasets and Benchmarks Track☆12Jun 20, 2024Updated last year
- 🔬 ArXiv论文智能解读助手 - Arxiv-MCP-Server, 支持MCP协议的学术论文一键下载、解析、翻译为中文,并生成微信公众号文章格式☆39Jun 16, 2025Updated 10 months ago
- On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)☆821Updated this week
- A Chinese characters recognition repository with tensorrt format supported based on CRNN_Chinese_Characters_Rec and TensorRTx.☆18Mar 11, 2021Updated 5 years ago
- 通过人脸识别定位身份证获取身份证号☆18Feb 22, 2018Updated 8 years ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆34Mar 26, 2024Updated 2 years ago
- 用Paddle复现论文ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information(ACL2021)☆10Nov 15, 2021Updated 4 years ago