视觉信息抽取任务中,使用OCR识别结果规范多模态大模型的回答
☆44Dec 31, 2024Updated last year
Alternatives and similar repositories for guidance-ocr
Users that are interested in guidance-ocr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Web Metadata Extraction Toolkit is designed to streamline the process of extracting, cleaning, and analyzing metadata from websites. …☆17Jul 8, 2024Updated last year
- 陆续开源医疗行业的深度学习模型及数据集☆13Dec 30, 2021Updated 4 years ago
- TEJ_API_Python_實戰應用☆13Dec 26, 2024Updated last year
- Integrates search APIs with GPT models for real-time web access, enabling intelligent Q&A and information retrieval similar to New Bing. …☆41Jul 11, 2024Updated last year
- using lear to do ner extraction☆29Mar 13, 2022Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆12Aug 20, 2023Updated 2 years ago
- OAuth Login for Gradio. Supports multiple identity providers.☆16Jan 20, 2025Updated last year
- chinese few-shot ner☆16Aug 28, 2022Updated 3 years ago
- StructToken : Rethinking Semantic Segmentation with Structural Prior☆29Nov 17, 2022Updated 3 years ago
- ☆12Oct 12, 2023Updated 2 years ago
- 鲁伟《机器学习公式推导与代码实现》。整体对算法的分类是亮点。算法原理和代码实现也相对简单,可以和《机器学习实战》对比起来看。☆10Oct 19, 2022Updated 3 years ago
- 中文关键词提取☆14Aug 7, 2023Updated 2 years ago
- ☆36Apr 1, 2026Updated 2 months ago
- xyb社区公益用途☆19Jun 3, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- DocBank 文档图像增强数据集,此数据集用于文档图像增强,具体任务包括以下内容:Seal detection & Removal 印章检测 & 移除 ;Watermark detection & Removal 水印检测 & 移除;Document deblurrin…☆48Oct 22, 2024Updated last year
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆40Jun 4, 2025Updated last year
- A Django App to test and play around with Langchain features☆11Feb 28, 2024Updated 2 years ago
- Dive into LLM Agents☆18Jun 1, 2024Updated 2 years ago
- 🎉🎨 This repository contains a reading list of papers on Embodied AI, including LLM/MLLM/VLA.☆13Aug 18, 2025Updated 10 months ago
- AI基金大师-基于AI的智能投资分析系统,专为中国基金市场设计,统计市场中股票类型的基金,以收益做为唯一评分标准进行分级☆39Nov 8, 2025Updated 7 months ago
- ☆16Jun 19, 2022Updated 4 years ago
- 一个基于多模态大模型的图表解析器☆44Mar 28, 2025Updated last year
- A LINE Bot demo showcasing how to use a local LLM (Gemma) via Groq to modify personal information and detect the need for LLM assistance.☆17Jul 25, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆18Oct 20, 2023Updated 2 years ago
- [SIGGRAPH Asia 2025] The official implementation of the paper "DvD: Unleashing a Generative Paradigm for Document Dewarping via Coordinat…☆33Mar 10, 2026Updated 3 months ago
- [ICLR 2025] Let Your Features Tell The Differences: Understanding Graph Convolution By Feature Splitting☆16Nov 24, 2025Updated 7 months ago
- go语言版本智能客服项目,速度比python快200%,非常稳定可用,智能客服,搭建本地知识库,智能问答。 Go language version smart customer service project, 200% faster than pyth…☆16Apr 30, 2024Updated 2 years ago
- 数据挖掘-葡萄酒质量分析☆16Jan 17, 2023Updated 3 years ago
- Fast pdf translate是一款pdf翻译软件,基于MinerU实现pdf转markdown的功能,接着对markdown进行分割, 送给大模型翻译,最后组装翻译结果并由pypandoc生成结果pdf。☆44Mar 23, 2025Updated last year
- 自动生成论文模板,避免把时间都浪费在打重复的字和排版上面☆11Apr 12, 2026Updated 2 months ago
- (python) 使用window微信客服端向指定用户/群发送信息☆12Jun 24, 2020Updated 6 years ago
- 🔬 ArXiv论文智能解读助手 - Arxiv-MCP-Server, 支持MCP协议的学术论文一键下载、解析、翻译为中文,并生成微信公众号文章格式☆43Jun 16, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)☆854Jun 12, 2026Updated 2 weeks ago
- A Chinese characters recognition repository with tensorrt format supported based on CRNN_Chinese_Characters_Rec and TensorRTx.☆18Mar 11, 2021Updated 5 years ago
- 用Paddle复现论文ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information(ACL2021)☆10Nov 15, 2021Updated 4 years ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆37Mar 26, 2024Updated 2 years ago
- Official implementation for "Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling"☆10May 21, 2024Updated 2 years ago
- SMILE: A Multimodal Dataset for Understanding Laughter☆13Jun 15, 2023Updated 3 years ago
- 论文一体化写作神器(Python)☆17Apr 11, 2020Updated 6 years ago