hzauzxb / guidance-ocr
In visual information extraction tasks, use OCR recognition results to constrain the answers of multimodal large language models
☆44Dec 31, 2024Updated last year
Alternatives and similar repositories for guidance-ocr
Users that are interested in guidance-ocr are comparing it to the libraries listed below
- Chinese document-understanding multimodal language model, supporting multimodal document information extraction and document embedding☆12Jun 26, 2022Updated 3 years ago
- ☆21Feb 26, 2024Updated last year
- ☆19Dec 6, 2023Updated 2 years ago
- ☆34Dec 18, 2025Updated last month
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆39Jun 4, 2025Updated 8 months ago
- ☆27Jul 18, 2023Updated 2 years ago
- using lear to do ner extraction☆29Mar 13, 2022Updated 3 years ago
- Chinese aspect-level sentiment recognition using BERT.☆25Jun 26, 2023Updated 2 years ago
- [SIGGRAPH Asia 2025] The official implementation of the paper "DvD: Unleashing a Generative Paradigm for Document Dewarping via Coordinat…☆32Nov 22, 2025Updated 2 months ago
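- DocBank document image enhancement dataset, used for document image enhancement; tasks include: seal detection & removal; watermark detection & removal; document deblurrin…☆43Oct 22, 2024Updated last year
- Fast pdf translate is a PDF translation tool that uses MinerU to convert PDF to Markdown, splits the Markdown, sends the pieces to a large language model for translation, then assembles the translated results and generates the output PDF with pypandoc.☆42Mar 23, 2025Updated 10 months ago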
- Backtesting fbprophet prediction of Silver prices for 2017☆14Nov 29, 2017Updated 8 years ago
- Chinese versions of APIs used in machine learning, plus machine learning theory☆13Jun 8, 2025Updated 8 months ago
- A chart parser based on a multimodal large model☆43Mar 28, 2025Updated 10 months ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆44Sep 24, 2024Updated last year
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆29May 10, 2024Updated last year
- Integrates search APIs with GPT models for real-time web access, enabling intelligent Q&A and information retrieval similar to New Bing. …☆42Jul 11, 2024Updated last year
- ☆22Dec 11, 2025Updated 2 months ago
- A tiny reactive dataflow library for scheduling a DAG of async functions in Javascript☆12Oct 22, 2022Updated 3 years ago
- ☆18Feb 16, 2025Updated last year
- ☆11Oct 31, 2024Updated last year
- Implementation of the TFHE homomorphic encryption scheme.☆12May 14, 2021Updated 4 years ago
- Author: qq820629211, 1656724967☆11Jan 20, 2020Updated 6 years ago
- wordmaker is a GUI tool that automatically batch-generates Word documents from custom templates; supports WPS.☆15Jun 6, 2023Updated 2 years ago
- Math24o: High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 10 months ago
- Deploy machine learning models with TensorFlow Serving and Docker☆10Dec 5, 2018Updated 7 years ago
- Paddle reproduction of the paper ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information (ACL 2021)☆10Nov 15, 2021Updated 4 years ago
- One-click front-end integration of WPS add-ins☆11Nov 9, 2022Updated 3 years ago
- ☆11Jul 24, 2023Updated 2 years ago
- Implementation of the DocLLM paper for Llama models.☆13Apr 6, 2025Updated 10 months ago
- Object tracking using pyqt5 and opencv3☆10Feb 23, 2018Updated 7 years ago
- 3-digit code category table; 6-digit extended code table; disease classification and codes (revised edition); chapter names and codes☆11Aug 20, 2018Updated 7 years ago
- HGFM: A Hierarchical Grained and Feature Model for Acoustic Emotion Recognition☆11Oct 30, 2020Updated 5 years ago
- Label Studio is a multi-type data labeling and annotation tool with standardized output format☆10Nov 17, 2021Updated 4 years ago
- Zero-overhead destructors in C☆10Sep 18, 2018Updated 7 years ago
- For public-welfare use by the xyb community☆14Jun 3, 2025Updated 8 months ago
- Chinese whole-word masking (WWM) and n-gram masking based on jieba☆11Jul 25, 2019Updated 6 years ago
- CUDA code with exact k-NN algorithm for multiple GPU system.☆12Jul 5, 2024Updated last year
- ☆11Nov 27, 2018Updated 7 years ago