ocr,pdf转docx,pdf to docx
☆23Nov 4, 2022Updated 3 years ago
Alternatives and similar repositories for pdf_to_docx
Users that are interested in pdf_to_docx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 智能文本自动处理工具(Intelligent text automatic processing tool)。AutoText的功能主要有文本纠错,图片ocr、版面检测以及表格结构识别等。The main functions of this project include …☆28May 17, 2023Updated 3 years ago
- 中文版面检测(Chinese layout detection),yolov8 is used to detect the layout of Chinese document images。☆59Apr 28, 2023Updated 3 years ago
- t5-model-onnx,中文拼写纠错,Chinese spelling correction。☆15Sep 18, 2022Updated 3 years ago
- 深度网络实现意图分类。☆11Feb 26, 2021Updated 5 years ago
- 利用llm大语言模型提取卡证票据关键信息。Key Information Extraction from Image with LLM(large language model).Basically, it can extract key information from …☆15Jul 22, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- MacBERT for Chinese Spelling Correction, macbert中文拼写纠错☆16May 23, 2022Updated 4 years ago
- Title and keywords are used to generate text.☆12Dec 6, 2021Updated 4 years ago
- ☆13Jun 10, 2025Updated last year
- 利用java对文章进行分析并图谱化展示(主要提取关键词、实体、依存分析等)。☆12Apr 14, 2023Updated 3 years ago
- 利用分类法和敏感词检测法对生成式大模型的输入和输出内容进行安全检测,尽早识别风险内容。The input and output contents of generative large model are checked by classification method a…☆28Sep 9, 2024Updated last year
- pdf multimodal rag 【pdf多模态rag问答】☆28Feb 26, 2025Updated last year
- albert-fc for RE(Relation Extraction),中文关系抽取☆20Apr 24, 2023Updated 3 years ago
- model2onnx,将roberta和macbert模型转为onnx格式,并进行推理。☆19Jul 13, 2022Updated 3 years ago
- 利用Swin-Unet(Swin Transformer Unet)实现对文档图片里表格结构的识别,Swin-unet (Swin Transformer Unet) is used to identify the document table structure☆27Feb 23, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- 利用sklearn和gensim中的tfidf,lsa,doc2vec进行查询与文档匹配搜索☆21Sep 11, 2022Updated 3 years ago
- albert-fc for LP(Link Prediction),中文实体链接预测☆19Apr 21, 2023Updated 3 years ago
- Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)☆12Mar 21, 2022Updated 4 years ago
- ☆13Mar 16, 2021Updated 5 years ago
- text security audit 安全审核-语义模型过滤 敏感内容检测系统☆39Feb 14, 2025Updated last year
- pytorch大规模数据读取dataset☆13May 30, 2022Updated 4 years ago
- [NAACL 2025] Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding☆21Aug 23, 2025Updated 9 months ago
- chinese sentence punctuation prediction,中文句子标点符号预测。☆29Oct 19, 2022Updated 3 years ago
- shot_boundary_detection☆10Nov 26, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Fire point detection project maintenance. It provides two fire point detection methods: visible light and infrared, with high detection a…☆13Jul 2, 2024Updated last year
- pdf invoice parser,pdf-ofd发票解析。☆44Jul 15, 2024Updated last year
- gcn for prediction of protein interactions,图卷积用于蛋白质相互作用。☆24Nov 15, 2021Updated 4 years ago
- Track and blur any object or person in a video.☆15Feb 10, 2024Updated 2 years ago
- Dataset for EMNLP'23 Paper "DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading"☆11Oct 25, 2023Updated 2 years ago
- 文档图像处理工具(Document image processing tool),包括漂白 / 文字方向矫正 / 清晰增强 / 笔记去噪美化 / 去阴影 / 扭曲矫正 / 切边 增强(DocBleach / TextOrientationCorrection / DocSha…☆133Aug 27, 2024Updated last year
- 根据维基百科历史编辑数据提取纠错语料。☆12Apr 6, 2022Updated 4 years ago
- ☆14Dec 9, 2023Updated 2 years ago
- Official code implementation of " TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image " in Pattern Recognition