OCR, PDF to DOCX conversion
☆23Nov 4, 2022Updated 3 years ago
Alternatives and similar repositories for pdf_to_docx
Users who are interested in pdf_to_docx are comparing it to the libraries listed below.
- Converts OCR and other models from Paddle to ONNX format, then loads the ONNX models with DJL, a Java deep learning framework, for experimental inference.☆13Nov 15, 2022Updated 3 years ago
- Intelligent text automatic processing tool. The main functions of AutoText include text correction, image OCR, layout detection, and table structure recognition.☆27May 17, 2023Updated 2 years ago
- TABLE DETECTION IN IMAGES AND OCR TO CSV WITH JAVA☆10Jul 18, 2023Updated 2 years ago
- Chinese layout detection: YOLOv8 is used to detect the layout of Chinese document images.☆58Apr 28, 2023Updated 2 years ago
- t5-model-onnx: Chinese spelling correction.☆15Sep 18, 2022Updated 3 years ago
- Intent classification with deep networks.☆11Feb 26, 2021Updated 5 years ago
- TextCNN for advertising detection☆11Jan 12, 2024Updated 2 years ago
- MacBERT for Chinese Spelling Correction☆16May 23, 2022Updated 3 years ago
- ☆14Jun 10, 2025Updated 9 months ago
- EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing☆23May 29, 2025Updated 10 months ago
- Uses classification and sensitive-word detection to run safety checks on the inputs and outputs of generative large models, identifying risky content as early as possible.☆28Sep 9, 2024Updated last year
- model2onnx: converts RoBERTa and MacBERT models to ONNX format and runs inference.☆19Jul 13, 2022Updated 3 years ago
- Swin-Unet (Swin Transformer Unet) is used to recognize table structure in document images.☆28Feb 23, 2024Updated 2 years ago
- Query-document matching search using TF-IDF, LSA, and doc2vec from sklearn and gensim.☆21Sep 11, 2022Updated 3 years ago
- onnx-java: loads ONNX models in Java and runs inference.☆22May 19, 2022Updated 3 years ago
- albert-fc for LP (Link Prediction): Chinese entity link prediction☆19Apr 21, 2023Updated 2 years ago
- ☆13Mar 16, 2021Updated 5 years ago
- PyTorch dataset for reading large-scale data☆13May 30, 2022Updated 3 years ago
- Text security audit: semantic-model filtering and sensitive content detection system☆38Feb 14, 2025Updated last year
- [NAACL 2025] Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding☆20Aug 23, 2025Updated 7 months ago
- Track and blur any object or person in a video.☆14Feb 10, 2024Updated 2 years ago
- Question knowledge point prediction and labeling.☆22Sep 11, 2022Updated 3 years ago
- Chinese sentence punctuation prediction.☆29Oct 19, 2022Updated 3 years ago
- shot_boundary_detection☆10Nov 26, 2019Updated 6 years ago
- Official implementation of paper "Vision Graph Prompting via Semantic Low-Rank Decomposition", ICML 2025☆16Dec 25, 2025Updated 3 months ago
- Fire point detection project maintenance. It provides two fire point detection methods: visible light and infrared, with high detection a…☆12Jul 2, 2024Updated last year
- An implementation similar to a blind watermark.☆11Sep 13, 2022Updated 3 years ago
- PDF invoice parser: parses PDF and OFD invoices.☆40Jul 15, 2024Updated last year
- Dataset for EMNLP'23 Paper "DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading"☆11Oct 25, 2023Updated 2 years ago
- Extracts error-correction corpora from Wikipedia edit-history data.☆12Apr 6, 2022Updated 3 years ago
- Official code implementation of "TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image" in Pattern Recognition☆24Apr 24, 2024Updated last year
- ☆13Sep 25, 2019Updated 6 years ago
- ☆14Dec 9, 2023Updated 2 years ago
- This project uses JNI to load the C++-compiled DLL libraries of paddle-ocr and uses Spring Boot for web deployment and access.☆37Dec 30, 2024Updated last year
- This project used Yolov8/AnimeGAN and Flask to accomplish the task of background segmentation, background removal and background replacem…☆12Apr 12, 2024Updated last year
- ☆28Jul 16, 2024Updated last year
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆32Nov 16, 2024Updated last year
- ☆16Jul 5, 2023Updated 2 years ago
- A polygon detector based on obb-yolov3 (WIP)☆17Jul 21, 2021Updated 4 years ago