Scanipy stands for "scan it with Python"—it's your smart Python library for scanning and parsing complex PDF files like books, reports, articles, and academic papers. Utilizing cutting-edge Deep Learning algorithms, Scanipy transforms your PDFs into a treasure trove of extractable information: tables, images, equations, and text.
☆19Dec 30, 2023Updated 2 years ago
Alternatives and similar repositories for Scanipy
Users that are interested in Scanipy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Sep 9, 2024Updated last year
- Code of the paper “A Fin-BERT-based Event Extraction Method for Chinese Financial Domain”☆12May 22, 2024Updated 2 years ago
- Table Extraction including Table Detection and Table Structure Recognition using Table Transformers Microsoft☆15Oct 13, 2022Updated 3 years ago
- NLP对抗训练,包括PGD、FGM、FGSM、FreeAT☆21Apr 28, 2022Updated 4 years ago
- 使用Bert-BiLstm-CRF做中文命名实体识别,使用的数据集来自https://aistudio.baidu.com/aistudio/competition/detail/802/0/datasets☆18Mar 1, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- hierarchical convolutional attention networks for text classification☆16Aug 1, 2019Updated 6 years ago
- 基于图神经网络的一个天气推荐系统☆13Jul 22, 2023Updated 2 years ago
- Code for the paper "Knowledge Base Completion for Constructing Problem-Oriented Medical Records" at MLHC 2020☆11Jun 8, 2021Updated 4 years ago
- Examples and tutorials on accessing environmental data and developing and deploying ecological modelling workflows☆19Apr 17, 2026Updated last month
- Placeholder repository☆15Mar 16, 2022Updated 4 years ago
- stochastic neural networks in R☆25Feb 27, 2018Updated 8 years ago
- Exercise for Investment and Portfolio Management Specialization offered by Rice on Coursera.☆11Aug 15, 2020Updated 5 years ago
- codes for paper "AttCAT: Explaining Transformers via Attentive Class Activation Tokens"☆12May 13, 2024Updated 2 years ago
- This repo is used to release the ArxivFormula dataset.☆35Nov 12, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆12Jun 28, 2023Updated 2 years ago
- ☆19Jun 11, 2018Updated 7 years ago
- PDF viewer library for Gio☆16Mar 17, 2022Updated 4 years ago
- Ner with Bert☆16Apr 24, 2019Updated 7 years ago
- A Python Package for Adaptive Spatio-Temporal Exploratory Model (AdaSTEM)☆26Feb 19, 2026Updated 3 months ago
- Free PDF creator in pure Golang☆13Apr 9, 2026Updated last month
- Python package for Natural Language Processing (NLP), focused on low-resource languages spoken in Mexico.☆24Sep 4, 2025Updated 8 months ago
- 知数云 MJ画图demo,调用 Midjourney Imagine API 进行画图☆13Jun 2, 2023Updated 2 years ago
- 字体反爬、字体混淆工具是一个用于混淆字体文件的工具,可以将字体文件中的字形进行混淆,从而防止字体文件被直接提取出来。Font Obfuscator is an open-source Python library designed to prevent web scrapi…☆14Feb 11, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A tool for semantics-based annotation and composition of biosimulation models☆20Jan 6, 2023Updated 3 years ago
- LaTeXDataHub is an open-source platform dedicated to the sharing and contribution of real-world LaTeX image datasets and their annotation…☆12Aug 13, 2024Updated last year
- 学习肉丝大佬的逆向笔记☆17Sep 17, 2020Updated 5 years ago
- Official repository of "Neural Machine Translating from Natural Language to SPARQL"☆17Dec 15, 2020Updated 5 years ago
- A library for static attributed graph outlier detection. 静态属性图节点的异常检测模型集。☆31Nov 19, 2023Updated 2 years ago
- 封装Microsoft.Ink为C++动态库,可供其他语言调用手写识别☆12Feb 4, 2021Updated 5 years ago
- go 实现一个类似的v8☆13Oct 12, 2024Updated last year
- Geometry-aware Multilingual Embeddings☆26Dec 8, 2022Updated 3 years ago
- Weakly-supervised Text Classification Based on Keyword Graph☆23Jan 8, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Convert LaTeX-OCR To ONNX☆14Apr 2, 2024Updated 2 years ago
- A simple, efficient, quantum computer simulator.☆31May 24, 2021Updated 5 years ago
- Windows Toast Notifications☆16Mar 21, 2025Updated last year
- ☆20May 17, 2023Updated 3 years ago
- Entity Linking in Queries: Tasks and Evaluation☆33Sep 13, 2023Updated 2 years ago
- Given a Wikipedia article, generate N "good" questions and answer N questions.☆15May 12, 2017Updated 9 years ago
- 本项目演示如何在PyTorch中使用Transformer模型进行中文文本分类☆39Mar 21, 2023Updated 3 years ago