Scanipy stands for "scan it with Python"—it's your smart Python library for scanning and parsing complex PDF files like books, reports, articles, and academic papers. Utilizing cutting-edge Deep Learning algorithms, Scanipy transforms your PDFs into a treasure trove of extractable information: tables, images, equations, and text.
☆19Dec 30, 2023Updated 2 years ago
Alternatives and similar repositories for Scanipy
Users that are interested in Scanipy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code of the paper “A Fin-BERT-based Event Extraction Method for Chinese Financial Domain”☆12May 22, 2024Updated last year
- NLP对抗训练,包括PGD、FGM、FGSM、FreeAT☆21Apr 28, 2022Updated 4 years ago
- 使用Bert-BiLstm-CRF做中文命名实体识别,使用的数据集来自https://aistudio.baidu.com/aistudio/competition/detail/802/0/datasets☆18Mar 1, 2024Updated 2 years ago
- 基于图神经网络的一个天气推荐系统☆13Jul 22, 2023Updated 2 years ago
- Code for the paper "Knowledge Base Completion for Constructing Problem-Oriented Medical Records" at MLHC 2020☆11Jun 8, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Examples and tutorials on accessing environmental data and developing and deploying ecological modelling workflows☆19Apr 17, 2026Updated 2 weeks ago
- Source code of the Bgee pipeline used to build the Bgee database☆12Apr 29, 2026Updated last week
- Placeholder repository☆15Mar 16, 2022Updated 4 years ago
- UIE(Universal Information Extraction) infer by ncnn☆15Sep 22, 2024Updated last year
- Exercise for Investment and Portfolio Management Specialization offered by Rice on Coursera.☆11Aug 15, 2020Updated 5 years ago
- codes for paper "AttCAT: Explaining Transformers via Attentive Class Activation Tokens"☆12May 13, 2024Updated last year
- This repo is used to release the ArxivFormula dataset.☆35Nov 12, 2024Updated last year
- NichePy: a collection of Python scripts for estimating overlap of ecological niche and species distribution models.☆15Dec 5, 2011Updated 14 years ago
- ☆12Jun 28, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆23Dec 7, 2025Updated 4 months ago
- OBSOLETE: Prototype Neo4j Knowledge Graph for Coronavirus outbreaks (see NEW VERSION: https://github.com/covid-19-net/covid-19-community)☆18Nov 25, 2020Updated 5 years ago
- PDF viewer library for Gio☆16Mar 17, 2022Updated 4 years ago
- Ner with Bert☆16Apr 24, 2019Updated 7 years ago
- ☆10Jun 28, 2015Updated 10 years ago
- A Python Package for Adaptive Spatio-Temporal Exploratory Model (AdaSTEM)☆26Feb 19, 2026Updated 2 months ago
- 通过聚类分析交易流水检测异常交易☆19Nov 3, 2023Updated 2 years ago
- Free PDF creator in pure Golang☆13Apr 9, 2026Updated 3 weeks ago
- Add a Watermark image to your video record, prepend or append an intro/outro movie, in realtime☆13Oct 7, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Python package for Natural Language Processing (NLP), focused on low-resource languages spoken in Mexico.☆24Sep 4, 2025Updated 8 months ago
- 知数云 MJ画图demo,调用 Midjourney Imagine API 进 行画图☆13Jun 2, 2023Updated 2 years ago
- 可以随机生成制定数量的车牌号,因为用到停车场的虚假数据生成,所以地区集中在一个地方。支持各类车辆的生成,只需在注释的地方修改即可。☆10May 30, 2021Updated 4 years ago
- Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475☆17Feb 7, 2019Updated 7 years ago
- 字体反爬、字体混淆工具是一个用于混淆字体文件的工具,可以将字体文件中的字形进行混淆,从而防止字体文件被直接提取出来。Font Obfuscator is an open-source Python library designed to prevent web scrapi…☆14Feb 11, 2026Updated 2 months ago
- A tool for semantics-based annotation and composition of biosimulation models☆20Jan 6, 2023Updated 3 years ago
- 学习肉丝大佬的逆向笔记☆17Sep 17, 2020Updated 5 years ago
- Backtesting an RSI Trading Algorithm with Quantopian Zipline and Pyfolio Python Libraries☆21Oct 2, 2020Updated 5 years ago
- Official repository of "Neural Machine Translating from Natural Language to SPARQL"☆17Dec 15, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- go-ocr 是一款基于 Golang + ONNX 构建的 OCR 工具库,专注于为 Go 生态提供简单易用、可扩展的文字识别能力。☆59Jan 26, 2026Updated 3 months ago
- 封装Microsoft.Ink为C++动态库,可供其他语言调用手写识别☆12Feb 4, 2021Updated 5 years ago
- go 实现一个类似的v8☆13Oct 12, 2024Updated last year
- Geometry-aware Multilingual Embeddings☆26Dec 8, 2022Updated 3 years ago
- Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning"☆28Sep 25, 2023Updated 2 years ago
- Weakly-supervised Text Classification Based on Keyword Graph☆23Jan 8, 2023Updated 3 years ago
- A wails template using TypeScript + Quasar V2 (Vue 3, Vite, Sass, Pinia, ESLint, Prettier, Composition API with <script setup>)☆17Jun 26, 2023Updated 2 years ago