Scanipy stands for "scan it with Python". It is a Python library for scanning and parsing complex PDF files such as books, reports, articles, and academic papers. Scanipy uses deep learning models to extract tables, images, equations, and text from PDFs.
☆19 · Dec 30, 2023 · Updated 2 years ago
Alternatives and similar repositories for Scanipy
Users who are interested in Scanipy are comparing it to the libraries listed below.
- ☆14 · Sep 9, 2024 · Updated last year
- Code of the paper “A Fin-BERT-based Event Extraction Method for Chinese Financial Domain” ☆12 · May 22, 2024 · Updated last year
- Code for the paper "Reinforced Abstractive Summarization with Adaptive Length Controlling". ☆11 · May 13, 2022 · Updated 3 years ago
- Code for WWW 2023 paper "HISum: Hyperbolic Interaction Model for Extractive Multi-Document Summarization". ☆14 · Nov 22, 2023 · Updated 2 years ago
- Some tips on paper writing skills. ☆15 · May 25, 2022 · Updated 3 years ago
- A Question Answering System for Domain Knowledge Graphs ☆11 · Feb 24, 2022 · Updated 4 years ago
- Hierarchical convolutional attention networks for text classification ☆16 · Aug 1, 2019 · Updated 6 years ago
- Examples and tutorials on accessing environmental data and developing and deploying ecological modelling workflows ☆19 · Updated this week
- Re-implementation of Bi- (or, Dual-) encoder for Entity Linking. You can run experiments in only 3 seconds. ☆11 · Jun 12, 2023 · Updated 2 years ago
- ☆15 · May 20, 2018 · Updated 7 years ago
- UIE (Universal Information Extraction) inference with ncnn ☆15 · Sep 22, 2024 · Updated last year
- Code for the paper "AttCAT: Explaining Transformers via Attentive Class Activation Tokens" ☆12 · May 13, 2024 · Updated last year
- This repo is used to release the ArxivFormula dataset. ☆35 · Nov 12, 2024 · Updated last year
- ☆12 · Jun 28, 2023 · Updated 2 years ago
- OBSOLETE: Prototype Neo4j Knowledge Graph for Coronavirus outbreaks (see NEW VERSION: https://github.com/covid-19-net/covid-19-community) ☆18 · Nov 25, 2020 · Updated 5 years ago
- ☆19 · Jun 11, 2018 · Updated 7 years ago
- NER with BERT ☆16 · Apr 24, 2019 · Updated 6 years ago
- Data for ArXiv 2024 paper "Large Language Models as Zero-Shot Keyphrase Extractors: A Preliminary Empirical Study". ☆23 · Mar 10, 2024 · Updated 2 years ago
- Add a watermark image to your video recording, or prepend or append an intro/outro movie, in real time ☆13 · Oct 7, 2021 · Updated 4 years ago
- A small example of image CAPTCHA recognition using a convolutional neural network in PyTorch ☆11 · Oct 14, 2024 · Updated last year
- A generic entity retrieval service for linked data. Contains presets to replicate the DBpedia Lookup service. ☆51 · Feb 12, 2025 · Updated last year
- A font anti-scraping and font obfuscation tool that obfuscates the glyphs in a font file so that the font file cannot be extracted directly. Font Obfuscator is an open-source Python library designed to prevent web scrapi… ☆13 · Feb 11, 2026 · Updated 2 months ago
- [ACL'23 Findings] This is the code repo for our ACL'23 Findings paper "ReGen: Zero-Shot Text Classification via Training Data Generation … ☆24 · Sep 8, 2023 · Updated 2 years ago
- LaTeXDataHub is an open-source platform dedicated to the sharing and contribution of real-world LaTeX image datasets and their annotation… ☆12 · Aug 13, 2024 · Updated last year
- Notes on learning reverse engineering from 肉丝 ☆17 · Sep 17, 2020 · Updated 5 years ago
- Backtesting an RSI Trading Algorithm with Quantopian Zipline and Pyfolio Python Libraries ☆21 · Oct 2, 2020 · Updated 5 years ago
- Official repository of "Neural Machine Translating from Natural Language to SPARQL" ☆17 · Dec 15, 2020 · Updated 5 years ago
- Wraps Microsoft.Ink as a C++ dynamic library so that other languages can call its handwriting recognition ☆12 · Feb 4, 2021 · Updated 5 years ago
- A V8-like engine implemented in Go ☆13 · Oct 12, 2024 · Updated last year
- Geometry-aware Multilingual Embeddings ☆26 · Dec 8, 2022 · Updated 3 years ago
- Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning" ☆28 · Sep 25, 2023 · Updated 2 years ago
- Convert LaTeX-OCR To ONNX ☆14 · Apr 2, 2024 · Updated 2 years ago
- Weakly-supervised Text Classification Based on Keyword Graph ☆23 · Jan 8, 2023 · Updated 3 years ago
- Data for EACL 2023 paper "A Survey on Recent Advances in Keyphrase Extraction from Pre-trained Language Models". ☆45 · Dec 23, 2023 · Updated 2 years ago
- Probabilistic Matrix Factorization in PyTorch ☆22 · Feb 21, 2019 · Updated 7 years ago
- This dataset contains human judgements about answer equivalence. The data is based on SQuAD (Stanford Question Answering Dataset), and co… ☆28 · Oct 24, 2022 · Updated 3 years ago
- A simple, efficient, quantum computer simulator. ☆31 · May 24, 2021 · Updated 4 years ago
- ☆20 · May 17, 2023 · Updated 2 years ago
- A wails template using TypeScript + Quasar V2 (Vue 3, Vite, Sass, Pinia, ESLint, Prettier, Composition API with <script setup>) ☆17 · Jun 26, 2023 · Updated 2 years ago