Easy PDF to text to spaCy text extraction in Python.
☆40Dec 29, 2025Updated 2 months ago
Alternatives and similar repositories for spacypdfreader
Users that are interested in spacypdfreader are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Automatic Scraping project for extracting FAQ and Help center articles☆13Dec 8, 2023Updated 2 years ago
- ☆23Aug 13, 2023Updated 2 years ago
- ☆21Dec 4, 2024Updated last year
- Datasets and functions for the Handbook of Educational Measurement and Psychometrics using R.☆24Apr 2, 2021Updated 4 years ago
- Finds linguistic patterns effortlessly☆39Aug 29, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆17Oct 23, 2020Updated 5 years ago
- make variables remember their history☆15Jun 2, 2020Updated 5 years ago
- This ohsome R package grants access to the power of the ohsome API from R.☆12Oct 4, 2023Updated 2 years ago
- The Polars library has emerged as the most sought-after Python package in data science, owing to its impressive speed and dplyr-like synt…☆13Sep 27, 2024Updated last year
- Data Policies in top economics journals.☆12Jun 21, 2022Updated 3 years ago
- Keyword spaCy is a spaCy pipeline component for extracting keywords from text using cosine similarity.☆13Dec 7, 2023Updated 2 years ago
- NCCS data platform powered by Jekyll☆11Mar 4, 2026Updated 3 weeks ago
- R package: find footballers' common team mates☆14Feb 6, 2022Updated 4 years ago
- WhatIf: Software for Evaluating Counterfactuals☆18May 17, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation…☆14Dec 6, 2025Updated 3 months ago
- Digital Research Toolkit for Linguists course materials☆12Jul 23, 2025Updated 8 months ago
- MFAQ: a Multilingual FAQ Dataset☆18Sep 17, 2023Updated 2 years ago
- Emory Language and Information Toolkit☆39Apr 16, 2025Updated 11 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆82Aug 31, 2023Updated 2 years ago
- The notebooks in this repository are recipes for energy and climate modelers. They require open source software and they can run locally …☆19Updated this week
- A Streamlit application to visualize sentence embeddings☆18Dec 21, 2022Updated 3 years ago
- Code for doing Argument Structure Prediction using Residual Networks and (almost) without symbolic features☆11May 24, 2023Updated 2 years ago
- Post-Specialisation: Retrofitting Vectors of Words Unseen in Lexical Resources☆12Apr 12, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ShinyApp: Shiny What You See Is What You Get (WYSIWYG) editor☆29Nov 9, 2020Updated 5 years ago
- Create and analyze argument graphs and serialize them via Protobuf☆10Mar 18, 2026Updated last week
- A repository that showcases how you can use ZenML with Git☆75Jan 13, 2026Updated 2 months ago
- Julia implementation of Modal Decision Trees & Forests, for interpretable classification of spatial and temporal data. Long live Symbolic…☆12Updated this week
- Reflective memory for AI agents☆23Mar 20, 2026Updated last week
- Low-carbon Expansion Generation Optimization (LEGO) model☆10Nov 16, 2022Updated 3 years ago
- Code for experiments on transformers using Markovian data.☆22Nov 22, 2024Updated last year
- Language detection using Spacy and Fasttext☆56Dec 17, 2023Updated 2 years ago
- The OpenCitations RDF Resource Browser☆15Oct 29, 2025Updated 4 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Portal: GUI Tools for Agents☆25Sep 18, 2025Updated 6 months ago
- Fuzzy matching and more functionality for spaCy.☆258Jul 6, 2024Updated last year
- An R package for network centrality☆49Sep 23, 2025Updated 6 months ago
- Rust crate for auto-discovery of feeds in HTML content☆12Dec 14, 2021Updated 4 years ago
- Python wrapper for the CWB to extract concordances and score frequency lists☆22Jan 12, 2026Updated 2 months ago
- ☆13Sep 30, 2025Updated 5 months ago
- A neural RST discourse parser with well pre-trained XLNet.☆17Jun 13, 2022Updated 3 years ago