ecatkins / xpdf_pythonLinks
Python wrapper for xpdf
☆19Updated 5 years ago
Alternatives and similar repositories for xpdf_python
Users that are interested in xpdf_python are comparing it to the libraries listed below
Sorting:
- ☆22Updated 6 years ago
- Text preprocessing tools in python.☆27Updated 7 years ago
- Named entity recognition for the legal domain☆42Updated 4 years ago
- Language detection using Spacy and Fasttext☆57Updated last year
- Calculate readability scores☆42Updated 6 years ago
- Extract dates from text☆64Updated 4 years ago
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG.☆105Updated 2 years ago
- A Python package to get useful information from documents using TopicRank Algorithm.☆16Updated 2 years ago
- Text summarization using spacy☆22Updated 2 years ago
- 🚀GUI for training spaCy models☆55Updated 4 years ago
- A fully customisable language detection pipeline for spaCy☆93Updated 6 years ago
- A simple library for segmenting legal texts☆17Updated 2 years ago
- A python client for connecting to all the services provided by https://dandelion.eu☆36Updated last year
- A web application that identifies party in political discourse and an example of operationalized machine learning.☆28Updated 6 years ago
- semantically distinct key phrase extraction using hilbert hashes.☆50Updated 3 years ago
- ☆16Updated last year
- Using ML to extract campaign finance data from messy forms for journalism☆76Updated 3 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 4 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- A visualisation tool for Spacy using Hierplane.☆65Updated 2 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆25Updated 2 years ago
- This is a REST Server endpoint built using Flask and Python.☆24Updated 2 years ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 4 years ago
- An NLP pipeline for COVID-19 surveillance used in the Department of Veterans Affairs Biosurveillance.☆16Updated 2 years ago
- ☆19Updated 3 years ago
- Dataiku DSS plugin to detect languages, correct misspellings, and clean text data 🧼☆22Updated 5 months ago
- Cython wrapper on Hunspell Dictionary☆23Updated last year
- Streamlit component for Jina neural search☆41Updated 3 years ago
- ☆19Updated 3 years ago