ecatkins / xpdf_python
Python wrapper for xpdf
☆19 · Updated 5 years ago
Alternatives and similar repositories for xpdf_python:
Users who are interested in xpdf_python are comparing it to the libraries listed below.
- Language detection using spaCy and fastText ☆54 · Updated last year
- Python version of the SymSpell Compound algorithm ☆12 · Updated 6 years ago
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG. ☆105 · Updated 2 years ago
- Natural Language Generation for Gramex applications. ☆24 · Updated 2 years ago
- Calculate readability scores ☆40 · Updated 5 years ago
- ☆20 · Updated 2 years ago
- Extract dates from text ☆64 · Updated 4 years ago
- Soundex Phonetic Code Algorithm Demo for Indian Languages. Supports all Indian languages and English. Provides intra-indic string compari… ☆56 · Updated 5 years ago
- Tool for sentiment analysis annotation ☆12 · Updated 3 months ago
- ☆32 · Updated 6 years ago
- Using ML to extract campaign finance data from messy forms for journalism ☆76 · Updated 2 years ago
- Sample code for the tech blog post "Porting Flask to FastAPI for ML Model Serving" ☆29 · Updated last year
- A visualisation tool for spaCy using Hierplane. ☆65 · Updated 2 years ago
- python-docx run manipulation ☆21 · Updated 3 years ago
- Hybrid architecture media server, media service and Streamlit client app using FastAPI and Python ☆13 · Updated 2 years ago
- Text preprocessing tools in Python. ☆26 · Updated 6 years ago
- Binary Python bindings for poppler utils for content extraction ☆42 · Updated 3 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values … ☆15 · Updated 6 years ago
- How to do data science with Optimus, Spark and Python. ☆19 · Updated 5 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP tasks with legal texts. ☆37 · Updated 5 years ago
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras… ☆11 · Updated 6 years ago
- ☆15 · Updated 3 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated… ☆26 · Updated 2 years ago
- Python package for performing deduplication using flexible text matching and cleaning in pandas DataFrames ☆25 · Updated 4 years ago
- Generic Environment for Context-Aware Correction of Orthography ☆22 · Updated 2 years ago
- Text analysis for automatic bookmarking/keyword extraction ☆18 · Updated 8 years ago
- A Python package to get useful information from documents using the TopicRank algorithm. ☆16 · Updated last year
- Demo application for the PyData Conference in Amsterdam 2018 ☆30 · Updated 6 years ago
- Collection of code snippets and utilities for Streamlit apps ☆22 · Updated 4 years ago
- A Python client for connecting to all the services provided by https://dandelion.eu ☆36 · Updated last year