JonathanRaiman / epub_conversionLinks
Python package for converting xml and epubs to text files
☆34Updated 5 years ago
Alternatives and similar repositories for epub_conversion
Users that are interested in epub_conversion are comparing it to the libraries listed below
Sorting:
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 4 years ago
- A python module that will check for package updates.☆28Updated 4 years ago
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot…☆11Updated 4 years ago
- Python wrapper for a C++ Double Metaphone☆15Updated last week
- Automatically exported from code.google.com/p/guess-language☆53Updated last year
- A maximum-strength name parser for record linkage.☆37Updated last month
- Set-oriented Operations in Pandas☆24Updated 5 years ago
- Aho-Corasick string replacement utility☆24Updated 5 years ago
- Text readability metrics in Python.☆11Updated 11 years ago
- Python and pandas tools to perform various analyses on different types of word lists☆16Updated 10 years ago
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- Demo of the Newspaper article extraction library.☆29Updated 10 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆68Updated 2 years ago
- A Flask webapp that categorizes Outlook emails using machine learning☆15Updated 9 years ago
- A natural language date parser. (Python version of chrono.js)☆25Updated last month
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆98Updated 4 years ago
- Python library to infer date format from examples☆43Updated 3 years ago
- A framework-agnostic datasets library for Machine Learning research and education.☆18Updated 2 years ago
- Python wrapper library for the Datamuse API☆80Updated 2 years ago
- Scalable String Similarity Joins in Python☆39Updated last year
- Python based Wikidata framework for easy dataframe extraction☆45Updated last year
- Exploring the shapes of stories using indico sentiment analysis APIs☆28Updated 9 years ago
- A utility tool to automate certain tasks with Jupyter notebooks.☆9Updated last year
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.☆23Updated last month
- Literature card game implementation: https://en.wikipedia.org/wiki/Literature_(card_game)☆10Updated last year
- A scraping Master-slave system based on Google App Engine☆11Updated 4 years ago
- A Python library to load structured table data from files/strings/URL with various data format: CSV / Excel / Google-Sheets / HTML / JSON…☆108Updated 2 years ago
- This is an Object Oriented implementation of a Trie in python. The class contains setter and getter methods, and implements several usefu…☆15Updated 7 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago