JonathanRaiman / epub_conversionLinks
Python package for converting xml and epubs to text files
☆33Updated 5 years ago
Alternatives and similar repositories for epub_conversion
Users that are interested in epub_conversion are comparing it to the libraries listed below
Sorting:
- Aho-Corasick string replacement utility☆24Updated 5 years ago
- A python module that will check for package updates.☆28Updated 4 years ago
- A maximum-strength name parser for record linkage.☆38Updated 2 months ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆98Updated 4 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆153Updated last month
- Python wrapper for a C++ Double Metaphone☆15Updated this week
- Python library to infer date format from examples☆45Updated 3 years ago
- Comparing Polars to Pandas and a small introduction☆44Updated 4 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆68Updated 2 years ago
- A natural language date parser. (Python version of chrono.js)☆25Updated 3 months ago
- Extract text from HTML☆134Updated 5 years ago
- A Python library to load structured table data from files/strings/URL with various data format: CSV / Excel / Google-Sheets / HTML / JSON…☆108Updated 2 years ago
- A utility for labeling clusters of text data.☆28Updated 4 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- Set-oriented Operations in Pandas☆24Updated 5 years ago
- A curated list of ML awesome frameworks & libraries for text data☆16Updated 2 years ago
- Twitter Discovery: Search articles referenced in your tweets, retweets, and favorites☆16Updated 5 years ago
- Python based Wikidata framework for easy dataframe extraction☆45Updated last year
- 🏃♀️ Minimalistic CLI Tool for Managing and Running Bash Snippets☆37Updated 5 years ago
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot…☆11Updated 4 years ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.☆23Updated 2 months ago
- Markov chain generator for Python and/or Swift☆66Updated 3 years ago
- A small wrapper around python logging module which can easily format and write logs to file.☆12Updated 2 years ago
- Markdown template for Dataseets for Datasets☆63Updated 3 years ago
- Generate reports for spaCy models.☆29Updated 3 years ago
- ipython + REPL + coroutines - suffering☆19Updated last year
- Python package + CLI to generate wordclouds of Twitter tweets.☆77Updated 5 years ago
- Atom, RSS and JSON feed parser for Python 3☆117Updated 2 years ago
- A Python library for extracting titles, images, descriptions and canonical urls from HTML.☆151Updated 5 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 4 years ago