JonathanRaiman / epub_conversion
Python package for converting xml and epubs to text files
☆34Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for epub_conversion
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- Python wrapper for a C++ Double Metaphone☆15Updated last year
- A python module that will check for package updates.☆28Updated 3 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- A maximum-strength name parser for record linkage.☆34Updated 3 months ago
- Python and pandas tools to perform various analyses on different types of word lists☆16Updated 9 years ago
- A Flask webapp that categorizes Outlook emails using machine learning☆15Updated 9 years ago
- Twitter Discovery: Search articles referenced in your tweets, retweets, and favorites☆15Updated 4 years ago
- Aho-Corasick string replacement utility☆23Updated 4 years ago
- 🐾 PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file.☆15Updated last year
- Graph extraction and NLP analysis for Baleen Corpora☆18Updated 8 years ago
- Comparing Polars to Pandas and a small introduction☆43Updated 3 years ago
- ☆29Updated 2 years ago
- Markov chain generator for Python and/or Swift☆65Updated 2 years ago
- https://mimesniff.spec.whatwg.org/ implementation for Python☆14Updated 10 months ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 3 years ago
- Techniques & resources for training interpretable ML models, explaining ML models, and debugging ML models.☆21Updated 2 years ago
- Model drift detection☆11Updated last year
- An easy-to-use Python wrapper for the Don Best Sports Data API.☆16Updated last year
- Binary Python bindings for poppler utils for content extraction☆42Updated 3 years ago
- A markdown wiki and dashboarding system for Datasette☆21Updated 3 years ago
- Python Data Collection Library☆46Updated 3 years ago
- Enhance your feature engineering workflow with Kodiak☆20Updated last year
- Datasette plugin for authenticating access using API tokens☆12Updated 2 months ago
- Python library to infer date format from examples☆42Updated 3 years ago
- Datasette plugin providing instructions for exporting data to Jupyter or Observable☆12Updated last year
- Automated Jupyter notebook testing. 📙☆41Updated 9 months ago
- Generate reports for spaCy models.☆28Updated 2 years ago