mkahn5 / translate-book
☆16Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for translate-book
- Process Caltech Archives' digital documents and photos, and annotate each page or image with information about its contents☆12Updated 2 years ago
- Markdown -> IPython conversion tool☆15Updated 9 years ago
- Dump of generated texts from GPT-2 trained on /r/legaladvice subreddit titles☆23Updated 5 years ago
- Small set of utilities to simplify writing Scrapy spiders.☆49Updated 9 years ago
- Find which links on a web page are pagination links☆29Updated 7 years ago
- Efficiently search the most similar strings against the query in Python.☆18Updated 6 years ago
- python-timbl, originally developed by Sander Canisius, is a Python extension module wrapping the full TiMBL C++ programming interface. Wi…☆18Updated 3 weeks ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas…☆12Updated 10 years ago
- Python crawler (using Scrapy) that uses Pa11y to check accessibility of pages as it crawls.☆17Updated 5 years ago
- ☆12Updated 5 years ago
- Feature set algebra for linguistics☆18Updated last year
- Scrapy downloader middleware that stores response HTMLs to disk.☆18Updated 6 months ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- Simple tools for summarizing .mbox email archives.☆10Updated 4 years ago
- Where I keep my Python notes for starting projects☆9Updated last year
- Turn your IPython console into a cross-database SQL client☆31Updated 8 years ago
- Extract data from an HTML table and store results to a csv file.☆38Updated 9 years ago
- A python autocompletion library. Easycomplete has a simple API and utilizes google's autocomplete results & the english dictionary for no…☆40Updated 11 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆32Updated last year
- ☆26Updated 6 years ago
- Stylometric framework in Python☆13Updated 9 years ago
- Finds linguistic patterns effortlessly☆33Updated last year
- (Deprecated - please use https://github.com/gmarmstrong/python-datamuse) Python wrapper for the Datamuse API☆15Updated 6 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆16Updated this week
- Search engine base (crawler, indexer and parser) using Python, Celery, RabbitMQ, CouchDB and Whoosh.☆11Updated last year
- Attempts to determine the natural language of a selection of Unicode (utf-8) text (a clone of http://code.google.com/p/guess-language wit…☆47Updated 14 years ago
- A web application for exploring documents topically.☆26Updated 8 years ago
- vIPer: a new tool for IPython notebooks.☆60Updated 9 years ago
- Demo of the Newspaper article extraction library.☆29Updated 10 years ago
- A natural language date parser. (Python version of chrono.js)☆25Updated 6 months ago