utunga / sentence_diffLinks
Difference English sentences via Liechtenstein distance, calculate word error rate, and list out word by word differences
☆10Updated 5 years ago
Alternatives and similar repositories for sentence_diff
Users that are interested in sentence_diff are comparing it to the libraries listed below
Sorting:
- Extract dates from text☆64Updated 4 years ago
- Automate The Boring Stuff: Updating WordPress☆12Updated 4 years ago
- Python library for extracting text from various file formats (for indexing).☆113Updated 3 years ago
- Get list of common stop words in various languages in Python☆156Updated last year
- A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them☆69Updated 2 years ago
- PyDictionary is a Dictionary Module for Python 2/3 to get meanings, translations, synonyms and antonyms of words☆281Updated 2 years ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆272Updated last year
- A simple, Qt-Webengine powered web browser with built in functionality for basic scrapy webscraping support.☆109Updated last year
- Parse numbers written in natural language☆119Updated 8 months ago
- Zyte Automatic Extraction integration for Scrapy☆56Updated 3 years ago
- 📂 Additional lookup tables and data resources for spaCy☆107Updated last month
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆73Updated 7 months ago
- Fast Indexed python HTML parser which builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modificati…☆102Updated 2 years ago
- Pythonic search engine based on PyLucene.☆128Updated 8 months ago
- remove signature blocks from emails☆86Updated 6 years ago
- A natural language date parser. (Python version of chrono.js)☆25Updated last month
- Library to populate items using XPath and CSS with a convenient API☆48Updated 3 weeks ago
- French language support for TextBlob.☆59Updated 5 years ago
- A Python library for extracting titles, images, descriptions and canonical urls from HTML.☆151Updated 5 years ago
- Scrapy downloader middleware that stores response HTMLs to disk.☆18Updated last year
- clone of https://code.google.com/p/splitta/ so it can be a git submodule☆34Updated 12 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆68Updated 2 years ago
- Automatically exported from code.google.com/p/guess-language☆53Updated last year
- Detect Language API Python Client☆70Updated 3 years ago
- Original, standard and customisable versions of the Jaro-Winkler functions.☆31Updated 2 years ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆66Updated 2 years ago
- CSS related utilities (parsing, serialization, etc) for python☆32Updated 10 months ago
- extract data from html table☆87Updated 5 years ago
- A compound word splitter for Python☆48Updated 3 years ago
- Spanish rule-based lemmatization for spaCy☆40Updated 3 years ago