jessepollak / urlmatch
A Python library for easily pattern matching wildcard URLs
★40 · Updated 8 years ago
Alternatives and similar repositories for urlmatch
Users who are interested in urlmatch are comparing it to the libraries listed below.
- A Python binding of SQLite Full Text Search Tokenizer ★50 · Updated 2 months ago
- ★26 · Updated last year
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text. ★57 · Updated last year
- URL normalization for Python ★99 · Updated 9 months ago
- Python Simple Object Storage - provides a list and dictionary interface that seamlessly stores data in a file, like a simplified database… ★59 · Updated 3 years ago
- Loadable spellfix1 extension for sqlite as python package ★27 · Updated last year
- Binary Python bindings for poppler utils for content extraction ★42 · Updated 4 years ago
- Accurately find/replace/remove emojis in text strings ★163 · Updated 2 years ago
- Fast multi-keyword search engine for text strings ★258 · Updated last year
- Homoglyphs: get similar letters, convert to ASCII, detect possible languages and UTF-8 group. ★84 · Updated 5 years ago
- Fast Indexed python HTML parser which builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modificati… ★102 · Updated 2 years ago
- ★70 · Updated 3 years ago
- A Python library for extracting titles, images, descriptions and canonical urls from HTML. ★151 · Updated 5 years ago
- Scrapy downloader middleware that stores response HTMLs to disk. ★18 · Updated 3 weeks ago
- Extract text from HTML ★134 · Updated 2 weeks ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want) ★68 · Updated 2 years ago
- A Python library to check for (and clean) profanity in strings. ★63 · Updated 4 years ago
- A helper library full of URL-related heuristics. ★73 · Updated this week
- CoCrawler is a versatile web crawler built using modern tools and concurrency. ★193 · Updated 3 years ago
- Python library for modern thread / multiprocessing pooling and task processing via asyncio ★15 · Updated 5 years ago
- Predicts likes, comments, or total interactions of a Facebook page post using machine learning ★10 · Updated 7 years ago
- Easy Html Parser is an AST generator for HTML/XML documents. You can easily delete/insert/extract tags in HTML/XML documents as well as l… ★52 · Updated 6 years ago
- Find which links on a web page are pagination links ★29 · Updated 9 years ago
- Scrapy middleware for the autologin ★36 · Updated this week
- Flatten, format, and export any JSON-like data to CSV (or any other string output). ★17 · Updated 4 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ★76 · Updated 2 weeks ago
- Language detection using spaCy and fastText ★57 · Updated 2 years ago
- Lightweight library that converts an HTML webpage to JSON data using a template defined in JSON. ★23 · Updated 8 months ago
- ★98 · Updated 7 years ago
- Python Event Driven System ★12 · Updated 4 years ago