jessepollak / urlmatch
🔥 A Python library for easily pattern matching wildcard URLs
★40 Updated 8 years ago
Alternatives and similar repositories for urlmatch
Users who are interested in urlmatch are comparing it to the libraries listed below.
- Fast Indexed python HTML parser which builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modificati… ★102 Updated 2 years ago
- Homoglyphs: get similar letters, convert to ASCII, detect possible languages and UTF-8 group. ★83 Updated 4 years ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text. ★55 Updated 8 months ago
- CSS related utilities (parsing, serialization, etc) for python ★32 Updated last week
- URL normalization for Python ★98 Updated 5 months ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency. ★190 Updated 3 years ago
- This package lets your server send Web Push Notifications to your clients. No special web framework is required (such as Django, Flask, P… ★29 Updated 2 years ago
- A Python binding of SQLite Full Text Search Tokenizer ★48 Updated 2 weeks ago
- Fast and robust date extraction from web pages, with Python or on the command-line ★141 Updated last month
- Modern robots.txt Parser for Python ★196 Updated last year
- Python WSGI Middleware for adding HTTP/S proxy support to any WSGI Application ★24 Updated 4 years ago
- A fully customisable language detection pipeline for spaCy ★93 Updated 6 years ago
- Python Simple Object Storage - provides a list and dictionary interface that seamlessly stores data in a file, like a simplified database… ★58 Updated 2 years ago
- Python package for HTTP/1.1 style headers. Parse headers to objects. Most advanced available structure for http headers. ★117 Updated 3 weeks ago
- A Python library for extracting titles, images, descriptions and canonical urls from HTML. ★151 Updated 5 years ago
- Accurately find/replace/remove emojis in text strings ★162 Updated last year
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON. ★23 Updated 3 months ago
- A scrapy middleware to use rotated proxy ip list. ★24 Updated 7 years ago
- Fast Autocomplete: When Elasticsearch suggestions are not fast and flexible enough ★286 Updated 3 weeks ago
- A tool to get email addresses by action types such as `starred`, `watching` or `fork` on GitHub repositories; Sending email content to… ★90 Updated 4 years ago
- Fast multi-keyword search engine for text strings ★257 Updated last year
- ★70 Updated 2 years ago
- Extract text from HTML ★134 Updated 5 years ago
- URLExtract is a Python class for collecting (extracting) URLs from given text based on locating TLD. ★268 Updated last year
- Lightning Fast Language Prediction ★167 Updated last month
- THIS REPOSITORY IS FORK ★30 Updated 2 years ago
- CLI based diff viewer ★23 Updated 4 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more. ★98 Updated 4 years ago
- YAML-formatted plain-text file based models for Flask backed by Flask-SQLAlchemy ★23 Updated 8 months ago
- Find which links on a web page are pagination links ★29 Updated 8 years ago