dmiro / bagofwords
The main goal of this Python module is to provide functions to apply Text Classification.
☆10 Updated 8 years ago
Alternatives and similar repositories for bagofwords:
Users who are interested in bagofwords are comparing it to the libraries listed below:
- Restrict crawl and scraping scope using matchers. ☆25 Updated 8 years ago
- A language for filtering, matching, and validating Python dictionaries ☆47 Updated last year
- Python3 SOAP client built with lxml and requests. ☆43 Updated 4 years ago
- A Python library for finding feed links on websites. ☆50 Updated 2 years ago
- ☆18 Updated 8 years ago
- sitemap.xml generation using lxml with support for alternates. ☆12 Updated last year
- Find which links on a web page are pagination links ☆29 Updated 8 years ago
- Extract, parse and populate templates from strings ☆27 Updated 5 years ago
- Python 3 AsyncIO powered scraping framework with batteries included ☆20 Updated 8 years ago
- Extends zip() and itertools.zip_longest() to generate named tuples. ☆23 Updated 5 years ago
- ⇔ IterTable is a Pythonic API for iterating through tabular data formats, including CSV, XLSX, XML, and JSON. ☆51 Updated last year
- Detect and classify pagination links ☆15 Updated 4 years ago
- Decorator based advanced configuration engine. ☆22 Updated 3 weeks ago
- Search 'from' and 'to' strings to learn a text cleaning mapping ☆17 Updated 9 years ago
- Faster replacement for Python's urlparse module ☆46 Updated 6 years ago
- 📑⚙️ Python/Django reference implementation of the ERAV data model ☆21 Updated 5 years ago
- Scrapy spider middleware to clean up query parameters in request URLs ☆25 Updated 8 years ago
- xmldataset: xml parsing made easy 🗃️ ☆78 Updated 4 years ago
- gametight lightweight caching library for python ☆64 Updated 2 years ago
- extract difference between two html pages ☆32 Updated 6 years ago
- 📚 Ordered Multivalue Dictionary. Helps power furl. ☆68 Updated 2 years ago
- Streaming newline delimited JSON I/O. ☆12 Updated last year
- ☆15 Updated 5 years ago
- ☆38 Updated 8 years ago
- Python binding for gumbo-parser using Cython ☆14 Updated 8 years ago
- Python implementation of the Parsley language for extracting structured data from web pages ☆92 Updated 7 years ago
- Python module to add support for ORM-style filtering to any list of items ☆22 Updated 6 years ago
- URL Transformation, Sanitization ☆103 Updated last year
- Set of basic Python collections backed by Redis ☆117 Updated 9 months ago