dmiro / bagofwordsLinks
The main goal this Python module is to provide functions to apply Text Classification.
☆10Updated 9 years ago
Alternatives and similar repositories for bagofwords
Users that are interested in bagofwords are comparing it to the libraries listed below
Sorting:
- Extract, parse and populate templates from strings☆27Updated 6 years ago
- Python implementation of the Parsley language for extracting structured data from web pages☆92Updated 8 years ago
- csvcat☆22Updated 9 years ago
- Restrict crawl and scraping scope using matchers.☆26Updated 9 years ago
- Regular Expression based parsers for extracting data from natural languages☆71Updated 8 years ago
- 📚 Ordered Multivalue Dictionary. Powers furl.☆68Updated last month
- Interactive SQL database exploration in Python☆173Updated 3 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- dank key/value store high-level APIs☆18Updated 7 years ago
- A language for filtering, matching, and validating Python dictionaries☆47Updated 2 years ago
- Python text summarizer☆31Updated 5 years ago
- (Archived) A Python library for record linkage and deduplication.☆19Updated last year
- Debugging utility that helps you inspect your code☆15Updated 6 years ago
- Python 3 AsyncIO powered scraping framework with batteries included☆20Updated 9 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆157Updated 3 months ago
- xmldataset: xml parsing made easy 🗃️☆80Updated 5 years ago
- A simple functional pipelining library for Python☆37Updated 8 years ago
- Decorator based advanced configuration engine.☆23Updated last month
- SQL-style joins for Python iterables☆11Updated 9 years ago
- Python3 SOAP client built with lxml and requests.☆43Updated 5 years ago
- A lightweight python actor framework☆19Updated 9 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages.☆63Updated 5 years ago
- A backend for ZODB that stores pickles in a relational database.☆59Updated last month
- Detect and classify pagination links☆15Updated 5 years ago
- Lightweight Marshalling of Python 3 Objects.☆51Updated 10 years ago
- A series of tubes.☆56Updated last year
- Sunburnt offspring solr client☆27Updated 3 years ago
- Python library for extracting text from various file formats (for indexing).☆113Updated 3 years ago
- Retask is a simple task queue implementation written for human beings. It provides generic solution to create and manage task queues.☆120Updated 2 years ago
- Project automation task library for ‘Invoke’ tasks that are needed again and again.☆30Updated 2 years ago