hickeroar / simplebayesLinks
A memory-based, optional-persistence naïve bayesian text classifier.
☆36Updated 10 years ago
Alternatives and similar repositories for simplebayes
Users that are interested in simplebayes are comparing it to the libraries listed below
Sorting:
- unofficial git mirror of http://svn.whoosh.ca svn repo☆49Updated 15 years ago
- Non-Overlapping Aho-Corasick Python extension, for Python 2 (str and unicode) and Python 3☆51Updated 10 years ago
- HAT-Trie for Python☆87Updated 9 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆81Updated 9 years ago
- A high-performance distributed web crawling & scraping framework written with golang and python.☆30Updated 9 years ago
- MapReduce platform in python☆34Updated 10 years ago
- C++ Ternary Search Tree implementation with Python bindings☆43Updated 8 years ago
- Finite State Machine for Python (based on Jake Gordon's javascript-state-machine)☆163Updated 6 years ago
- Python Multi-Process Execution Pool: concurrent asynchronous execution pool with custom resource constraints (memory, timeouts, affinity,…☆168Updated 6 years ago
- Convert URL's to a normalized unicode format☆67Updated 7 years ago
- Experimental parallel data analysis toolkit.☆122Updated 4 years ago
- A benchmark framework for testing algorithms and pairwise metrics.☆67Updated 12 years ago
- Find which links on a web page are pagination links☆29Updated 9 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages.☆64Updated 5 years ago
- Modularly extensible semantic metadata validator☆84Updated 10 years ago
- Distributed text analysis suite based on Celery☆96Updated 3 years ago
- Magic Type Introspection And Runtime Parameter Type/Value Checking. (NOT MAINTAIN ANYMORE)☆38Updated 9 years ago
- Python collections supporting parallel map/reduce style methods☆40Updated 2 years ago
- Zipfian capstone project - Dan Morris☆30Updated 8 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet…☆31Updated last year
- Python Domain Specific Language Tools☆84Updated 3 years ago
- Tool to visualize data quickly with no brain usage for plot creation☆47Updated 3 months ago
- A totally different take on container boilerplate.☆137Updated 4 years ago
- Fast multi-keyword search engine for text strings☆258Updated last year
- My capstone project for Galvanize (Zipfian Academy)☆38Updated 7 years ago
- High Level Kafka Scanner☆19Updated 8 years ago
- Extract, parse and populate templates from strings☆27Updated 6 years ago
- Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe…☆55Updated last year
- A Flask web server for quickly asking someone a question☆81Updated 10 years ago
- Python Logging for Humans☆119Updated 9 years ago