bholley / maul
A Machine Learning approach to User-Agent parsing
☆12Updated 14 years ago
Alternatives and similar repositories for maul:
Users that are interested in maul are comparing it to the libraries listed below
- Model Training tool for MITIE☆79Updated 9 years ago
- Simple approximate-nearest-neighbours in Python using locality sensitive hashing.☆140Updated 12 years ago
- A Topic Modeling toolbox☆92Updated 8 years ago
- Cython implementation of DeepWalk☆54Updated last year
- Demo code for learning_text_transformer☆25Updated 10 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆99Updated 9 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing☆206Updated 2 years ago
- (BROKEN, help wanted)☆15Updated 9 years ago
- Model assisted random sampling.☆120Updated 4 years ago
- Library for Geo-Inferencing in Twitter Data☆28Updated 8 years ago
- A startup search engine made using embeddings built on crunchbase company descriptions☆11Updated 9 years ago
- Modularly extensible semantic metadata validator☆84Updated 9 years ago
- Code & Data for Introduction to Machine Learning with Scikit-Learn☆81Updated 6 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]☆108Updated 11 years ago
- mltk - Moz Language Tool Kit☆12Updated 10 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆82Updated 8 years ago
- a set of services that provide NLP facilities☆25Updated 4 years ago
- Determine if a web comment is spam or not using naive Bayes. Trained on youtube comments.☆92Updated 13 years ago
- An implementation of the multi-armed bandit optimization pattern as a Flask extension☆81Updated 3 weeks ago
- Updates to Zope's keyphrase extractor (forked from 1.1.0)☆67Updated 7 years ago
- Algorithms for "schema matching"☆26Updated 8 years ago
- A simple algorithm for clustering web pages, suitable for crawlers☆34Updated 8 years ago
- ☆39Updated 8 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- ☆75Updated 11 years ago
- MITIE: library and tools for information extraction☆29Updated 10 years ago
- ☆61Updated 9 years ago
- Machine Learning Versioning made Simple☆38Updated 2 years ago
- Framework for evaluating text extraction algorithms implemented as web services☆42Updated 12 years ago