DavyLandman / ncd
Script to calculate the normalized compression distance of sets of files. It also tries to parallelize the work over the available processors.
☆16 · Updated 9 years ago
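For readers unfamiliar with the metric, the normalized compression distance between two byte strings x and y is commonly defined as (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), where C is the compressed size. The following is a minimal sketch of that definition, not the repository's actual code: the choice of `zlib` as the compressor, the thread pool, and the function names `compressed_size`, `ncd`, and `pairwise_ncd` are all assumptions made for illustration.

```python
import zlib
from concurrent.futures import ThreadPoolExecutor
from itertools import combinations
from typing import Dict, Optional, Tuple


def compressed_size(data: bytes) -> int:
    """Size in bytes of the zlib-compressed input (compressor is an assumption)."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized compression distance between two byte strings."""
    cx, cy = compressed_size(x), compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


def pairwise_ncd(
    items: Dict[str, bytes], workers: Optional[int] = None
) -> Dict[Tuple[str, str], float]:
    """Compute NCD for every unordered pair of named inputs using a worker pool."""
    pairs = list(combinations(sorted(items), 2))
    with ThreadPoolExecutor(max_workers=workers) as pool:
        distances = pool.map(lambda p: ncd(items[p[0]], items[p[1]]), pairs)
        return dict(zip(pairs, distances))
```

Values near 0 suggest the two inputs share most of their structure, while values near 1 suggest they share little. For a set of files, `items` could map each filename to its contents read in binary mode; a process pool could be substituted for the thread pool if the compressor in use does not run in parallel across threads.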
Alternatives and similar repositories for ncd:
Users who are interested in ncd are comparing it to the repositories listed below:
- Machine Learning Open Source Software ☆23 · Updated 6 years ago
- Low-level primitives for collapsed Gibbs sampling in python and C++ ☆33 · Updated 9 months ago
- Markdown -> IPython conversion tool ☆15 · Updated 9 years ago
- distributed word2vec in Chapel using adagrad ☆16 · Updated 8 years ago
- Fast Dot Products on Pretty Big Data ☆15 · Updated 6 years ago
- Clustering documents based on LSH ☆14 · Updated 8 years ago
- Accompanying code for using hoverpy with scikitlearn ☆10 · Updated 8 years ago
- framework for making streamcorpus data ☆11 · Updated 7 years ago
- ProbLog 2 is now at https://github.com/ML-KULeuven/problog ☆10 · Updated 5 years ago
- Code and data from the paper "Email formality in the workplace: A case study on the Enron corpus" ☆10 · Updated 9 years ago
- brat rapid annotation tool (brat) - for all your textual annotation needs ☆10 · Updated 6 years ago
- Tools to evaluate accuracies of various (research papers') metadata extraction libraries ☆11 · Updated 9 years ago
- Using GP and metafeatures to grow better forests for prediction. ☆10 · Updated 8 years ago
- A project that implements statistical methods for identifying anomalous files ☆22 · Updated 10 years ago
- Inline, interactive graphs inside jupyter/ipython notebooks ☆16 · Updated 7 years ago
- Operations for Immutable Notebook Documents ☆29 · Updated 7 years ago
- Infinite relational model (IRM) for datamicroscopes ☆14 · Updated 9 years ago
- Experimental Vega Dataflow Visualization ☆21 · Updated 8 years ago
- Online Bootcamp Student Project Presentation ☆14 · Updated 7 years ago
- Implementation of QuadSketch algorithm ☆11 · Updated last year
- vIPer: a new tool for IPython notebooks. ☆60 · Updated 10 years ago
- ☆13 · Updated 10 years ago
- VW, Liblinear and StreamSVM compared on webspam ☆14 · Updated 10 years ago
- scikit-learn addon to operate on set/"group"-based features ☆41 · Updated 8 years ago
- A machine learning software for extracting information from scholarly documents ☆23 · Updated 4 years ago
- Predict people interest in renting specific NYC apartments. The challenge combines structured data, geolocalization, time data, free text… ☆17 · Updated 7 years ago
- A platform for unified linear and relational algebra analytics, built on the Accumulo NoSQL database ☆11 · Updated 2 years ago