ngmarchant / oasis
A Python package for efficient evaluation based on OASIS (Optimal Asymptotic Sequential Importance Sampling).
☆15 Updated 3 years ago
Alternatives and similar repositories for oasis:
Users who are interested in oasis are comparing it to the libraries listed below.
- Scalable String Similarity Joins in Python ☆38 Updated 7 months ago
- A Cython implementation of the affine gap string distance ☆57 Updated 2 years ago
- Venn diagrams with word clouds ☆50 Updated 9 months ago
- This is a set of utilities and formats that illustrate how one could begin to perform operations on causal graphs and sample over these g… ☆26 Updated 9 years ago
- Matrix tools for building and inspecting latent spaces ☆27 Updated 6 years ago
- Datadiff is diff for data ☆26 Updated 5 years ago
- IPython Magic for exporting pandas objects to Excel ☆13 Updated 7 years ago
- Multidimensional data explorer and visualization tool. ☆55 Updated 7 years ago
- Patsy Adaptors for Scikit-learn ☆48 Updated 5 years ago
- Ensemble topic modelling with pLSA ☆114 Updated 3 years ago
- bayesian graphical modelling and a bit of do-calculus for discrete data. ☆27 Updated 5 years ago
- Hidden alignment conditional random field for classifying string pairs. ☆24 Updated 4 months ago
- ☆32 Updated 7 years ago
- variations of the record linkage model of Steorts et al. AISTATS 2014's "SMERED: A Bayesian Approach to Graphical Record Linkage and De-d… ☆27 Updated 7 years ago
- Python wrapper for a C++ Double Metaphone ☆15 Updated 2 years ago
- Python library for Ceteris Paribus Plots (What-if plots) ☆19 Updated 3 years ago
- A maximum-strength name parser for record linkage. ☆36 Updated last week
- Scikit-learn compatible Topic Modelling with Hierarchical Statistical Block Models (Gerlach, Peixoto and Altmann, 2018) ☆28 Updated 5 years ago
- An in depth tutorial on sklearn's Pipeline and FeatureUnion classes. ☆16 Updated 7 years ago
- Set-oriented Operations in Pandas ☆24 Updated 4 years ago
- Distributed Bayesian Entity Resolution in Apache Spark ☆57 Updated 3 years ago
- ☆45 Updated 5 years ago
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors. ☆52 Updated 3 years ago
- A simple command line interface to the datamade/dedupe library. ☆42 Updated 2 years ago
- Python solver for mixed-effects models ☆98 Updated 7 years ago
- Dask tutorial for PyData DC 2016 ☆11 Updated 8 years ago
- Async IPython Magic for Asynchronous Notebook Cell Execution ☆22 Updated 2 years ago
- Algorithms for "schema matching" ☆26 Updated 8 years ago
- repository for R library "sbrlmod" ☆25 Updated 9 months ago
- These are the IPython notebook files for the CSC 432 Spring '13 course. ☆23 Updated 9 years ago