ngmarchant / oasisLinks
A Python package for efficient evaluation based on OASIS (Optimal Asymptotic Sequential Importance Sampling).
☆15Updated 4 years ago
Alternatives and similar repositories for oasis
Users that are interested in oasis are comparing it to the libraries listed below
Sorting:
- variations of the record linkage model of Steorts et al. AISTATS 2014's "SMERED: A Bayesian Approach to Graphical Record Linkage and De-d…☆26Updated 8 years ago
- Fast hierarchical clustering routines for R and Python.☆154Updated 2 weeks ago
- Distributed Bayesian Entity Resolution in Apache Spark☆58Updated 4 years ago
- Ensemble topic modelling with pLSA☆114Updated 4 years ago
- This is a set of utilities and formats that illustrate how one could begin to perform operations on causal graphs and sample over these g…☆26Updated 10 years ago
- A browser user interface for manual labeling of record pairs.☆48Updated 2 years ago
- Multidimensional isotonic regression☆28Updated 8 years ago
- Python wrapper for a C++ Double Metaphone☆15Updated last month
- Scikit-learn compatible Topic Modelling with Hierarchical Statistical Block Models (Gerlach, Peixoto and Altmann, 2018)☆28Updated 6 years ago
- Pandas Adapters For Scikit-Learn☆53Updated 7 years ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr…☆57Updated 4 years ago
- Venn diagrams with word clouds☆50Updated last year
- Patsy Adaptors for Scikit-learn☆48Updated 6 years ago
- Mini module with syntax sugar for pandas/sklearn☆107Updated 5 years ago
- Genie: Fast and Robust Hierarchical Clustering with Noise Point Detection - in Python and R☆67Updated last week
- Fast, flexible name matching for large datasets☆71Updated 4 months ago
- visJS2jupyter is a tool to bring the interactivity of networks created with vis.js into jupyter notebook cells☆78Updated 2 years ago
- Multidimensional data explorer and visualization tool.☆56Updated 8 years ago
- Python solver for mixed-effects models☆97Updated 6 months ago
- Package for performing Reddit-based text analysis☆20Updated 6 years ago
- A selection of statistical graphics for vega in python, based on altair.☆103Updated 2 years ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆142Updated last year
- Draw interactive NetworkX graphs with Altair☆227Updated 2 years ago
- Interactive data exploration with Altair☆110Updated 5 years ago
- Group thousands of similar spreadsheet or database text entries in seconds☆157Updated 2 years ago
- A simple command line interface to the datamade/dedupe library.☆43Updated 3 years ago
- Public repository for versioning machine learning data☆42Updated 4 years ago
- Tools for parsing and querying Wikimedia Foundation pageview data from both static dumps and the online API.☆66Updated 3 years ago
- Scalable String Similarity Joins in Python☆39Updated last year