pierrepita / atyimo
☆13 Updated 6 years ago
Alternatives and similar repositories for atyimo
Users who are interested in atyimo compare it to the libraries listed below.
- Python wrapper for a C++ Double Metaphone ☆15 Updated 3 weeks ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018). ☆14 Updated 6 years ago
- A maximum-strength name parser for record linkage. ☆37 Updated 3 weeks ago
- Notebooks which will provide a demo of Qgrid functionality ☆20 Updated 5 years ago
- A browser user interface for manual labeling of record pairs. ☆47 Updated last year
- Sidewall is a Python library for interacting with the Dimensions search API. ☆17 Updated 8 months ago
- A tool to read CSV files with CSVW metadata and transform them into other formats. ☆32 Updated 6 years ago
- A selection of business datasets ☆18 Updated 5 years ago
- data wrangling simplicity, complete audit transparency, and at speed ☆34 Updated 2 months ago
- framework for making streamcorpus data ☆11 Updated 8 years ago
- Python implementations of record linkage blocking techniques. ☆20 Updated last year
- Self-Service Semantic Suite (S4) ☆17 Updated 8 years ago
- Dexter document monitor for MMA ☆16 Updated last year
- A simple command line interface to the datamade/dedupe library. ☆42 Updated 2 years ago
- This repository explores various Numpy commands which are quite useful for working with datasets and handling array operations. ☆13 Updated 6 years ago
- Collaboration app for sharing and reviewing jupyter notebooks ☆16 Updated last week
- ☆16 Updated this week
- ☆16 Updated 8 months ago
- The Path of the PyData Ninja ☆16 Updated 9 years ago
- Comparing Polars to Pandas and a small introduction ☆44 Updated 4 years ago
- Markdown -> IPython conversion tool ☆15 Updated 10 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/) ☆11 Updated 10 years ago
- MoodCat😼 classifies the mood of English sentences. ☆14 Updated 2 years ago
- Jupyter notebook extension - accumulate multiple outputs from a code cell into tabs ☆10 Updated 6 months ago
- ☆30 Updated 2 years ago
- IPython Magic for exporting pandas objects to Excel ☆13 Updated 7 years ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies ☆19 Updated 2 years ago
- Using textstat to write better blogs and improve readability. ☆8 Updated 5 years ago
- FlexMatcher is a schema matching package in Python which handles the problem of matching multiple schemas to a single mediated schema. ☆29 Updated 5 months ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ☆27 Updated 2 years ago