python package for performing deduplication using flexible text matching and cleaning in pandas dataframe
☆25Nov 30, 2020Updated 5 years ago
Alternatives and similar repositories for dupandas
Users that are interested in dupandas are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A MkDocs plugin to add bootstrap classes to plan markdown generated tables.☆13Mar 27, 2020Updated 6 years ago
- Demo of pointblank / projmgr / GitHub Actions / Slack workflow for data quality monitoring☆16Mar 29, 2023Updated 3 years ago
- A web-based version of the codebook, which generates a concise summary of every variable in a dataset.☆14Apr 9, 2022Updated 3 years ago
- Smart grid tables will convert ascii grid tables to proper html grid tables.☆18Dec 23, 2018Updated 7 years ago
- Medium Article☆11May 15, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ⚡️ Pandas dataframes with object oriented programming style (not maintained)☆11Mar 17, 2024Updated 2 years ago
- Ant Design (v5) components for Plotly Dash☆14Feb 16, 2023Updated 3 years ago
- Birgitta is a Python ETL test and schema framework, providing automated tests for pyspark notebooks/recipes.☆14Nov 9, 2023Updated 2 years ago
- Estimate similarity of medical concepts based on Unified Medical Language System (UMLS)☆16Jan 17, 2022Updated 4 years ago
- Publication: Linked electronic health records for research on a nationwide cohort including over 54 million people in England☆19Mar 12, 2023Updated 3 years ago
- An R package for generating analysis-ready data from laboratory records☆16Sep 1, 2023Updated 2 years ago
- Hebrew PHI identification and redaction toolkit☆20Mar 21, 2024Updated 2 years ago
- A library to instantiate any Python object from configuration files.☆24Oct 12, 2022Updated 3 years ago
- Concise interactive data summaries in R☆60Feb 24, 2020Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Make working with pandas data and AWS DynamoDB easy☆21Jan 26, 2025Updated last year
- Highcharts meets Python in your jupyter notebook☆19Updated this week
- Transfers issues from Pivotal to GitHub☆12Sep 29, 2017Updated 8 years ago
- Dev Centric Tools for Mkdocs Based Documentation☆27Nov 9, 2025Updated 4 months ago
- [under development] ETL materials to support proposal for CDM enhancements for clinical trial data☆24Jun 25, 2021Updated 4 years ago
- ☆22Oct 29, 2020Updated 5 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 3 months ago
- Collection of various biomedical data models in parseable formats.☆29Jan 5, 2026Updated 2 months ago
- Some extra hints for using AgGrid in Streamlit apps☆18Feb 28, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A command line and Python client for Open-Spending☆10Nov 24, 2017Updated 8 years ago
- Enables loading react components in Dash applications directly from local project files, without any need for a separate build process.☆33Aug 19, 2024Updated last year
- several algorithms for converting dependency structures into constituency structures.☆10Feb 7, 2022Updated 4 years ago
- Set-oriented Operations in Pandas☆24May 27, 2020Updated 5 years ago
- Phenotyping algorithms for common biomarkers in primary care EHR for UK Biobank☆27Feb 19, 2021Updated 5 years ago
- Auto evaluation software for programming projects and assignments. This repository contains AutolabJS server.☆18Feb 16, 2019Updated 7 years ago
- Agile Dashboarding, anywhere☆18Aug 20, 2020Updated 5 years ago
- A Brand New LSH: The fly’s olfactory circuits algorithm☆11May 2, 2018Updated 7 years ago
- This python tool enables a variety of mappings of ICD codes (International Classification of Diseases) to different medical concepts with…☆44Mar 3, 2026Updated 3 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- dimple charts for R☆14Jun 19, 2015Updated 10 years ago
- A high-level language that allows researchers to unambiguously define their research algorithms.☆17Updated this week
- Services and guidelines for normalizing drug and other therapy terms☆13Feb 26, 2026Updated last month
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Oct 24, 2016Updated 9 years ago