Python package for deduplicating pandas DataFrames using flexible text matching and cleaning.
☆24 · Nov 30, 2020 · Updated 5 years ago
Alternatives and similar repositories for dupandas
Users that are interested in dupandas are comparing it to the libraries listed below.
- View a list of JSON-serializable dictionaries or a 2-D array, in HandsOnTable, in Jupyter Notebook. ☆13 · Oct 11, 2018 · Updated 7 years ago
- A web-based version of the codebook, which generates a concise summary of every variable in a dataset. ☆14 · Apr 9, 2022 · Updated 4 years ago
- Extension to Python-Markdown to translate pydantic's model fields to markdown table ☆13 · Apr 19, 2024 · Updated 2 years ago
- Smart grid tables will convert ascii grid tables to proper html grid tables. ☆18 · Dec 23, 2018 · Updated 7 years ago
- Medium Article ☆11 · May 15, 2021 · Updated 5 years ago
- Ant Design (v5) components for Plotly Dash ☆14 · Feb 16, 2023 · Updated 3 years ago
- Birgitta is a Python ETL test and schema framework, providing automated tests for pyspark notebooks/recipes. ☆14 · Nov 9, 2023 · Updated 2 years ago
- A collection of Pandas helper functions. ☆14 · Apr 4, 2023 · Updated 3 years ago
- Publication: Linked electronic health records for research on a nationwide cohort including over 54 million people in England ☆18 · Mar 12, 2023 · Updated 3 years ago
- An R package for generating analysis-ready data from laboratory records ☆16 · Sep 1, 2023 · Updated 2 years ago
- Neural models for detecting and masking personal information from texts ☆16 · Nov 25, 2022 · Updated 3 years ago
- Pandas dataframe to jQuery DataTables ☆16 · Dec 5, 2018 · Updated 7 years ago
- A library to instantiate any Python object from configuration files. ☆25 · Oct 12, 2022 · Updated 3 years ago
- Concise interactive data summaries in R ☆59 · Feb 24, 2020 · Updated 6 years ago
- Make working with pandas data and AWS DynamoDB easy ☆21 · May 21, 2026 · Updated last week
- Highcharts meets Python in your jupyter notebook ☆19 · Mar 25, 2026 · Updated 2 months ago
- Dev Centric Tools for Mkdocs Based Documentation ☆27 · Nov 9, 2025 · Updated 6 months ago
- [under development] ETL materials to support proposal for CDM enhancements for clinical trial data ☆24 · Jun 25, 2021 · Updated 4 years ago
- ☆22 · Oct 29, 2020 · Updated 5 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML ☆12 · Dec 10, 2025 · Updated 5 months ago
- Collection of various biomedical data models in parseable formats. ☆29 · Apr 20, 2026 · Updated last month
- The right datagrid for Dash when using Dash Mantine Components based on https://www.mantine-react-table.com. ☆36 · Mar 28, 2023 · Updated 3 years ago
- Set-oriented Operations in Pandas ☆24 · May 27, 2020 · Updated 6 years ago
- The PEDSnet Data Quality Assessment Toolkit (OMOP CDM) ☆27 · Apr 16, 2021 · Updated 5 years ago
- texrex web page cleaning & ClaraX random walk crawler ☆11 · Dec 13, 2021 · Updated 4 years ago
- Phenotyping algorithms for common biomarkers in primary care EHR for UK Biobank ☆32 · Feb 19, 2021 · Updated 5 years ago
- Auto evaluation software for programming projects and assignments. This repository contains AutolabJS server. ☆18 · Feb 16, 2019 · Updated 7 years ago
- Enables loading react components in Dash applications directly from local project files, without any need for a separate build process. ☆32 · Aug 19, 2024 · Updated last year
- Agile Dashboarding, anywhere ☆18 · Aug 20, 2020 · Updated 5 years ago
- ☆13 · Feb 12, 2023 · Updated 3 years ago
- This python tool enables a variety of mappings of ICD codes (International Classification of Diseases) to different medical concepts with… ☆46 · Apr 29, 2026 · Updated last month
- A high-level language that allows researchers to unambiguously define their research algorithms. ☆18 · Updated this week
- Services and guidelines for normalizing drug and other therapy terms ☆15 · Feb 26, 2026 · Updated 3 months ago
- code to remove "noise" from hOCR output of Tesseract OCR. ☆14 · Oct 24, 2016 · Updated 9 years ago
- Distillation of Ensemble Dependency Parsers into a Single Graph-Based Parser ☆11 · Oct 14, 2016 · Updated 9 years ago
- Computer Vision tutorial for DH Summer School Antwerp ☆11 · May 9, 2026 · Updated 3 weeks ago
- UniParse: A universal graph-based parsing toolkit ☆10 · Oct 2, 2019 · Updated 6 years ago
- Demo to show how to reuse a document using different metadata. ☆12 · Mar 15, 2025 · Updated last year
- Jupyter Notebooks and other code for 4CE data visualizations. ☆13 · Jan 25, 2023 · Updated 3 years ago