Welcome to Snowman App – a Data Matching Benchmark Platform.
☆38Feb 9, 2023Updated 3 years ago
Alternatives and similar repositories for snowman
Users that are interested in snowman are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Feb 26, 2022Updated 4 years ago
- Code repository for Mondrian, a project for multiregion template recognition in spreadsheets.☆14May 25, 2022Updated 3 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆21Oct 18, 2021Updated 4 years ago
- LEMON: Explainable Entity Matching☆19Apr 6, 2022Updated 3 years ago
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆24May 31, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Opensource scraper for analyse of social networks. Create nodes with egdes for you to visualize on editors like gephi.☆11Dec 2, 2025Updated 3 months ago
- Lab tasks for the course on "Data Engineering for Machine Learning"☆10May 1, 2023Updated 2 years ago
- A .NET library to work with Electronic Product Codes (EPC, SSCC, SGTIN)☆12Jun 25, 2020Updated 5 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆66Updated this week
- Entity resolution using zero labeled examples☆32Jun 29, 2024Updated last year
- Minoan ER is an Entity Resolution (ER) framework, built by researchers in Crete (the land of the ancient Minoan civilization). Entity res…☆17Nov 18, 2020Updated 5 years ago
- Federal Cloud Computing Strategy Website☆15Oct 6, 2022Updated 3 years ago
- An open source, high scalability toolkit in Java for Entity Resolution.☆222Jul 12, 2025Updated 8 months ago
- JedAI-WebApp is a GUI that facilitates the execution of JedAI. JedAI is an open source, high scalability toolkit that offers out-of-the-b…☆25Apr 14, 2023Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- An API proxy written in Go to allow consuming apis via javascript without exposing the api keys☆25Nov 11, 2013Updated 12 years ago
- SparkER: an Entity Resolution framework for Apache Spark☆65Mar 29, 2024Updated last year
- Resources for tackling record linkage / deduplication / data matching problems☆126Feb 22, 2024Updated 2 years ago
- ☆192May 29, 2024Updated last year
- FairPrep is a design and evaluation framework for fairness-enhancing interventions that treats data as a first-class citizen.☆11Mar 24, 2023Updated 3 years ago
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+☆48Feb 1, 2023Updated 3 years ago
- ☆20Dec 11, 2023Updated 2 years ago
- Ensime integration with Sublime Text 2 for Scala development☆139Jul 8, 2015Updated 10 years ago
- Clustering documents based on LSH☆14Apr 20, 2016Updated 9 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Datasets for Hyperparameter Optimization of Neural Machine Translation☆10Aug 19, 2024Updated last year
- Iocaine2 Tool for FFXI☆10May 9, 2022Updated 3 years ago
- A Generalized Data Cleaning System☆51Apr 28, 2016Updated 9 years ago
- Implementation of many similarity join algorithms.☆15Mar 6, 2014Updated 12 years ago
- A list of free data matching and record linkage software.☆400Feb 21, 2024Updated 2 years ago
- C++ 11 minifloat type implementation☆14Aug 3, 2015Updated 10 years ago
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut…☆161Nov 18, 2022Updated 3 years ago
- A monolithic index that supports worst-case optimal joins (WCOJ) by providing all collation orders in a single redundancy eliminating dat…☆16Sep 18, 2025Updated 6 months ago
- ☆15Dec 28, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- TensorFlow implementation of Pointer Networks☆12Aug 30, 2016Updated 9 years ago
- Source code for several Metanome data profiling algorithms☆59May 15, 2023Updated 2 years ago
- Pattern-based table discovery in Open Data CSV files☆25Dec 8, 2022Updated 3 years ago
- [VLDB 2024] Source code for FusionQuery: On-demand Fusion Queries over Multi-source Heterogeneous Data☆11Mar 11, 2025Updated last year
- Unofficial implementation of the paper "OpenTag: Open Attribute Value Extraction from Product Profiles"☆33Aug 22, 2018Updated 7 years ago
- High-level Rust library that binds to Poppler to extract text from a PDF☆11Dec 16, 2020Updated 5 years ago
- An implementation of a neural network training routine using derivative information in Pytorch.☆10Dec 19, 2020Updated 5 years ago