Collection of some algorithms for entity resolution
☆28Sep 7, 2015Updated 10 years ago
Alternatives and similar repositories for entity_resolution_spark
Users that are interested in entity_resolution_spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SparkER: an Entity Resolution framework for Apache Spark☆66Mar 29, 2024Updated 2 years ago
- End-to-End Deep Entity Resolution☆33Jul 14, 2021Updated 4 years ago
- TF-IDF with Spark for the Kaggle popcorn competition☆10Jul 1, 2015Updated 10 years ago
- Update a Google Data Catalog tag with dbt Cloud run metadata☆22Jan 19, 2021Updated 5 years ago
- Stanford CoreNLP Extensions: Fork to provide the ability to capture Multi-Word Expressions☆10Jun 14, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A bunch of fancy soft string matching routines, with some accompanying datasets☆56Aug 10, 2017Updated 8 years ago
- Duke is a fast and flexible deduplication engine written in Java☆623Oct 11, 2023Updated 2 years ago
- Encode / decode varints.☆14May 24, 2021Updated 5 years ago
- A framework to allow MapReduce applications to use Akka actors☆12Jan 15, 2022Updated 4 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆30Apr 5, 2023Updated 3 years ago
- New algorithms for Large-scale Collaborative Ranking: PrimalCR and PrimalCR++☆12Sep 1, 2017Updated 8 years ago
- I developed this case study only in 7 days with Pyspark (Spark 1.6.0) SQL & MLlib. I used Databricks cluster and AWS. %90 AUC is achieved…☆17May 7, 2016Updated 10 years ago
- Click through rate prediction☆19Feb 14, 2017Updated 9 years ago
- ☆14Feb 12, 2016Updated 10 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Tensorflow implementation of MTransE+KDCoE, and WK3l-60k dataset☆18Sep 19, 2019Updated 6 years ago
- Clustering documents based on LSH☆14Apr 20, 2016Updated 10 years ago
- Vector Plugin for Solr: calculate dot product / cosine similarity on documents☆16Oct 24, 2018Updated 7 years ago
- Utilities for writing tests that use Apache Spark.☆24Dec 29, 2018Updated 7 years ago
- Official repository of the ACM SIGIR 2019 paper: "Fast Approximate Filtering of Search Results Sorted by Attribute" by Franco Maria Nardi…☆14Nov 7, 2019Updated 6 years ago
- Sublime Text 2/3 plugin for keyboard driven file navigation☆45Aug 16, 2014Updated 11 years ago
- Test suite to document the behavior of Spark☆21Apr 15, 2021Updated 5 years ago
- code for unsupervised entity resolution☆10Apr 26, 2019Updated 7 years ago
- Automatically exported from code.google.com/p/nyt-salience☆22Dec 15, 2015Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [VLDB 2024] Source code for FusionQuery: On-demand Fusion Queries over Multi-source Heterogeneous Data☆11Mar 11, 2025Updated last year
- Project for the talk on NLP using LSTM implementation from DL4J on Spark☆20May 6, 2016Updated 10 years ago
- Minimal examples of data structures and algorithms in Scala☆24May 3, 2019Updated 7 years ago
- Adversaial attack comparative assessment Large Language Model☆13May 21, 2025Updated last year
- SnappyStore☆39Apr 16, 2023Updated 3 years ago
- Official Code Framework of the paper "Deep Language-based Critiquing for Recommender System"☆19Jul 24, 2019Updated 6 years ago
- Turn remote MCP servers into local command workflows.☆61Feb 28, 2026Updated 3 months ago
- A simple kubernetes cron☆12Jun 28, 2016Updated 9 years ago
- ☆16Oct 1, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Mikkoo is a PgQ to RabbitMQ Relay☆12May 19, 2026Updated 3 weeks ago
- The Berkeley Entity Resolution System jointly solves the problems of named entity recognition, coreference resolution, and entity linking…☆188Dec 7, 2019Updated 6 years ago
- ☆11Oct 12, 2023Updated 2 years ago
- ☆12Aug 29, 2015Updated 10 years ago
- Extensions for and tools to work with CoreNlp☆24Feb 26, 2022Updated 4 years ago
- ☆41Jan 21, 2022Updated 4 years ago
- Hybrid Question Answering (HAWK) -- is going to drive forth the OKBQA vision of hybrid question answering system using Linked Data and fu…☆16May 8, 2026Updated last month