Collection of some algorithms for entity resolution
☆28Sep 7, 2015Updated 10 years ago
Alternatives and similar repositories for entity_resolution_spark
Users that are interested in entity_resolution_spark are comparing it to the libraries listed below.
- End-to-End Deep Entity Resolution☆33Jul 14, 2021Updated 4 years ago
- An example of Spark and GraphX with Twitter as sample☆19Dec 29, 2016Updated 9 years ago
- Stanford CoreNLP Extensions: Fork to provide the ability to capture Multi-Word Expressions☆10Jun 14, 2022Updated 3 years ago
- ☆19Updated this week
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆47May 11, 2018Updated 7 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆56Aug 10, 2017Updated 8 years ago
- Python APIs for Open PermID☆15Jan 24, 2024Updated 2 years ago
- Duke is a fast and flexible deduplication engine written in Java☆626Oct 11, 2023Updated 2 years ago
- Convolutional REpresentations for Music Analysis☆12Jul 5, 2016Updated 9 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆31Apr 5, 2023Updated 2 years ago
- Visualization Recommendation Based on Analysis History☆15Aug 31, 2023Updated 2 years ago
- New algorithms for Large-scale Collaborative Ranking: PrimalCR and PrimalCR++☆12Sep 1, 2017Updated 8 years ago
- ☆14Feb 12, 2016Updated 10 years ago
- Ensime integration with Sublime Text 2 for Scala development☆139Jul 8, 2015Updated 10 years ago
- Clustering documents based on LSH☆14Apr 20, 2016Updated 9 years ago
- An open source, high scalability toolkit in Java for Entity Resolution.☆222Jul 12, 2025Updated 8 months ago
- Official repository of the ACM SIGIR 2019 paper: "Fast Approximate Filtering of Search Results Sorted by Attribute" by Franco Maria Nardi…☆14Nov 7, 2019Updated 6 years ago
- This is a comprehensive guide on how you can automate your feature engineering process.☆11Jun 25, 2018Updated 7 years ago
- Run streamlit web application, test and deploy to a cloud service (GCP, AWS, Heroku)☆14Oct 8, 2022Updated 3 years ago
- Automatically exported from code.google.com/p/nyt-salience☆22Dec 15, 2015Updated 10 years ago
- ☆15Aug 11, 2022Updated 3 years ago
- The code of our AAAI'20 paper "GraphER: Token-Centric Entity Resolution with Graph Convolutional Neural Networks"☆11Aug 10, 2020Updated 5 years ago
- [VLDB 2024] Source code for FusionQuery: On-demand Fusion Queries over Multi-source Heterogeneous Data☆11Mar 11, 2025Updated last year
- Deploy Dask on Marathon☆10Feb 6, 2017Updated 9 years ago
- ☆11Dec 14, 2016Updated 9 years ago
- ☆10Jun 15, 2024Updated last year
- ☆13Jan 1, 2024Updated 2 years ago
- Adversarial attack comparative assessment Large Language Model☆13May 21, 2025Updated 10 months ago
- Official Code Framework of the paper "Deep Language-based Critiquing for Recommender System"☆19Jul 24, 2019Updated 6 years ago
- ☆16Oct 1, 2020Updated 5 years ago
- Repository for "Condolence and Empathy in Online Communities", EMNLP 2020☆10Nov 9, 2020Updated 5 years ago
- An API that uses machine learning to help the Ushahidi nonprofit do smarter crisis crowdsourcing.☆25Nov 17, 2013Updated 12 years ago
- The Berkeley Entity Resolution System jointly solves the problems of named entity recognition, coreference resolution, and entity linking…☆187Dec 7, 2019Updated 6 years ago
- PyTorch implementation of "Distilling the Knowledge in a Neural Network"☆18Jul 24, 2023Updated 2 years ago
- ☆11Oct 12, 2023Updated 2 years ago
- Examples and demos for ReproZip☆17Oct 24, 2022Updated 3 years ago
- Distributed Bayesian Entity Resolution in Apache Spark☆59Jun 10, 2021Updated 4 years ago
- Extensions for and tools to work with CoreNlp☆24Feb 26, 2022Updated 4 years ago
- 2003 Neural Networks experiments -- when it was not mainstream ;-)☆18Sep 13, 2016Updated 9 years ago