Collection of some algorithms for entity resolution
☆28Sep 7, 2015Updated 10 years ago
Alternatives and similar repositories for entity_resolution_spark
Users that are interested in entity_resolution_spark are comparing it to the libraries listed below.
- SparkER: an Entity Resolution framework for Apache Spark☆65Mar 29, 2024Updated 2 years ago
- End-to-End Deep Entity Resolution☆33Jul 14, 2021Updated 4 years ago
- Tutorial code and data for the entity resolution workshops.☆45Jul 15, 2015Updated 10 years ago
- ☆15May 19, 2019Updated 6 years ago
- To reproduce experiments of the paper "Entity Matching with Transformer Architectures"☆27Nov 4, 2019Updated 6 years ago
- TF-IDF with Spark for the Kaggle popcorn competition☆10Jul 1, 2015Updated 10 years ago
- Examples for using the dedupe library☆10Feb 22, 2016Updated 10 years ago
- JedAI-WebApp is a GUI that facilitates the execution of JedAI. JedAI is an open source, high scalability toolkit that offers out-of-the-b…☆26Apr 14, 2023Updated 3 years ago
- An example of Spark and GraphX with Twitter as sample☆19Dec 29, 2016Updated 9 years ago
- Stanford CoreNLP Extensions: Fork to provide the ability to capture Multi-Word Expressions☆10Jun 14, 2022Updated 3 years ago
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆47May 11, 2018Updated 7 years ago
- Risk Lab Research - ESG Company Ranking Score☆11May 1, 2020Updated 6 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆56Aug 10, 2017Updated 8 years ago
- SimMetrics is a Similarity Metric Library, based on previous work by http://sourceforge.net/projects/simmetrics/☆11Aug 25, 2016Updated 9 years ago
- export data from AirTable to a JSON on S3 via AWS Lambda☆16Feb 22, 2018Updated 8 years ago
- Python APIs for Open PermID☆15Jan 24, 2024Updated 2 years ago
- Duke is a fast and flexible deduplication engine written in Java☆626Oct 11, 2023Updated 2 years ago
- Encode / decode varints.☆14May 24, 2021Updated 4 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆30Apr 5, 2023Updated 3 years ago
- Ensime integration with Sublime Text 2 for Scala development☆139Jul 8, 2015Updated 10 years ago
- Clustering documents based on LSH☆14Apr 20, 2016Updated 10 years ago
- Vector Plugin for Solr: calculate dot product / cosine similarity on documents☆16Oct 24, 2018Updated 7 years ago
- Official repository of the ACM SIGIR 2019 paper: "Fast Approximate Filtering of Search Results Sorted by Attribute" by Franco Maria Nardi…☆14Nov 7, 2019Updated 6 years ago
- Implementation of many similarity join algorithms.☆15Mar 6, 2014Updated 12 years ago
- A simple, self-coded recurrent neural network that uses weekly changes in 10 major sector ETFs to predict which sectors will grow in the …☆18May 29, 2018Updated 7 years ago
- code for unsupervised entity resolution☆10Apr 26, 2019Updated 7 years ago
- Automatically exported from code.google.com/p/nyt-salience☆22Dec 15, 2015Updated 10 years ago
- The code of our AAAI'20 paper "GraphER: Token-Centric Entity Resolution with Graph Convolutional Neural Networks"☆11Aug 10, 2020Updated 5 years ago
- Project for the talk on NLP using LSTM implementation from DL4J on Spark☆20May 6, 2016Updated 9 years ago
- implementation of aided LLM codeplan algorithm in java☆10Jan 13, 2024Updated 2 years ago
- ☆18Updated this week
- ☆13Jan 1, 2024Updated 2 years ago
- Simple wrapper and a command-line tool for Bloomberg's OpenFIGI API.☆29Sep 23, 2021Updated 4 years ago
- Bosch Kaggle competition: Reduce manufacturing failures (https://www.kaggle.com/c/bosch-production-line-performance)☆24Nov 13, 2016Updated 9 years ago
- Official Code Framework of the paper "Deep Language-based Critiquing for Recommender System"☆19Jul 24, 2019Updated 6 years ago
- ☆16Oct 1, 2020Updated 5 years ago
- The Berkeley Entity Resolution System jointly solves the problems of named entity recognition, coreference resolution, and entity linking…☆188Dec 7, 2019Updated 6 years ago
- Mikkoo is a PgQ to RabbitMQ Relay☆12Mar 31, 2026Updated last month
- PyTorch implementation of "Distilling the Knowledge in a Neural Network"☆18Jul 24, 2023Updated 2 years ago