Collection of some algorithms for entity resolution
☆28Sep 7, 2015Updated 10 years ago
Alternatives and similar repositories for entity_resolution_spark
Users that are interested in entity_resolution_spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SparkER: an Entity Resolution framework for Apache Spark☆65Mar 29, 2024Updated 2 years ago
- End-to-End Deep Entity Resolution☆33Jul 14, 2021Updated 4 years ago
- Tutorial code and data for the entity resolution workshops.☆45Jul 15, 2015Updated 10 years ago
- ☆15May 19, 2019Updated 6 years ago
- Update a Google Data Catalog tag with dbt Cloud run metadata☆22Jan 19, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A package for Bilateral and Multilateral Price Index Calculations☆11Updated this week
- An example of Spark and GraphX with Twitter as sample☆19Dec 29, 2016Updated 9 years ago
- Stanford CoreNLP Extensions: Fork to provide the ability to capture Multi-Word Expressions☆10Jun 14, 2022Updated 3 years ago
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆47May 11, 2018Updated 7 years ago
- Risk Lab Research - ESG Company Ranking Score☆11May 1, 2020Updated 5 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆56Aug 10, 2017Updated 8 years ago
- SimMetrics is a Similarity Metric Library, based on previous work by http://sourceforge.net/projects/simmetrics/☆11Aug 25, 2016Updated 9 years ago
- export data from AirTable to a JSON on S3 via AWS Lambda☆16Feb 22, 2018Updated 8 years ago
- Encode / decode varints.☆14May 24, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆32Apr 15, 2023Updated 2 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆31Apr 5, 2023Updated 3 years ago
- Tools to query, download, preprocess and postprocess Sentinel-2 data☆14Nov 4, 2020Updated 5 years ago
- Visualization Recommendation Based on Analysis History☆15Aug 31, 2023Updated 2 years ago
- New algorithms for Large-scale Collaborative Ranking: PrimalCR and PrimalCR++☆12Sep 1, 2017Updated 8 years ago
- Ensime integration with Sublime Text 2 for Scala development☆139Jul 8, 2015Updated 10 years ago
- Diffing tools for comparing datasets in CSV, XLSX and other formats☆21May 22, 2019Updated 6 years ago
- An open source, high scalability toolkit in Java for Entity Resolution.☆223Jul 12, 2025Updated 9 months ago
- This project is wraper for Leilex, legal entity identifier API. Includes ISIN-LEI conversion. Search LEI number using company name.☆25Oct 6, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- code for unsupervised entity resolution☆10Apr 26, 2019Updated 6 years ago
- Automatically exported from code.google.com/p/nyt-salience☆22Dec 15, 2015Updated 10 years ago
- ☆15Aug 11, 2022Updated 3 years ago
- LLM Wiki - 用 LLM 构建持续积累的个人知识库,含 Claude Code Skill 和实战经验☆50Apr 5, 2026Updated last week
- The code of our AAAI'20 paper "GraphER: Token-Centric Entity Resolution with Graph Convolutional Neural Networks"☆11Aug 10, 2020Updated 5 years ago
- Project for the talk on NLP using LSTM implementation from DL4J on Spark☆20May 6, 2016Updated 9 years ago
- Deploy Dask on Marathon☆10Feb 6, 2017Updated 9 years ago
- ☆11Jul 21, 2017Updated 8 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆17Dec 20, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆11Dec 14, 2016Updated 9 years ago
- Code of paper: xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking"☆18Apr 3, 2026Updated last week
- ☆18Mar 18, 2026Updated 3 weeks ago
- ETL for the SPDR ETF holdings XLS documents☆26Mar 5, 2026Updated last month
- ☆10Jun 15, 2024Updated last year
- Simple wrapper and a command-line tool for Bloomberg's OpenFIGI API.☆29Sep 23, 2021Updated 4 years ago
- SnappyStore☆39Apr 16, 2023Updated 2 years ago