aamend / spark-gdeltLinks
Binding the GDELT universe in a Spark environment
☆24Updated 2 years ago
Alternatives and similar repositories for spark-gdelt
Users that are interested in spark-gdelt are comparing it to the libraries listed below
Sorting:
- Record matching and entity resolution at scale in Spark☆34Updated last year
- Algorithms for "schema matching"☆26Updated 8 years ago
- ☆15Updated 2 years ago
- A Scalable Data Cleaning Library for PySpark.☆27Updated 6 years ago
- Collection of some algorithms for entity resolution☆28Updated 9 years ago
- Stanford Entity-Resolution Framework☆24Updated 6 years ago
- deep entity resolution lite version☆11Updated 5 years ago
- Jupyter notebooks showing how to use Neo4j Graph Algorithms☆52Updated 4 years ago
- Real-time query spark and visualise it as graph.☆24Updated 7 years ago
- Tutorial code and data for the entity resolution workshops.☆45Updated 9 years ago
- ☆11Updated 6 years ago
- An example of Spark and GraphX with Twitter as sample☆19Updated 8 years ago
- Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method☆13Updated last year
- ☆19Updated 6 years ago
- Probabilistic/machine-learning algorithms for medical record linkage [Critical Juncture]☆14Updated 7 years ago
- Apache NiFi NLP Processor☆18Updated last year
- Loading OpenSanctions into Neo4J and Linkurious☆28Updated 5 months ago
- Machine Learning Procedures and Functions for Neo4j☆64Updated 6 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 3 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago
- ☆16Updated 4 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- A curated list of articles, papers and tools for managing the building and deploying of machine learning models, aka machine learning eng…☆18Updated 6 years ago
- Mention-anomaly-based event detection and tracking in Twitter☆17Updated 8 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆65Updated 6 years ago
- Scraping Tweet data for Russian Troll Twitter accounts into Neo4j☆57Updated 7 years ago
- FlexMatcher is a schema matching package in Python which handles the problem of matching multiple schemas to a single mediated schema.☆29Updated 5 months ago
- DBpedia Distributed Extraction Framework: Extract structured data from Wikipedia in a parallel, distributed manner☆41Updated 2 years ago
- A Python wrapper over the GraphGen system☆37Updated 7 years ago