aamend / spark-gdelt
Binding the GDELT universe in a Spark environment
☆23 Updated last year
Alternatives and similar repositories for spark-gdelt:
Users who are interested in spark-gdelt are comparing it to the libraries listed below.
- Machine Learning Procedures and Functions for Neo4j ☆64 Updated 6 years ago
- Distributed Bayesian Entity Resolution in Apache Spark ☆57 Updated 3 years ago
- ☆15 Updated 2 years ago
- Real-time query spark and visualise it as graph. ☆24 Updated 7 years ago
- Collection of some algorithms for entity resolution ☆28 Updated 9 years ago
- Jupyter notebooks showing how to use Neo4j Graph Algorithms ☆52 Updated 4 years ago
- Record matching and entity resolution at scale in Spark ☆34 Updated last year
- ☆27 Updated 2 years ago
- Loading OpenSanctions into Neo4J and Linkurious ☆28 Updated 2 months ago
- ☆15 Updated 5 years ago
- This project provides procedures and functions to support machine learning applications with Neo4j. ☆37 Updated 6 years ago
- ☆16 Updated 3 years ago
- Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method ☆13 Updated last year
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea ☆13 Updated 8 years ago
- PySpark phonetic and string matching algorithms ☆39 Updated last year
- Spark NLP for Streamlit ☆15 Updated 3 years ago
- A DeepWalk implementation for ontologies using NetworkX and Gensim ☆18 Updated 7 years ago
- Probabilistic/machine-learning algorithms for medical record linkage [Critical Juncture] ☆14 Updated 7 years ago
- ☆11 Updated 6 years ago
- Demo of a supervised machine learning approach for Entity Resolution in graph using Neo4j GDS Link Prediction Pipelines ☆22 Updated 2 years ago
- A Scalable Data Cleaning Library for PySpark. ☆26 Updated 5 years ago
- importing Thomson Reuters' permID dataset into Neo4j ☆19 Updated 7 years ago
- ☆39 Updated 8 years ago
- I'm a curious person and analysing world news is fun. Here I'm gathering all my Gdelt-related projects. ☆19 Updated 4 years ago
- This repository contains the DFKI Product Corpus, a dataset of 174 documents annotated for product and company named entities, and the re… ☆12 Updated 5 months ago
- Extracting narrative timelines (i.e. order and timing of events) from text ☆20 Updated 5 years ago
- A Linked Data API for Crunchbase ☆13 Updated 5 years ago
- Repository of Notebooks taken from https://neo4j.com/graph-algorithms-book/ ☆26 Updated 4 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine ☆46 Updated 5 years ago
- Keyword extraction package for Spark. ☆12 Updated 8 years ago