andreybratus / RefineOnSpark
☆33Updated 10 years ago
Alternatives and similar repositories for RefineOnSpark:
Users that are interested in RefineOnSpark are comparing it to the libraries listed below
- ☆41Updated 7 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago
- Apache NiFi NLP Processor☆18Updated last year
- BatchRefine adds batch processing capabilities to OpenRefine☆50Updated 8 years ago
- Mirror of Apache Stanbol (incubating)☆112Updated last year
- Blazegraph Tinkerpop3 Implementation☆61Updated 4 years ago
- Power BI API adapter for Apache Spark (deprecated)☆26Updated 7 years ago
- Visualize (.avdl and .proto format) schema files as a UML diagram using Graphviz☆30Updated 6 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆50Updated last year
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Updated 7 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.☆47Updated 8 years ago
- ☆61Updated 6 months ago
- ODPi specifications, developed by ODPi Runtime and ODPi Operations projects. Currently in Emeritus status☆35Updated 6 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- ☆24Updated 9 years ago
- ☆71Updated 7 years ago
- spark-sparql-connector☆17Updated 9 years ago
- functionstest☆33Updated 8 years ago
- Analytic UIMA pipelines using Spark☆23Updated 9 years ago
- Vagrant, Apache Spark and Apache Zeppelin VM for teaching☆44Updated 7 years ago
- ☆92Updated 9 years ago
- Hadoop Data Pipeline using Falcon☆15Updated 8 years ago
- A collection of tools for accessing Neo4j graph databases from Apache NiFi.☆23Updated 6 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆37Updated last year
- Storm / Solr Integration☆19Updated last year
- ☆23Updated 5 years ago
- an open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)☆54Updated 7 years ago
- Cascading on Apache Flink®☆54Updated last year