andreybratus / RefineOnSpark
☆33 Updated 11 years ago
Alternatives and similar repositories for RefineOnSpark
Users who are interested in RefineOnSpark compare it to the libraries listed below.
- Mirror of Apache Stanbol (incubating) ☆114 Updated last year
- Apache NiFi NLP Processor ☆18 Updated last year
- The JUpyter-GRemlin Interface ☆35 Updated 5 months ago
- An open-source, vendor-neutral data context service. ☆160 Updated 7 years ago
- ☆71 Updated 7 years ago
- ☆41 Updated 8 years ago
- ☆61 Updated 2 weeks ago
- HopsWorks - Hadoop for Humans ☆117 Updated 6 years ago
- Dynamic Distributed Dimensional Data Model ☆43 Updated last year
- Comprises the whole SANSA stack ☆15 Updated 5 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 Updated last year
- Sparql -> SQL Rewriter enabling virtual RDB -> RDF mappings ☆131 Updated last year
- BatchRefine adds batch processing capabilities to OpenRefine ☆51 Updated 8 years ago
- Blazegraph Tinkerpop3 Implementation ☆62 Updated 4 years ago
- ☆19 Updated 8 years ago
- ☆24 Updated 9 years ago
- Blazegraph Samples with Sesame, Blueprints, and RDR ☆71 Updated 4 years ago
- A framework for systematically quality controlling big data. ☆40 Updated 2 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine ☆45 Updated 6 years ago
- Serializes RDF from a SPARQL endpoint to JSON-LD documents ☆10 Updated 7 years ago
- ☆107 Updated 2 years ago
- ☆92 Updated 9 years ago
- spark-sparql-connector ☆17 Updated 9 years ago
- ☆42 Updated 3 years ago
- Mirror of Apache Apex malhar ☆133 Updated 5 years ago
- Docker image for apache zeppelin ☆38 Updated 8 years ago
- Simple Spark example of generating table stats for use of data quality checks ☆28 Updated 8 years ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine ☆167 Updated 4 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets ☆92 Updated 9 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o… ☆52 Updated 3 months ago