andreybratus / RefineOnSpark
☆33 Updated 10 years ago
Alternatives and similar repositories for RefineOnSpark
Users who are interested in RefineOnSpark are comparing it to the repositories listed below:
- Starter project for building MemSQL Streamliner Pipelines ☆32 Updated 7 years ago
- BatchRefine adds batch processing capabilities to OpenRefine ☆50 Updated 8 years ago
- Apache NiFi NLP Processor ☆18 Updated last year
- ☆41 Updated 7 years ago
- Blazegraph Tinkerpop3 Implementation ☆61 Updated 4 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra. ☆113 Updated 3 years ago
- ☆61 Updated 5 months ago
- Cascading on Apache Flink® ☆54 Updated last year
- Mirror of Apache Stanbol (incubating) ☆112 Updated last year
- Examples of spark-lucenerdd ☆15 Updated last year
- Storm / Solr Integration ☆19 Updated last year
- Sandbox for Apache NiFi ☆24 Updated 3 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines ☆29 Updated 9 years ago
- ☆24 Updated 9 years ago
- ☆92 Updated 9 years ago
- ODPi specifications, developed by ODPi Runtime and ODPi Operations projects. Currently in Emeritus status ☆35 Updated 6 years ago
- Dynamic Distributed Dimensional Data Model ☆41 Updated 10 months ago
- ☆23 Updated 5 years ago
- Hadoop Data Pipeline using Falcon ☆15 Updated 8 years ago
- InsightEdge Core ☆20 Updated 11 months ago
- A complete custom processor project, for your reference. ☆18 Updated 9 years ago
- spark-sparql-connector ☆17 Updated 9 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets ☆92 Updated 9 years ago
- CDAP Applications ☆43 Updated 7 years ago
- Ductile DB is a graph database based on Hadoop/HBase which provides a vast set of features. ☆13 Updated 7 years ago
- Scalable Optical Character Recognition with Apache NiFi and Tesseract ☆32 Updated 8 years ago
- Templates for projects based on top of H2O. ☆37 Updated this week
- Analytic UIMA pipelines using Spark ☆23 Updated 9 years ago
- Simple Spark example of generating table stats for use of data quality checks ☆28 Updated 7 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 Updated last year