andreybratus / RefineOnSparkLinks
☆33Updated 10 years ago
Alternatives and similar repositories for RefineOnSpark
Users that are interested in RefineOnSpark are comparing it to the libraries listed below
Sorting:
- BatchRefine adds batch processing capabilities to OpenRefine☆50Updated 8 years ago
- Apache NiFi NLP Processor☆18Updated last year
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- ☆24Updated 9 years ago
- Blazegraph Tinkerpop3 Implementation☆61Updated 4 years ago
- ☆41Updated 7 years ago
- Single view demo☆14Updated 9 years ago
- ☆61Updated 8 months ago
- Simple Spark example of generating table stats for use of data quality checks☆28Updated 8 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago
- Cascading on Apache Flink®☆54Updated last year
- Mirror of Apache Stanbol (incubating)☆112Updated last year
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- A collection of tools for accessing Neo4j graph databases from Apache NiFi.☆23Updated 6 years ago
- InsightEdge Core☆20Updated this week
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 9 years ago
- ☆23Updated 5 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A…☆13Updated 8 years ago
- Pig on Apache Spark☆83Updated 10 years ago
- ☆111Updated 8 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 9 years ago
- Comprises the whole SANSA stack☆15Updated 4 years ago
- ☆92Updated 9 years ago
- Hadoop Data Pipeline using Falcon☆15Updated 9 years ago
- ☆71Updated 7 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- An example project for doing grid search in MLlib☆13Updated 10 years ago
- CDAP Applications☆43Updated 7 years ago
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover…☆13Updated 6 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).☆15Updated 4 years ago