dataiku / wt1
A simple, open and powerful Web tracker
☆30Updated 2 years ago
Alternatives and similar repositories for wt1:
Users that are interested in wt1 are comparing it to the libraries listed below
- A collection of datasets and databases☆24Updated 6 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago
- CDAP Applications☆43Updated 7 years ago
- Apache NiFi Custom Processor Extracting Text From Files with Apache Tika☆35Updated last year
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover…☆13Updated 5 years ago
- ☆9Updated 9 years ago
- Examples of user defined functions for Apache Drill☆19Updated 7 years ago
- An example project for doing grid search in MLlib☆13Updated 10 years ago
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆77Updated 14 years ago
- OrientDB ETL tools☆34Updated 4 years ago
- Mirror of Apache Blur☆33Updated 6 years ago
- ☆33Updated 10 years ago
- an open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)☆54Updated 7 years ago
- functionstest☆33Updated 8 years ago
- ☆9Updated 9 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Updated 8 years ago
- Temporal_Graph_library☆25Updated 6 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- ☆41Updated 7 years ago
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al.☆20Updated 4 months ago
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what…☆95Updated 5 years ago
- SparkListener that converts SparkListenerEvents to JSON and forwards them to an external service via RPC.☆17Updated 3 years ago
- Cascading on Apache Flink®☆54Updated last year
- Ambari and Cloudera Manager in Docker☆22Updated 6 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases☆13Updated 9 years ago
- Ambari View for the Ambari Store☆15Updated 9 years ago
- Power BI API adapter for Apache Spark (deprecated)☆26Updated 7 years ago