dataiku / wt1
A simple, open and powerful Web tracker
☆30Updated 2 years ago
Related projects: ⓘ
- ☆57Updated this week
- A collection of datasets and databases☆24Updated 6 years ago
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al.☆20Updated 8 months ago
- ☆14Updated this week
- ☆33Updated 9 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆60Updated 2 weeks ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- ☆41Updated 7 years ago
- CDAP Applications☆43Updated 6 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- ☆38Updated this week
- functionstest☆33Updated 7 years ago
- Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Services☆25Updated 3 months ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A…☆13Updated 7 years ago
- OrientDB ETL tools☆34Updated 3 years ago
- Example stream processing job, written in Scala with Apache Beam, for Google Cloud Dataflow☆29Updated 7 years ago
- ☆70Updated 3 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases☆13Updated 8 years ago
- Dockerfile for Apache Zeppelin☆17Updated 8 years ago
- Power BI API adapter for Apache Spark (deprecated)☆26Updated 6 years ago
- ☆21Updated this week
- Library for organizing batch processing pipelines in Apache Spark☆41Updated 7 years ago
- ☆9Updated 9 years ago
- ☆23Updated 4 years ago
- ☆10Updated this week
- Cascading on Apache Flink®☆54Updated 7 months ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated 10 months ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 8 years ago