snowplow-archive / snowplow.github.com
Legacy Snowplow website, switched off 25 April 2017
☆16Updated 7 years ago
Alternatives and similar repositories for snowplow.github.com:
Users that are interested in snowplow.github.com are comparing it to the libraries listed below
- Sample repo for luigi tasks & config☆36Updated 8 years ago
- Python SDK for working with Snowplow enriched events in Spark, AWS Lambda et al.☆21Updated 2 months ago
- PySpark for Elastic Search☆55Updated 7 years ago
- REST web service for scoring PMML models☆50Updated 11 years ago
- A platform for real-time streaming search☆103Updated 8 years ago
- Coding exercises for Apache Spark☆104Updated 9 years ago
- Functional Airflow DAG definitions.☆38Updated 7 years ago
- A recommendation engine for subreddits using the kNN algorithm.☆27Updated 9 years ago
- ☆110Updated 7 years ago
- Airflow plugin to transfer arbitrary files between operators☆78Updated 6 years ago
- An Apache Spark-shell backend for IPython☆105Updated 3 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 3 years ago
- Fetch and plot AWS spot pricing history☆23Updated 8 years ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 9 years ago
- Snowplow event tracker for Python. Add analytics to your Python and Django apps, webapps and games☆43Updated 2 months ago
- A cookiecutter template for Apache Spark applications written in Scala☆10Updated 6 years ago
- Data models for snowplow analytics.☆126Updated last week
- SQL Recipes for Web Analytics☆34Updated 9 years ago
- Arbalest is a Python data pipeline orchestration library for Amazon S3 and Amazon Redshift. It automates data import into Redshift and ma…☆41Updated 9 years ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆94Updated 4 years ago
- Data Science box: Spark, Jupyter, R+RStudio, Zeppelin, Python 2 & 3, Java, Scala.☆39Updated 6 years ago
- ☆34Updated 8 years ago
- Google BigQuery support for Spark, SQL, and DataFrames☆155Updated 5 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Updated 7 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Updated 4 years ago
- Rudimentary Bayesian Beta-Bernoulli A/B testing inference and visualization code.☆63Updated 10 years ago
- Machines and people collaborating together through Jupyter notebooks.☆18Updated 7 years ago
- Python wrapper for the hadoop WebHDFS Rest API☆32Updated 9 years ago
- A guide for setting up Spark + PySpark under Ubuntu linux☆56Updated 7 years ago
- Hottest topic detection on Reddit online comment stream with Kafka, Spark Streaming and Cassandra☆8Updated 8 years ago