snowplow / snowplow-python-analytics-sdk
Python SDK for working with Snowplow enriched events in Spark, AWS Lambda et al.
☆21Updated 2 months ago
Alternatives and similar repositories for snowplow-python-analytics-sdk:
Users that are interested in snowplow-python-analytics-sdk are comparing it to the libraries listed below
- Snowplow event tracker for Python. Add analytics to your Python and Django apps, webapps and games☆43Updated 2 months ago
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al.☆20Updated 2 months ago
- Example for an airflow plugin☆49Updated 8 years ago
- Scheduled task execution on top of AWS Data Pipeline☆43Updated 9 years ago
- Luigi Plugin for Hubot☆35Updated 8 years ago
- Run templatable playbooks of SQL scripts in series and parallel on Redshift, PostgreSQL, BigQuery and Snowflake☆81Updated last year
- Docker images for Snowplow, Iglu and associated projects☆61Updated 3 years ago
- Airflow plugin to transfer arbitrary files between operators☆78Updated 6 years ago
- CLI tool to launch Spark jobs on AWS EMR☆67Updated last year
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- Airflow workflow management platform chef cookbook.☆70Updated 5 years ago
- Utils around luigi.☆65Updated 3 years ago
- Tool to flatten stream of JSON-like objects, configured via schema☆33Updated 5 years ago
- Amazon Redshift SQLAlchemy Dialect☆48Updated 9 years ago
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated 4 months ago
- Task Orchestration Tool Based on SWF and boto3☆38Updated 6 years ago
- Helpers & syntactic sugar for PySpark.☆61Updated last year
- JSON -> Relational DB Column Types☆63Updated 2 years ago
- A client for the Confluent Schema Registry API implemented in Python☆22Updated 6 years ago
- Serializes data into a JSON format using AVRO schema.☆137Updated 3 years ago
- Python Driver for Apache Drill.☆58Updated last year
- Python bindings for TrailDB☆39Updated 5 years ago
- Slack notifications for the Luigi workflow manager☆46Updated 3 years ago
- ☆54Updated 7 years ago
- Snowplow docker containers☆8Updated 8 years ago
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR☆19Updated 10 months ago
- Arbalest is a Python data pipeline orchestration library for Amazon S3 and Amazon Redshift. It automates data import into Redshift and ma…☆41Updated 9 years ago
- Functional Airflow DAG definitions.☆38Updated 7 years ago
- A Python library for dealing with splittable files☆42Updated 5 years ago