Interana / eventsimLinks
Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.
☆534Updated 3 years ago
Alternatives and similar repositories for eventsim
Users that are interested in eventsim are comparing it to the libraries listed below
Sorting:
- A curated list of data engineering tools for software developers☆494Updated 8 years ago
- ☆202Updated 2 years ago
- ETL best practices with airflow, with examples☆1,353Updated last year
- Spark style guide☆271Updated last year
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks☆361Updated 8 years ago
- Data ingestion library for Amundsen to build graph and search index☆204Updated last year
- Airflow basics tutorial☆396Updated 4 years ago
- Create HTML profiling reports from Apache Spark DataFrames☆197Updated 5 years ago
- Airflow training for the crunch conf☆104Updated 7 years ago
- A boilerplate for writing PySpark Jobs☆394Updated last year
- Guides and docs to help you get up and running with Apache Airflow.☆815Updated last month
- Apache Airflow integration for dbt☆412Updated last year
- Airflow Unit Tests and Integration Tests☆261Updated 3 years ago
- pyspark methods to enhance developer productivity 📣 👯 🎉☆682Updated 9 months ago
- Front-end service library for Amundsen☆279Updated last month
- How to build an awesome data engineering team☆101Updated 6 years ago
- Databricks - Apache Spark™ - 2X Certified Developer☆265Updated 5 years ago
- An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.☆1,454Updated 5 years ago
- Demo project for dbt on Databricks☆32Updated 5 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆175Updated 6 months ago
- Metadata service library for Amundsen☆82Updated 3 weeks ago
- ☆198Updated 3 years ago
- A repository of sample code to accompany our blog post on Airflow and dbt.☆182Updated 2 years ago
- Performant Redshift data source for Apache Spark☆141Updated last month
- This repository has moved into https://github.com/dbt-labs/dbt-adapters☆444Updated 5 months ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆168Updated 2 years ago
- Repository of sample Databricks notebooks☆273Updated last year
- Example DAGs using hooks and operators from Airflow Plugins☆347Updated 7 years ago
- Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.☆91Updated last year
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆34Updated 5 years ago