Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.
☆97 · Jun 10, 2026 · Updated this week
Alternatives and similar repositories for flowman
Users that are interested in flowman are comparing it to the libraries listed below.
- A library enabling DAG structuring of data processing programs such as ETLs · ☆17 · Apr 13, 2026 · Updated 2 months ago
- ☆11 · May 16, 2022 · Updated 4 years ago
- Create a data mart using Azure Data Factory as ELT / ETL, Azure Synapse as database and Power BI as visualization tool. · ☆19 · Apr 20, 2022 · Updated 4 years ago
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs … · ☆160 · Dec 10, 2022 · Updated 3 years ago
- A simplified, lightweight ETL Framework based on Apache Spark · ☆588 · Jan 24, 2024 · Updated 2 years ago
- Kubernetes operator for Apache Hadoop HDFS used by the Stackable Data Platform · ☆52 · Updated this week
- Observability Python library - Powered by Kensu · ☆22 · Oct 15, 2024 · Updated last year
- Explore external scalers built by the community. · ☆12 · Mar 23, 2026 · Updated 2 months ago
- Spark data profiling utilities · ☆23 · Nov 24, 2018 · Updated 7 years ago
- This repo provides best practice guidance, plan template, solution assessment tool etc. to help Machine Learning Studio (classic) customer… · ☆20 · Jul 23, 2024 · Updated last year
- Multi-stage, config driven, SQL based ETL framework using PySpark · ☆26 · Sep 16, 2019 · Updated 6 years ago
- ZIO wrapper for AWS S3 SDK async client · ☆11 · Feb 21, 2020 · Updated 6 years ago
- React Bootstrap 4 Tabs Component · ☆11 · Feb 3, 2023 · Updated 3 years ago
- Code snippets used in demos recorded for the blog. · ☆42 · Apr 30, 2026 · Updated last month
- Trino load balancer with support for routing, queueing and auto-scaling · ☆37 · Apr 20, 2026 · Updated last month
- senseBox documentation as beautiful books · ☆11 · Apr 18, 2023 · Updated 3 years ago
- sbt plugin to detect Akka module mismatches and fail build · ☆10 · Sep 15, 2025 · Updated 9 months ago
- Simple application showing how to work with ScyllaDB and Golang using gocqlx. · ☆11 · Jul 8, 2023 · Updated 2 years ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets. · ☆3,620 · Jun 8, 2026 · Updated last week
- An Operator for Apache Druid for Stackable Data Platform · ☆12 · Updated this week
- ☆14 · Oct 8, 2019 · Updated 6 years ago
- Elasticsearch querying library · ☆20 · Jun 16, 2019 · Updated 6 years ago
- Data Lineage Tracking And Visualization Solution · ☆660 · Jun 2, 2026 · Updated last week
- Hadoop/Hive/Spark container to perform CI tests · ☆10 · Dec 26, 2020 · Updated 5 years ago
- Stripdown of the mean.io stack for the ngFantasyFootball application · ☆111 · Apr 30, 2014 · Updated 12 years ago
- Repository of the metadata specification mobilityDCAT-AP · ☆18 · Jun 8, 2026 · Updated last week
- Github bot for keeping your Bazel dependencies up-to-date and clean · ☆27 · Mar 20, 2020 · Updated 6 years ago
- A kubernetes operator for the Open Policy Agent · ☆21 · Updated this week
- A feature toggle framework for Java · ☆53 · May 26, 2026 · Updated 2 weeks ago
- Keras/Tensorflow 2D Convolutional MNIST Classifier · ☆11 · May 19, 2017 · Updated 9 years ago
- A benchmark for Solid to simulate vaults with social network data. · ☆11 · May 14, 2026 · Updated last month
- This dbt package contains macros to support unit testing that can be (re)used across dbt projects. · ☆447 · May 13, 2026 · Updated last month
- UI for mondrian-rest · ☆20 · Apr 17, 2019 · Updated 7 years ago
- DataForge helps data teams write functional transformation pipelines by leveraging software engineering principles · ☆59 · Jun 4, 2026 · Updated last week
- ARM template to deploy a VM with IoT Edge pre-installed (via cloud-init) · ☆24 · Jun 4, 2024 · Updated 2 years ago
- Point your MySQL slow query log AND get execution plans and stats. Filter and sort them by properties. · ☆11 · May 18, 2023 · Updated 3 years ago
- PDF to JSON, JSON to PDF and etc. · ☆12 · Apr 18, 2018 · Updated 8 years ago
- Information about available credential formats · ☆14 · Nov 17, 2024 · Updated last year
- An implementation of VGG-M from Return of the Devil in the Details: Delving Deep into Convolutional Nets. · ☆12 · Dec 24, 2016 · Updated 9 years ago