Pinball is a scalable workflow manager
☆1,043Dec 10, 2019Updated 6 years ago
Alternatives and similar repositories for pinball
Users that are interested in pinball are comparing it to the libraries listed below
Sorting:
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,683Updated this week
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows☆44,430Updated this week
- Serving system for batch generated data sets☆177May 11, 2017Updated 8 years ago
- Secor is a service implementing Kafka log persistence☆1,858Feb 25, 2026Updated last week
- Web UI for PrestoDB.☆2,751May 20, 2021Updated 4 years ago
- A machine learning package built for humans.☆4,800Nov 6, 2025Updated 3 months ago
- Functional, Typesafe, Declarative Data Pipelines☆140Jan 29, 2018Updated 8 years ago
- [NOT MAINTAINED] Bubbles – Python ETL framework☆459Oct 4, 2017Updated 8 years ago
- Teletraan is Pinterest's deploy system.☆1,824Updated this week
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin☆6,531Feb 9, 2026Updated 3 weeks ago
- Data-Centric Pipelines and Data Versioning☆6,288Feb 3, 2025Updated last year
- Simple DAG-based job scheduler in Python☆766Jul 31, 2019Updated 6 years ago
- Mantl is a modern platform for rapidly deploying globally distributed services☆2,982May 7, 2019Updated 6 years ago
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow☆2,086Dec 15, 2023Updated 2 years ago
- A WDL, CWL and Python API supporting easy-to-use workflow engine. It is scalable, efficient and cross-platform (Linux/macOS).☆926Updated this week
- A data science IDE for Python☆3,901Apr 16, 2018Updated 7 years ago
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter☆3,688Mar 1, 2023Updated 3 years ago
- Azkaban workflow manager.☆4,514Jul 3, 2024Updated last year
- A curated list of awesome ETL frameworks, libraries, and software.☆3,520Jul 23, 2024Updated last year
- Realtime analytics, this includes the core components of Pulsar pipeline.☆651Nov 6, 2015Updated 10 years ago
- Embrace the APIs of the future. Hug aims to make developing APIs as simple as possible, but no simpler.☆6,909Jul 4, 2024Updated last year
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.☆28,241Updated this week
- Apache Pinot - A realtime distributed OLAP datastore☆6,037Updated this week
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.☆5,543Sep 4, 2024Updated last year
- Disque is a distributed message broker☆8,065Mar 17, 2021Updated 4 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,037Nov 21, 2022Updated 3 years ago
- Netflix's distributed Data Pipeline☆797Apr 10, 2023Updated 2 years ago
- A Python stream processing engine modeled after Yahoo! Pipes☆1,601Dec 28, 2021Updated 4 years ago
- Apache Superset is a Data Visualization and Data Exploration Platform☆70,755Updated this week
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,261Updated this week
- Fault tolerant job scheduler for Mesos which handles dependencies and ISO8601 based schedules☆4,380Jun 29, 2022Updated 3 years ago
- Data Migration for the Blaze Project☆1,005Jul 15, 2022Updated 3 years ago
- A powerful workflow engine implemented in pure Python☆1,860Jan 15, 2026Updated last month
- High-performance time-series aggregation for PostgreSQL☆2,657Feb 20, 2022Updated 4 years ago
- A generic JSON document store with sharing and synchronisation capabilities.☆4,422Updated this week
- Python helpers for building dashboards using Flask and React☆2,270Jun 2, 2025Updated 9 months ago
- Spinnaker is an open source, multi-cloud continuous delivery platform for releasing software changes with high velocity and confidence.☆9,690Updated this week
- StackStorm (aka "IFTTT for Ops") is event-driven automation for auto-remediation, incident responses, troubleshooting, deployments, and m…☆6,412Feb 19, 2026Updated 2 weeks ago
- StreamAlert is a serverless, realtime data analysis framework which empowers you to ingest, analyze, and alert on data from any environme…☆2,888Oct 23, 2023Updated 2 years ago