Pinball is a scalable workflow manager
☆1,047Dec 10, 2019Updated 6 years ago
Alternatives and similar repositories for pinball
Users who are interested in pinball are comparing it to the libraries listed below.
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,743Updated this week
- Serving system for batch generated data sets☆179May 11, 2017Updated 9 years ago
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows☆45,788Jun 12, 2026Updated last week
- Functional, Typesafe, Declarative Data Pipelines☆140Jan 29, 2018Updated 8 years ago
- Web UI for PrestoDB.☆2,748May 20, 2021Updated 5 years ago
- Secor is a service implementing Kafka log persistence☆1,858Mar 10, 2026Updated 3 months ago
- A machine learning package built for humans.☆4,803Nov 6, 2025Updated 7 months ago
- Simple DAG-based job scheduler in Python☆766Jul 31, 2019Updated 6 years ago
- Teletraan is Pinterest's deploy system.☆1,833Updated this week
- [NOT MAINTAINED] Bubbles – Python ETL framework☆462Oct 4, 2017Updated 8 years ago
- Data-Centric Pipelines and Data Versioning☆6,291Feb 3, 2025Updated last year
- A light-weight wrapper library around Spotify's Luigi workflow library to make writing scientific workflows more fluent, flexible and mod…☆335Dec 10, 2024Updated last year
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin☆6,588Apr 17, 2026Updated 2 months ago
- Azkaban workflow manager.☆4,504Jul 3, 2024Updated last year
- Mantl is a modern platform for rapidly deploying globally distributed services☆2,978May 7, 2019Updated 7 years ago
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow☆2,086Dec 15, 2023Updated 2 years ago
- A WDL, CWL and Python API supporting easy-to-use workflow engine. It is scalable, efficient and cross-platform (Linux/macOS).☆932Jun 11, 2026Updated last week
- A data science IDE for Python☆3,893Apr 16, 2018Updated 8 years ago
- Embrace the APIs of the future. Hug aims to make developing APIs as simple as possible, but no simpler.☆6,883Jul 4, 2024Updated last year
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter☆3,632Mar 1, 2023Updated 3 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,267Updated this week
- Disque is a distributed message broker☆8,069Mar 17, 2021Updated 5 years ago
- Fault tolerant job scheduler for Mesos which handles dependencies and ISO8601 based schedules☆4,377Jun 29, 2022Updated 3 years ago
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.☆28,636Jun 1, 2026Updated 2 weeks ago
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.☆5,532Sep 4, 2024Updated last year
- Data Migration for the Blaze Project☆1,006Jul 15, 2022Updated 3 years ago
- A curated list of awesome ETL frameworks, libraries, and software.☆3,563May 1, 2026Updated last month
- Realtime analytics, this includes the core components of Pulsar pipeline.☆650Nov 6, 2015Updated 10 years ago
- Netflix's distributed Data Pipeline☆796Apr 10, 2023Updated 3 years ago
- Apache Pinot - A realtime distributed OLAP datastore☆6,097Updated this week
- A powerful workflow engine implemented in pure Python☆1,900Jun 11, 2026Updated last week
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,033Nov 21, 2022Updated 3 years ago
- Python helpers for building dashboards using Flask and React☆2,266Jun 2, 2025Updated last year
- High-performance time-series aggregation for PostgreSQL☆2,663Feb 20, 2022Updated 4 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆473Apr 18, 2017Updated 9 years ago
- A Cascading Workflow Visualizer☆83May 9, 2023Updated 3 years ago
- Apache Superset is a Data Visualization and Data Exploration Platform☆73,298Updated this week
- s3concurrent uploads files to or downloads files from S3.☆44Jun 10, 2016Updated 10 years ago
- A Python stream processing engine modeled after Yahoo! Pipes☆1,601Jun 3, 2026Updated 2 weeks ago