pditommaso / awesome-pipeline
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
☆6,455 · Updated last month
Alternatives and similar repositories for awesome-pipeline
Users who are interested in awesome-pipeline are comparing it to the libraries listed below:
- A curated list of awesome ETL frameworks, libraries, and software. ☆3,464 · Updated last year
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis… ☆18,522 · Updated 4 months ago
- Repository for the CWL standards. Use https://cwl.discourse.group/ for support 😊 ☆1,469 · Updated 10 months ago
- Curated list of resources about Apache Airflow ☆3,835 · Updated last year
- Data-Centric Pipelines and Data Versioning ☆6,256 · Updated 8 months ago
- Parallel computing with task scheduling ☆13,518 · Updated this week
- 📚 Parameterize, execute, and analyze notebooks ☆6,282 · Updated this week
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow ☆2,083 · Updated last year
- A next-generation curated knowledge sharing platform for data scientists and other technical professions. ☆5,529 · Updated last year
- the portable Python dataframe library ☆6,136 · Updated this week
- A DSL for data-driven computational pipelines ☆3,164 · Updated this week
- Utils for streaming large files (S3, HDFS, gzip, bz2...) ☆3,375 · Updated this week
- Docker Apache Airflow ☆3,812 · Updated 2 years ago
- Always know what to expect from your data. ☆10,816 · Updated this week
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows ☆42,676 · Updated this week
- Modin: Scale your Pandas workflows by changing a single line of code ☆10,294 · Updated last week
- Extract Transform Load for Python 3.5+ ☆1,603 · Updated 2 years ago
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️ ☆3,605 · Updated 4 months ago
- Python Stream Processing ☆6,824 · Updated last year
- Python Extract Transform and Load Tables of Data ☆1,289 · Updated last month
- Quilt is a data mesh for connecting people with actionable data ☆1,348 · Updated this week
- ETL best practices with airflow, with examples ☆1,344 · Updated last year
- An orchestration platform for the development, production, and observation of data assets. ☆14,177 · Updated this week
- A Grammar of Graphics for Python ☆4,383 · Updated 2 weeks ago
- Actively curated list of awesome BI tools. PRs welcome! ☆2,223 · Updated last year
- A scalable, efficient, cross-platform (Linux/macOS) and easy-to-use workflow engine in pure Python. ☆919 · Updated last week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application… ☆11,633 · Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python. ☆20,518 · Updated this week
- The Open Source Feature Store for AI/ML ☆6,383 · Updated this week
- Build, Manage and Deploy AI/ML Systems ☆9,558 · Updated last week