pditommaso / awesome-pipeline
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
☆6,299Updated last week
Alternatives and similar repositories for awesome-pipeline:
Users that are interested in awesome-pipeline are comparing it to the libraries listed below
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,145Updated last month
- A curated list of awesome ETL frameworks, libraries, and software.☆3,365Updated 7 months ago
- 📚 Parameterize, execute, and analyze notebooks☆6,105Updated 2 months ago
- A next-generation curated knowledge sharing platform for data scientists and other technical professions.☆5,507Updated 6 months ago
- Curated list of resources about Apache Airflow☆3,748Updated 6 months ago
- Data-Centric Pipelines and Data Versioning☆6,208Updated last month
- Repository for the CWL standards. Use https://cwl.discourse.group/ for support 😊☆1,460Updated 3 months ago
- the portable Python dataframe library☆5,588Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,353Updated 5 months ago
- Parallel computing with task scheduling☆13,002Updated this week
- Declarative visualization library for Python☆9,627Updated this week
- Panel: The powerful data exploration & web app framework for Python☆5,093Updated this week
- Quickly and accurately render even the largest data.☆3,380Updated last week
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow☆2,082Updated last year
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️☆3,553Updated 5 months ago
- a curated list of awesome streaming frameworks, applications, etc☆2,777Updated 2 months ago
- Data Apps & Dashboards for Python. No JavaScript Required.☆22,108Updated this week
- A curated list of awesome open source workflow engines☆6,840Updated 2 months ago
- A curated list of awesome Jupyter projects, libraries and resources☆4,163Updated this week
- Utils for streaming large files (S3, HDFS, gzip, bz2...)☆3,277Updated this week
- An orchestration platform for the development, production, and observation of data assets.☆12,681Updated this week
- Run your code in the cloud, with technology so advanced, it feels like magic!☆2,606Updated last week
- Quilt is a data mesh for connecting people with actionable data☆1,331Updated this week
- NumPy and Pandas interface to Big Data☆3,194Updated last year
- With Holoviews, your data visualizes itself.☆2,761Updated this week
- Build, Deploy and Manage AI/ML Systems☆8,612Updated this week
- Actively curated list of awesome BI tools. PRs welcome!☆2,137Updated 6 months ago
- A curated list of data engineering tools for software developers☆7,151Updated 3 weeks ago
- Modin: Scale your Pandas workflows by changing a single line of code☆10,047Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,046Updated 5 months ago