pditommaso / awesome-pipeline
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
☆6,363 Updated 2 months ago
Alternatives and similar repositories for awesome-pipeline
Users that are interested in awesome-pipeline are comparing it to the libraries listed below
- A curated list of awesome ETL frameworks, libraries, and software. ☆3,398 Updated 9 months ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s… ☆8,381 Updated 7 months ago
- Repository for the CWL standards. Use https://cwl.discourse.group/ for support 😊 ☆1,465 Updated 5 months ago
- 📚 Parameterize, execute, and analyze notebooks ☆6,156 Updated last month
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis… ☆18,268 Updated 3 weeks ago
- Curated list of resources about Apache Airflow ☆3,779 Updated 8 months ago
- Parallel computing with task scheduling ☆13,190 Updated last week
- Docker Apache Airflow ☆3,801 Updated 2 years ago
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️ ☆3,571 Updated 7 months ago
- Modin: Scale your Pandas workflows by changing a single line of code ☆10,152 Updated last week
- Data-Centric Pipelines and Data Versioning ☆6,225 Updated 3 months ago
- Build, Manage and Deploy AI/ML Systems ☆8,807 Updated this week
- A curated list of awesome open source workflow engines ☆7,039 Updated 2 weeks ago
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow ☆2,082 Updated last year
- An orchestration platform for the development, production, and observation of data assets. ☆13,121 Updated this week
- A logical, reasonably standardized, but flexible project structure for doing and sharing data science work. ☆8,882 Updated last week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and… ☆10,322 Updated last week
- the portable Python dataframe library ☆5,751 Updated this week
- Declarative visualization library for Python ☆9,768 Updated last week
- A curated list of awesome big data frameworks, resources and other awesomeness. ☆13,605 Updated 3 months ago
- A next-generation curated knowledge sharing platform for data scientists and other technical professions. ☆5,518 Updated 8 months ago
- Voilà turns Jupyter notebooks into standalone web applications ☆5,688 Updated last week
- 🦉 Data Versioning and ML Experiments ☆14,462 Updated this week
- Panel: The powerful data exploration & web app framework for Python ☆5,205 Updated this week
- Python Stream Processing ☆6,787 Updated 9 months ago
- Computing with Python functions. ☆4,064 Updated this week
- ETL best practices with airflow, with examples ☆1,332 Updated 7 months ago
- Always know what to expect from your data. ☆10,376 Updated last week
- The Open Source Feature Store for AI/ML ☆6,056 Updated this week
- Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts ☆6,847 Updated 3 weeks ago