Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
☆18,738May 19, 2026Updated 3 weeks ago
Alternatives and similar repositories for luigi
Users that are interested in luigi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows☆45,710Updated this week
- Parallel computing with task scheduling☆13,849Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆22,560Updated this week
- Apache Superset is a Data Visualization and Data Exploration Platform☆73,212Updated this week
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.☆28,621Jun 1, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.☆28,201Apr 1, 2026Updated 2 months ago
- An orchestration platform for the development, production, and observation of data assets.☆15,647Updated this week
- A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C proje…☆24,919Apr 1, 2026Updated 2 months ago
- Pinball is a scalable workflow manager☆1,047Dec 10, 2019Updated 6 years ago
- Distributed Task Queue (development branch)☆28,557Jun 3, 2026Updated last week
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin☆6,587Apr 17, 2026Updated last month
- Python Development Workflow for Humans.☆25,069Updated this week
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆33,634May 19, 2026Updated 3 weeks ago
- Python Stream Processing☆6,822Jul 27, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Data-Centric Pipelines and Data Versioning☆6,293Feb 3, 2025Updated last year
- Serverless Python☆11,837Mar 23, 2023Updated 3 years ago
- Accelerate your web app development | Build fast. Run fast.☆18,630May 31, 2026Updated last week
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk☆14,248Oct 29, 2025Updated 7 months ago
- Data Apps & Dashboards for Python. No JavaScript Required.☆24,234Jun 4, 2026Updated last week
- Python packaging and dependency management made easy☆34,276Updated this week
- Simple job queues for Python☆10,648Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆42,789Updated this week
- Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.☆20,218May 8, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Interactive Data Visualization in the browser, from Python☆20,400Updated this week
- A formatter for Python files☆13,981Updated this week
- Build, Manage and Deploy AI/ML Systems☆10,114Jun 3, 2026Updated last week
- Embrace the APIs of the future. Hug aims to make developing APIs as simple as possible, but no simpler.☆6,886Jul 4, 2024Updated last year
- 🦉 Data Versioning and ML Experiments☆15,662Updated this week
- The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data☆47,613Updated this week
- The no-magic web API and microservices framework for Python developers, with a focus on reliability and performance at scale.☆9,797Updated this week
- The uncompromising Python code formatter☆41,558Updated this week
- The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, a…☆26,338Jun 5, 2026Updated last week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Write scalable load tests in plain Python 🚗💨☆27,888Updated this week
- Python composable command line interface toolkit☆17,527Updated this week
- Ultra fast asyncio event loop.☆11,814May 4, 2026Updated last month
- 📚 Parameterize, execute, and analyze notebooks☆6,449May 12, 2026Updated 3 weeks ago
- A Fast, Extensible Progress Bar for Python and CLI☆31,187Updated this week
- Optional static typing for Python☆20,471Updated this week
- Machine Learning Toolkit for Kubernetes☆15,709May 24, 2026Updated 2 weeks ago