A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
☆2,155Apr 30, 2026Updated this week
Alternatives and similar repositories for fugue
Users that are interested in fugue are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Nov 10, 2025Updated 5 months ago
- the portable Python dataframe library☆6,521Updated this week
- A light-weight, flexible, and expressive statistical data testing library☆4,317Updated this week
- High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale☆5,446Updated this week
- An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model perf…☆2,816Jan 10, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆3,057Apr 29, 2026Updated last week
- Modin: Scale your Pandas workflows by changing a single line of code☆10,384Feb 10, 2026Updated 2 months ago
- Python SQL Parser and Transpiler☆9,196Updated this week
- Always know what to expect from your data.☆11,458Updated this week
- Making data lake work for time series☆1,191Aug 21, 2024Updated last year
- An abstraction layer for parameter tuning☆35Dec 16, 2025Updated 4 months ago
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,370Updated this week
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️☆3,626May 29, 2025Updated 11 months ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,505Apr 1, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two line…