Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask without any rewrites.
☆114 · Nov 10, 2025 · Updated 4 months ago
Alternatives and similar repositories for tutorials
Users that are interested in tutorials are comparing it to the libraries listed below
- A collection of python utility functions ☆11 · Mar 12, 2026 · Updated last week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,142 · Mar 12, 2026 · Updated last week
- An abstraction layer for parameter tuning ☆35 · Dec 16, 2025 · Updated 3 months ago
- The Distributed Node2Vec Algorithm for Very Large Graphs ☆18 · Jul 19, 2021 · Updated 4 years ago
- Fugue collections for Prefect 2.0 ☆38 · Oct 18, 2023 · Updated 2 years ago
- VS Code extension for PRQL lang ☆30 · Updated this week
- ULMFiT Method for German Language ☆15 · May 10, 2019 · Updated 6 years ago
- A Node.js connector for Delta Sharing. ☆12 · Apr 3, 2025 · Updated 11 months ago
- Python binding for DataFusion ☆59 · Jul 22, 2022 · Updated 3 years ago
- The WeatherWheel visualizes yearly weather data into a beautiful interactive compound radial chart. ☆11 · Apr 26, 2021 · Updated 4 years ago
- Open, Multi-modal Catalog for Data & AI, written in Rust ☆86 · Sep 30, 2024 · Updated last year
- Distributed SQL Engine in Python using Dask ☆411 · Aug 29, 2024 · Updated last year
- Trying out the Dataframe Polars library with Delta Lake ... feat Python. ☆12 · Jan 29, 2025 · Updated last year
- Surrogate Assisted Feature Extraction ☆37 · Aug 19, 2021 · Updated 4 years ago
- A fully-featured multi-source data pipeline for continuously extracting knowledge from COVID-19 data. ☆21 · Jun 20, 2021 · Updated 4 years ago
- BigQuery backend for Ibis ☆19 · Mar 29, 2023 · Updated 2 years ago
- A tool for translating Scala source code into readable and maintainable Java code ☆13 · Jan 3, 2026 · Updated 2 months ago
- Learn how to research fundamental factors using Pipeline, Alphalens, and Sharadar price and fundamental data. ☆16 · Apr 23, 2024 · Updated last year
- A JupyterLab extension providing a SQL formatter, auto-completion, and syntax highlighting for Spark SQL and Trino ☆96 · Feb 28, 2026 · Updated 2 weeks ago
- The Internals of PySpark ☆28 · Dec 29, 2024 · Updated last year
- Python script to create CDX index files of WARC data ☆16 · Sep 7, 2018 · Updated 7 years ago
- Cookiecutter template for FastAPI + Panel projects in Python ☆10 · Apr 18, 2022 · Updated 3 years ago
- Python extensions for PRQL ☆106 · Mar 13, 2026 · Updated last week
- R graphics device to render to {rgl} ☆13 · Oct 6, 2020 · Updated 5 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts ☆59 · Sep 5, 2012 · Updated 13 years ago
- Performant, composable online learning ☆16 · Feb 22, 2021 · Updated 5 years ago
- general functions for your data .pipe()-lines. ☆17 · Nov 8, 2023 · Updated 2 years ago
- BoilingData JS client (NodeJS and Browsers) ☆19 · Sep 25, 2024 · Updated last year
- Extremely lightweight compatibility layer between pandas and Polars ☆41 · Apr 26, 2024 · Updated last year
- A purely experimental DuckDB Deltalake extension ☆95 · Mar 13, 2026 · Updated last week
- ☆22 · Mar 21, 2023 · Updated 2 years ago
- easy install parquet-tools ☆183 · Jul 9, 2024 · Updated last year
- Foursquare Studio plugin for QGIS ☆21 · Jul 20, 2023 · Updated 2 years ago
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver. ☆25 · Sep 8, 2023 · Updated 2 years ago
- Snowflake & AWS Service Catalog Integration ☆11 · Apr 7, 2023 · Updated 2 years ago
- Gamera 4 for Python 3 ☆14 · May 16, 2025 · Updated 10 months ago
- the portable Python dataframe library ☆6,451 · Mar 13, 2026 · Updated last week
- An extension to add Prefect flow visualizations into your Sphinx documentation. ☆13 · Feb 24, 2022 · Updated 4 years ago
- Apache Arrow Flight SQL adapter for PostgreSQL ☆103 · Mar 10, 2026 · Updated last week