High Performance Data Processing in Python
☆368May 23, 2026Updated this week
Alternatives and similar repositories for Bodo
Users that are interested in Bodo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An open-source, community-driven REST catalog for Apache Iceberg!☆30Jun 26, 2024Updated last year
- Simple Workflow Framework based on Hamilton☆24May 2, 2026Updated 3 weeks ago
- Executable memory system for tabular data work☆510Updated this week
- Apache Arrow PostgreSQL connector☆63Feb 12, 2024Updated 2 years ago
- Lightweight and extensible compatibility layer between dataframe libraries!☆1,610Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆25Sep 8, 2023Updated 2 years ago
- Python stream processing for analytics☆41Updated this week
- An extensible, state-of-the-art framework for columnar compression, and the fastest FOSS columnar file format. Formerly at @spiraldb, now…☆2,946Updated this week
- A minimal Python library for Apache Arrow, connecting to the Rust Arrow crate☆260Mar 24, 2026Updated 2 months ago
- Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.☆2,622Updated this week
- High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale☆5,505Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆333Mar 28, 2023Updated 3 years ago
- C++20 idiomatic APIs for the Apache Arrow Columnar Format☆142May 11, 2026Updated 2 weeks ago
- Multihreaded 64 bit c++ files for processing numba arrays☆18Apr 23, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Go library for decoding generic map values and native Go structures into Arrow.☆17Jan 30, 2026Updated 3 months ago
- Y-based Jupyter widgets for Python☆14May 6, 2026Updated 2 weeks ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆118Jul 27, 2025Updated 9 months ago
- The open-source Useful SDK. One python decorator in the Useful library allows for full observability of Python functions within an ETL.☆19Jan 11, 2024Updated 2 years ago
- Pushdown compute from Snowflake to DuckDB running on your infrastructure☆202Oct 20, 2025Updated 7 months ago
- Convert from protobuf to arrow and back☆40May 8, 2026Updated 2 weeks ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆152Jan 26, 2026Updated 3 months ago
- the portable Python dataframe library☆6,545Updated this week
- Slipstream provides a data-flow model to simplify development of stateful streaming applications.☆39Feb 19, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆279Sep 25, 2024Updated last year
- 🏃♀️ Minimalist SQL orchestrator☆323May 12, 2026Updated last week
- ☆68May 9, 2025Updated last year
- Apache DataFusion Python Bindings☆582Updated this week
- DuckDB for streaming data☆776Sep 4, 2025Updated 8 months ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆96Feb 22, 2025Updated last year
- Typed, annotated vectors for well-documented datasets☆11Apr 15, 2026Updated last month
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆276Apr 17, 2026Updated last month
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,513Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Manage Multimodal Agentic Context Lifecycle with Lance☆66Apr 22, 2026Updated last month
- The universal metrics layer. Compatible with 15+ formats: Cube, MetricFlow, LookML, Omni, BSL, LDM, Cortex, Malloy, OSI, SML, TML, Hex, R…☆92Updated this week
- Is the GIL seeing someone else? How's about repetitively calling and seeing how long it takes to answer?☆16Jan 7, 2026Updated 4 months ago
- GlareDB: A light and fast SQL database for analytics☆1,013Nov 14, 2025Updated 6 months ago
- Toy distributed PostgreSQL by implementing SQL over KV☆11Jan 14, 2026Updated 4 months ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,162Updated this week
- ☆192May 21, 2025Updated last year