High Performance Data Processing in Python
☆370Jun 26, 2026Updated last week
Alternatives and similar repositories for Bodo
Users who are interested in Bodo are comparing it to the libraries listed below.
- An open-source, community-driven REST catalog for Apache Iceberg!☆30Jun 26, 2024Updated 2 years ago
- Simple Workflow Framework based on Hamilton☆24May 2, 2026Updated 2 months ago
- Executable memory system for tabular data that works in your harness.☆526Updated this week
- Apache Arrow PostgreSQL connector☆64Feb 12, 2024Updated 2 years ago
- Lightweight and extensible compatibility layer between dataframe libraries!☆1,666Jun 28, 2026Updated last week
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆25Sep 8, 2023Updated 2 years ago
- Python stream processing for analytics☆41Jun 17, 2026Updated 2 weeks ago
- An extensible, state-of-the-art framework for columnar compression, and the fastest FOSS columnar file format. Formerly at @spiraldb, now…☆3,069Updated this week
- A minimal Python library for Apache Arrow, connecting to the Rust Arrow crate☆264Jun 15, 2026Updated 2 weeks ago
- Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.☆3,100Updated this week
- High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale☆5,587Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆333Mar 28, 2023Updated 3 years ago
- C++20 idiomatic APIs for the Apache Arrow Columnar Format☆142Jun 17, 2026Updated 2 weeks ago
- Multithreaded 64-bit C++ files for processing Numba arrays☆19Apr 23, 2024Updated 2 years ago
- Go library for decoding generic map values and native Go structures into Arrow.☆18Jan 30, 2026Updated 5 months ago
- Y-based Jupyter widgets for Python☆14Jun 20, 2026Updated 2 weeks ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆118Jul 27, 2025Updated 11 months ago
- The open-source Useful SDK. One python decorator in the Useful library allows for full observability of Python functions within an ETL.☆19Jan 11, 2024Updated 2 years ago
- Pushdown compute from Snowflake to DuckDB running on your infrastructure☆205Oct 20, 2025Updated 8 months ago
- Convert from protobuf to arrow and back☆41May 8, 2026Updated last month
- ☆117May 5, 2026Updated 2 months ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆154Jan 26, 2026Updated 5 months ago
- the portable Python dataframe library☆6,585Updated this week
- Slipstream provides a data-flow model to simplify development of stateful streaming applications.☆39Feb 19, 2026Updated 4 months ago
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆279Sep 25, 2024Updated last year
- 🏃‍♀️ Minimalist SQL orchestrator☆327Updated this week
- ☆69May 9, 2025Updated last year
- Apache DataFusion Python Bindings☆593Updated this week
- DuckDB for streaming data☆776Sep 4, 2025Updated 10 months ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆96Feb 22, 2025Updated last year
- Typed, annotated vectors for well-documented datasets☆12Apr 15, 2026Updated 2 months ago
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆279Apr 17, 2026Updated 2 months ago
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,727Jun 27, 2026Updated last week
- Manage Multimodal Agentic Context Lifecycle with Lance☆72Jun 28, 2026Updated last week
- The universal metrics layer. Compatible with 15+ formats: Cube, MetricFlow, LookML, Omni, BSL, LDM, Cortex, Malloy, OSI, SML, TML, Hex, R…☆105Updated this week
- GlareDB: A light and fast SQL database for analytics☆1,014Nov 14, 2025Updated 7 months ago
- Toy distributed PostgreSQL by implementing SQL over KV☆11Jan 14, 2026Updated 5 months ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,167May 19, 2026Updated last month
- ☆194May 21, 2025Updated last year