High Performance Data Processing in Python
☆364Apr 30, 2026Updated this week
Alternatives and similar repositories for Bodo
Users who are interested in Bodo are comparing it to the libraries listed below.
- An open-source, community-driven REST catalog for Apache Iceberg!☆30Jun 26, 2024Updated last year
- Simple Workflow Framework based on Hamilton☆24Apr 16, 2026Updated 2 weeks ago
- Composable expressions for data☆507Updated this week
- Apache Arrow PostgreSQL connector☆63Feb 12, 2024Updated 2 years ago
- Lightweight and extensible compatibility layer between dataframe libraries!☆1,601Updated this week
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆25Sep 8, 2023Updated 2 years ago
- Python stream processing for analytics☆41Apr 14, 2026Updated 3 weeks ago
- A minimal Python library for Apache Arrow, connecting to the Rust Arrow crate☆260Mar 24, 2026Updated last month
- A drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.☆2,144Updated this week
- An extensible, state-of-the-art framework for columnar compression, and the fastest FOSS columnar file format. Formerly at @spiraldb, now…☆2,898Updated this week
- High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale☆5,446Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆333Mar 28, 2023Updated 3 years ago
- Multithreaded 64-bit C++ files for processing numba arrays☆18Apr 23, 2024Updated 2 years ago
- C++20 idiomatic APIs for the Apache Arrow Columnar Format☆140Apr 28, 2026Updated last week
- Go library for decoding generic map values and native Go structures into Arrow.☆17Jan 30, 2026Updated 3 months ago
- Y-based Jupyter widgets for Python☆14Apr 13, 2026Updated 3 weeks ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆119Jul 27, 2025Updated 9 months ago
- The open-source Useful SDK. One Python decorator in the Useful library allows for full observability of Python functions within an ETL.☆19Jan 11, 2024Updated 2 years ago
- Pushdown compute from Snowflake to DuckDB running on your infrastructure☆202Oct 20, 2025Updated 6 months ago
- Convert from protobuf to arrow and back☆39Updated this week
- ☆111Jan 27, 2026Updated 3 months ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆152Jan 26, 2026Updated 3 months ago
- the portable Python dataframe library☆6,521Updated this week
- Slipstream provides a data-flow model to simplify development of stateful streaming applications.☆39Feb 19, 2026Updated 2 months ago
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆279Sep 25, 2024Updated last year
- 🏃♀️ Minimalist SQL orchestrator☆321Apr 28, 2026Updated last week
- Apache DataFusion Python Bindings☆580Apr 25, 2026Updated last week
- ☆67May 9, 2025Updated 11 months ago
- DuckDB for streaming data☆775Sep 4, 2025Updated 8 months ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆96Feb 22, 2025Updated last year
- Typed, annotated vectors for well-documented datasets☆11Apr 15, 2026Updated 2 weeks ago
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,370Updated this week
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆275Apr 17, 2026Updated 2 weeks ago
- Manage Multimodal Agentic Context Lifecycle with Lance☆66Apr 22, 2026Updated last week
- The universal metrics layer. Compatible with 15+ formats: Cube, MetricFlow, LookML, Omni, BSL, LDM, Cortex, Malloy, OSI, SML, TML, Hex, R…☆87Apr 23, 2026Updated last week
- Is the GIL seeing someone else? How's about repetitively calling and seeing how long it takes to answer?☆16Jan 7, 2026Updated 3 months ago
- GlareDB: A light and fast SQL database for analytics☆1,012Nov 14, 2025Updated 5 months ago
- Toy distributed PostgreSQL by implementing SQL over KV☆11Jan 14, 2026Updated 3 months ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,155Updated this week