High Performance Data Processing in Python
☆370Jun 12, 2026Updated this week
Alternatives and similar repositories for Bodo
Users that are interested in Bodo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An open-source, community-driven REST catalog for Apache Iceberg!☆30Jun 26, 2024Updated last year
- Simple Workflow Framework based on Hamilton☆24May 2, 2026Updated last month
- Executable memory system for tabular data work☆519Updated this week
- Apache Arrow PostgreSQL connector☆64Feb 12, 2024Updated 2 years ago
- Lightweight and extensible compatibility layer between dataframe libraries!☆1,638Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆25Sep 8, 2023Updated 2 years ago
- Python stream processing for analytics☆41May 19, 2026Updated 3 weeks ago
- An extensible, state-of-the-art framework for columnar compression, and the fastest FOSS columnar file format. Formerly at @spiraldb, now…☆3,005Updated this week
- A minimal Python library for Apache Arrow, connecting to the Rust Arrow crate☆261Updated this week
- Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.☆2,929Updated this week
- High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale☆5,556Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆333Mar 28, 2023Updated 3 years ago
- C++20 idiomatic APIs for the Apache Arrow Columnar Format☆142May 29, 2026Updated 2 weeks ago
- Multihreaded 64 bit c++ files for processing numba arrays☆19Apr 23, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Go library for decoding generic map values and native Go structures into Arrow.☆17Jan 30, 2026Updated 4 months ago
- Y-based Jupyter widgets for Python☆14May 6, 2026Updated last month
- IbisML is a library for building scalable ML pipelines using Ibis.☆118Jul 27, 2025Updated 10 months ago
- The open-source Useful SDK. One python decorator in the Useful library allows for full observability of Python functions within an ETL.☆19Jan 11, 2024Updated 2 years ago
- Pushdown compute from Snowflake to DuckDB running on your infrastructure☆205Oct 20, 2025Updated 7 months ago
- Convert from protobuf to arrow and back☆40May 8, 2026Updated last month
- ☆116May 5, 2026Updated last month
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆152Jan 26, 2026Updated 4 months ago
- the portable Python dataframe library☆6,573Updated this week
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Slipstream provides a data-flow model to simplify development of stateful streaming applications.☆39Feb 19, 2026Updated 3 months ago
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆279Sep 25, 2024Updated last year
- 🏃♀️ Minimalist SQL orchestrator☆325Updated this week
- ☆68May 9, 2025Updated last year
- Apache DataFusion Python Bindings☆587Jun 7, 2026Updated last week
- DuckDB for streaming data☆777Sep 4, 2025Updated 9 months ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆96Feb 22, 2025Updated last year
- Typed, annotated vectors for well-documented datasets☆11Apr 15, 2026Updated last month
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆276Apr 17, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,609Updated this week
- Manage Multimodal Agentic Context Lifecycle with Lance☆71Updated this week
- The universal metrics layer. Compatible with 15+ formats: Cube, MetricFlow, LookML, Omni, BSL, LDM, Cortex, Malloy, OSI, SML, TML, Hex, R…☆96Updated this week
- Is the GIL seeing someone else? How's about repetitively calling and seeing how long it takes to answer?☆16Jan 7, 2026Updated 5 months ago
- GlareDB: A light and fast SQL database for analytics☆1,012Nov 14, 2025Updated 7 months ago
- Toy distributed PostgreSQL by implementing SQL over KV☆11Jan 14, 2026Updated 5 months ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,165May 19, 2026Updated 3 weeks ago