mongodb-labs / mongo-arrow
MongoDB integrations for Apache Arrow. Export MongoDB documents to NumPy arrays, Parquet files, and pandas DataFrames in one line of code.
☆113 Updated last week
Alternatives and similar repositories for mongo-arrow
Users who are interested in mongo-arrow are comparing it to the libraries listed below.
- This library can convert a pydantic class to an Avro schema or generate Python code from an Avro schema. ☆82 Updated 3 months ago
- A data modelling layer built on top of polars and pydantic ☆197 Updated 2 years ago
- SQLAlchemy driver for DuckDB ☆482 Updated this week
- Coming soon ☆62 Updated 2 years ago
- Write your dbt models using Ibis ☆74 Updated 10 months ago
- Python bindings and Arrow integration for the Rust object_store crate. ☆64 Updated last year
- Python bindings for sqlparser-rs ☆200 Updated 8 months ago
- Stream Arrow data into Postgres ☆276 Updated 2 weeks ago
- A fast PostgreSQL Database Client Library for Python/asyncio. ☆46 Updated last year
- A repository of runnable examples using ibis ☆46 Updated last year
- A fast Excel reader for Rust and Python ☆210 Updated this week
- 🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives. ☆110 Updated this week
- A high-performance data streaming system using DuckDB and Apache Arrow Flight. ☆95 Updated 10 months ago
- Apache DataFusion Python Bindings ☆550 Updated last week
- Arrow, pydantic style ☆85 Updated 3 years ago
- A library to convert a pydantic model to a pyarrow schema ☆47 Updated 8 months ago
- Dask integration for Snowflake ☆30 Updated 5 months ago
- Easy and flexible data contracts ☆171 Updated last week
- Turn Pydantic defined Data Models into CLI Tools ☆156 Updated 3 months ago
- Library/micro framework to create streaming applications with Kafka in a fast way ☆42 Updated last week
- Polars plugin for stable hashing functionality ☆84 Updated last week
- Unified Distributed Execution ☆57 Updated last year
- Read Apache Arrow batches from ODBC data sources in Python ☆73 Updated last week
- asyncio bridge to the duckdb library ☆47 Updated 2 years ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning ☆45 Updated this week
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure, and more. ☆145 Updated 3 months ago
- Polars plugin offering eXtra stuff for DateTimes ☆228 Updated last month
- Prefect integrations for working with Docker ☆42 Updated last year
- A lightweight, comprehensive solution for managing Delta tables, built on polars and deltalake ☆121 Updated last year
- Python binding for DataFusion ☆59 Updated 3 years ago