mongodb-labs / mongo-arrow
MongoDB integrations for Apache Arrow. Export MongoDB documents to NumPy arrays, Parquet files, and pandas DataFrames in one line of code.
☆108 Updated this week
Alternatives and similar repositories for mongo-arrow
Users who are interested in mongo-arrow are comparing it to the libraries listed below.
- A data modelling layer built on top of polars and pydantic ☆196 Updated last year
- This library can convert a pydantic class to an Avro schema or generate Python code from an Avro schema. ☆77 Updated last month
- Coming soon ☆61 Updated last year
- 🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives. ☆105 Updated this week
- SQLAlchemy driver for DuckDB ☆435 Updated last week
- Write your dbt models using Ibis ☆68 Updated 4 months ago
- Python binding for DataFusion ☆59 Updated 2 years ago
- Python bindings for sqlparser-rs ☆191 Updated 2 months ago
- Arrow, pydantic style ☆84 Updated 2 years ago
- A fast PostgreSQL Database Client Library for Python/asyncio. ☆45 Updated 10 months ago
- A Python wrapper around calamine ☆161 Updated 3 weeks ago
- Apache DataFusion Python Bindings ☆469 Updated last week
- Read Apache Arrow batches from ODBC data sources in Python ☆65 Updated 2 weeks ago
- Python extensions for PRQL ☆99 Updated this week
- The Prefect API and backend ☆243 Updated last year
- Python bindings and arrow integration for the rust object_store crate. ☆63 Updated 11 months ago
- Distributed SQL Engine in Python using Dask ☆406 Updated 10 months ago
- Easy and flexible data contracts ☆152 Updated 3 weeks ago
- Turn Pydantic defined Data Models into CLI Tools ☆154 Updated 2 months ago
- Ibis Substrait Compiler ☆103 Updated this week
- RFC document, tooling and other content related to the dataframe API standard ☆110 Updated last year
- A repository of runnable examples using ibis ☆44 Updated last year
- Lightning fast OLAP-style point queries on Pandas DataFrames. ☆119 Updated 7 months ago
- Distributed persistent Task Queue running on Dask ☆38 Updated 2 years ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight. ☆83 Updated 4 months ago
- Unified Distributed Execution ☆55 Updated 8 months ago
- easy install parquet-tools ☆180 Updated last year
- A Jupyter server based on FastAPI ☆270 Updated this week
- Python package for executing Malloy ☆31 Updated 5 months ago
- A library to convert a pydantic model to a pyarrow schema ☆38 Updated 2 months ago