beneath-hq / beneath
Beneath is a serverless real-time data platform ⚡️
☆84 Updated 3 years ago
Alternatives and similar repositories for beneath:
Users who are interested in beneath are comparing it to the libraries listed below
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flyte's memoization s… ☆54 Updated last year
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆123 Updated 3 years ago
- Data pipelines from re-usable components ☆108 Updated 2 years ago
- ☆30 Updated 3 years ago
- Data Catalog for Databases and Data Warehouses ☆34 Updated last year
- Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from… ☆35 Updated 2 years ago
- Airbyte is the go-sdk/cdk to help build connectors quickly in Go. This package abstracts much of the "protocol" away from the user a… ☆38 Updated last year
- Demos of Materialize, the operational data warehouse. ☆52 Updated last month
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow. ☆45 Updated last month
- Delta reader for the Ray open-source toolkit for building ML applications ☆45 Updated last year
- 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex) ☆141 Updated last year
- Dremio Flight connector. Access Dremio using Arrow Flight ☆40 Updated 4 years ago
- BoilingData JS client (NodeJS and Browsers) ☆19 Updated 6 months ago
- The sane way of building a data layer in Airflow ☆24 Updated 5 years ago
- Batteries-included toolkit for data engineering. ☆34 Updated 3 months ago
- Distributed task queue based on Dask ☆38 Updated last year
- Events about the open source data stack ☆13 Updated 3 years ago
- A collection of Python utility functions ☆11 Updated 9 months ago
- Magniv Core - A Python-decorator based job orchestration platform. Avoid responsibility handoffs by abstracting infra and DevOps. ☆78 Updated 9 months ago
- Create elegant data pipelines and deploy to AWS Lambda or Airflow ☆31 Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆113 Updated last year
- A playground for running DuckDB as a stateless query engine over a data lake ☆195 Updated last year
- chDB AWS Lambda container ☆16 Updated last year
- Using the Parquet file format with Python ☆15 Updated last year
- Assessing whether data from a database complies with reference information. ☆42 Updated last week
- Lightweight configuration and access to multiple databases in a single project ☆38 Updated last year
- Python binding for DataFusion ☆59 Updated 2 years ago
- Serverless multi-protocol + multi-destination event collection system. ☆202 Updated 4 months ago
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB ☆33 Updated 2 years ago
- Ibis analytics, with Ibis (and more!) ☆21 Updated 6 months ago