data-dev / DataTracer
Data Lineage Tracing Library
☆22Updated 3 years ago
Alternatives and similar repositories for DataTracer
Users that are interested in DataTracer are comparing it to the libraries listed below
Sorting:
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis. It enables anyone inside an or…☆92Updated 2 years ago
- Beneath is a serverless real-time data platform ⚡️☆84Updated 3 years ago
- real-time data + ML pipeline☆54Updated last month
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- Python ELT Studio, an application for building ELT (and ETL) data flows.☆57Updated 3 years ago
- ☆70Updated 2 months ago
- ☆30Updated 3 years ago
- Open-source metadata collector based on ODD Specification☆43Updated last year
- SQLAlchemy for Dremio via the ODBC and Flight interface.☆30Updated 9 months ago
- This is where to start the data transformation with dbt and PostgreSQL☆8Updated 3 years ago
- Data Catalog for Databases and Data Warehouses☆34Updated last year
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis.☆101Updated last week
- A Python-to-SQL transpiler as replacement for Python Pandas☆48Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- A curated list of example code to collect data from Web APIs using DataPrep.Connector.☆34Updated 2 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆125Updated 3 years ago
- Record matching and entity resolution at scale in Spark☆34Updated last year
- Data pipelines from re-usable components☆108Updated 2 years ago
- ☆22Updated 2 months ago
- Demos of Materialize, the operational data warehouse.☆51Updated 2 months ago
- Transporter for integrating OpenLineage with OpenMetadata☆13Updated this week
- Make simple storing test results and visualisation of these in a BI dashboard☆44Updated last month
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.☆79Updated this week
- Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from…☆35Updated 2 years ago
- Apache Arrow Flight example☆11Updated 4 years ago
- ☆12Updated 3 years ago
- Unity Catalog UI☆40Updated 8 months ago