datapractices / data-practices-siteLinks
Datapractices site
☆34Updated 7 months ago
Alternatives and similar repositories for data-practices-site
Users that are interested in data-practices-site are comparing it to the libraries listed below
Sorting:
- Data Mesh Architecture☆82Updated last month
- Egeria's Guidance on Governance as well as large media files such as presentations and movies☆106Updated 3 years ago
- The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and eg…☆32Updated 5 months ago
- A Singer tap for extracting data from the GitHub API☆74Updated 2 weeks ago
- Generating Realistic Synthetic Data☆40Updated last year
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆77Updated this week
- ODD Specification is a universal open standard for collecting metadata.☆144Updated last year
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆145Updated last year
- Repo for the Stitch Docs☆58Updated this week
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆37Updated 3 years ago
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Events about the open source data stack☆13Updated 3 years ago
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆159Updated 2 years ago
- Use SQL to build ELT pipelines on a data lakehouse.☆288Updated 3 years ago
- The Data Product Descriptor Specification (DPDS) Repository☆81Updated 10 months ago
- ☆97Updated 2 years ago
- Awesome list of dataops products, open source and resources☆24Updated 3 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆125Updated 4 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago
- Data abstraction, storage, discovery, and serving system☆33Updated last month
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- Auto-generated Diagrams from Airflow DAGs. 🔮 🪄☆352Updated last week
- Data pipelines from re-usable components☆107Updated last week
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.☆81Updated last week
- The DataHelix generator allows you to quickly create data, based on a JSON profile that defines fields and the relationships between them…☆144Updated 2 years ago
- Fivetran data models for QuickBooks using dbt.☆32Updated last week
- Metadata tracking and UI service for Metaflow!☆215Updated 6 months ago
- Enterprise Information Service☆211Updated last week
- The metrics layer for your data. Join us at https://metriql.com/slack☆319Updated 2 years ago
- Batteries included toolkit for data engineering.☆36Updated 10 months ago