datapractices / data-practices-siteLinks
Datapractices site
☆34Updated 10 months ago
Alternatives and similar repositories for data-practices-site
Users that are interested in data-practices-site are comparing it to the libraries listed below
Sorting:
- Data Mesh Architecture☆84Updated 3 months ago
- A Singer tap for extracting data from the GitHub API☆75Updated 2 weeks ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆80Updated this week
- Egeria's Guidance on Governance as well as large media files such as presentations and movies☆107Updated 3 years ago
- Repo for the Stitch Docs☆58Updated last week
- Awesome list of dataops products, open source and resources☆24Updated 3 years ago
- The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and eg…☆32Updated 8 months ago
- Data abstraction, storage, discovery, and serving system☆35Updated last week
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆144Updated last year
- ☆42Updated 5 years ago
- A Github API client to extract events and actions, and load into a database☆28Updated 4 years ago
- Generating Realistic Synthetic Data☆41Updated last year
- Events about the open source data stack☆13Updated 3 years ago
- ODD Specification is a universal open standard for collecting metadata.☆146Updated last year
- 🌄 Open Source AI & Data Landscape - provides overview of top tier projects in the open source AI and Data ecosystem, shows projects th…☆376Updated this week
- Enterprise Information Service☆216Updated this week
- This is a Vs Code extension for Apache Airflow☆41Updated this week
- Sample configuration to deploy a modern data platform.☆89Updated 4 years ago
- Aiven "getting started" code examples☆40Updated last week
- Legend Studio☆109Updated last week
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆160Updated 3 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆39Updated 3 years ago
- Documentation and implementation of telemetry ingestion on Google Cloud Platform☆86Updated this week
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.☆81Updated 2 weeks ago
- The Data Product Descriptor Specification (DPDS) Repository☆83Updated last year
- Data pipelines from re-usable components☆107Updated 2 months ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆127Updated 4 years ago
- An open specification for data products in Data Mesh☆63Updated 4 months ago
- Fivetran data models for QuickBooks using dbt.☆33Updated last week
- ☆100Updated 2 years ago