datapractices / data-practices-siteLinks
Datapractices site
☆34Updated 8 months ago
Alternatives and similar repositories for data-practices-site
Users that are interested in data-practices-site are comparing it to the libraries listed below
Sorting:
- Data Mesh Architecture☆84Updated last month
- Generating Realistic Synthetic Data☆41Updated last year
- Egeria's Guidance on Governance as well as large media files such as presentations and movies☆106Updated 3 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆38Updated 3 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆77Updated last week
- A Github API client to extract events and actions, and load into a database☆28Updated 4 years ago
- ODD Specification is a universal open standard for collecting metadata.☆145Updated last year
- Data abstraction, storage, discovery, and serving system☆33Updated 2 months ago
- Awesome list of dataops products, open source and resources☆24Updated 3 years ago
- Use SQL to build ELT pipelines on a data lakehouse.☆288Updated 3 years ago
- The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and eg…☆32Updated 6 months ago
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆145Updated last year
- Metadata Driven Development (m3d) is a cloud and platform agnostic framework for the automated creation, management and governance of dat…☆33Updated 2 years ago
- A Singer tap for extracting data from the GitHub API☆75Updated this week
- TypeDB Driver Example Projects and Tutorials☆86Updated last month
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.☆81Updated last week
- Use this repo to get to know about other repos and the overall organization of the MOSIP structure☆81Updated last year
- The DataHelix generator allows you to quickly create data, based on a JSON profile that defines fields and the relationships between them…☆143Updated 2 years ago
- Data pipelines from re-usable components☆107Updated 3 weeks ago
- ☆42Updated 5 years ago
- ☆98Updated 2 years ago
- Repo for the Stitch Docs☆58Updated last week
- Events about the open source data stack☆13Updated 3 years ago
- An open specification for data products in Data Mesh☆63Updated 2 months ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆125Updated 4 years ago
- dbt adapter for connecting to MindsDB☆19Updated last year
- Auto-generated Diagrams from Airflow DAGs. 🔮 🪄☆354Updated last week
- 🌄 Open Source AI & Data Landscape - provides overview of top tier projects in the open source AI and Data ecosystem, shows projects th…☆371Updated this week
- Sample configuration to deploy a modern data platform.☆89Updated 3 years ago
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆159Updated 2 years ago