datapractices / data-practices-siteLinks
Datapractices site
☆34Updated 3 months ago
Alternatives and similar repositories for data-practices-site
Users that are interested in data-practices-site are comparing it to the libraries listed below
Sorting:
- Events about the open source data stack☆13Updated 3 years ago
- Batteries included toolkit for data engineering.☆34Updated 6 months ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆33Updated 3 years ago
- A Github API client to extract events and actions, and load into a database☆28Updated 3 years ago
- Data abstraction, storage, discovery, and serving system☆32Updated 3 months ago
- The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and eg…☆33Updated last month
- Generating Realistic Synthetic Data☆39Updated last year
- Awesome list of dataops products, open source and resources☆24Updated 3 years ago
- Intended for internal use: deploys all infrastructure required for Astronomer to run on GCP☆10Updated 2 months ago
- A Singer tap for extracting data from the GitHub API☆74Updated 2 weeks ago
- Demos of Materialize, the operational data warehouse.☆51Updated 4 months ago
- Hands-on workshop with Iceberg, Redpanda, Debezium and Kafka-Connect☆13Updated 9 months ago
- Make Metabase More Awesome☆15Updated 11 months ago
- A Flink applcation that demonstrates reading and writing to/from Apache Kafka with Apache Flink☆20Updated last year
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- Egeria's Guidance on Governance as well as large media files such as presentations and movies☆104Updated 2 years ago
- Curated catalog of Apache OpenWhisk packages to interface with event producers and consumers☆34Updated 9 months ago
- Data Mesh Architecture☆79Updated last year
- Repo for the Stitch Docs☆57Updated 2 weeks ago
- Library and HTTP Service for validating JSONSchema events and producing them (to Kafka or elsewhere)☆22Updated 11 months ago
- Bytewax Helm charts repository☆12Updated last year
- This repository contains code to build an MVP search engine with google like interface.☆15Updated last month
- This project is created to promote and advocate the use of FOSS machine learning.☆46Updated 2 months ago
- ☆42Updated 5 years ago
- Apache NiFi Custom Processor Extracting Text From Files with Apache Tika☆35Updated last year
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆75Updated last week
- ☆20Updated 2 years ago
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- A collection of talks and workshops provided by OAI members☆38Updated 3 years ago