Data Brewery is an ETL (Extract-Transform-Load) program that connect to many data sources (cloud services, databases, ...) and manage data warehouse workflow.
☆16Jan 21, 2021Updated 5 years ago
Alternatives and similar repositories for data-brewery
Users that are interested in data-brewery are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Building Json data pipeline within Snowflake using Streams and Tasks☆26Nov 15, 2019Updated 6 years ago
- Extract, Transform, Load (ETL) refers to a process in database usage and especially in data warehousing. This repository contains a s…☆21Mar 20, 2017Updated 9 years ago
- DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics control framework that can be used to monitor, log, aud…☆29Feb 14, 2026Updated 3 months ago
- running apache spark with docker swarm☆34Feb 25, 2021Updated 5 years ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Sep 5, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- AWS Lambda function to get events in Kafka topic when files are uploaded to S3☆23Aug 16, 2018Updated 7 years ago
- Convert a username/group name to a uid/gid number☆18Oct 8, 2015Updated 10 years ago
- De-identify medical images with the help of Amazon Comprehend Medical and Rekognition.☆25Dec 10, 2020Updated 5 years ago
- A list of objects bound by prototype chain☆20Oct 25, 2025Updated 7 months ago
- ☆18Sep 14, 2019Updated 6 years ago
- Taking IMDBs database dumps and turning them into a multiple projects☆20Aug 31, 2019Updated 6 years ago
- export secrets to environment variables☆15Sep 23, 2022Updated 3 years ago
- ☆11Sep 23, 2019Updated 6 years ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆18Jun 28, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Extension to connect OpenPAI clusters, submit AI jobs, simulate jobs locally, manage files, and so on.☆15Dec 10, 2022Updated 3 years ago
- A cool simple example of functional data engineering☆35Mar 13, 2023Updated 3 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆91Apr 29, 2019Updated 7 years ago
- This is a library to assist in animating changes to the DOM, to use it☆11Apr 22, 2020Updated 6 years ago
- The FAQ for the Community Profile Report☆11Aug 30, 2022Updated 3 years ago
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR☆19Updated this week
- An implementation of GlowTTS designed to work with Gruut☆12Mar 9, 2022Updated 4 years ago
- Transform flat data structures into nested object graphs matching JSON schema definitions.☆28Aug 9, 2016Updated 9 years ago
- "C" APIs for HBase☆11Dec 17, 2014Updated 11 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A tool for translating Scala source code into readable and maintainable Java code☆13Jan 3, 2026Updated 4 months ago
- Connects Campaign Manager to the RTB4FREE bidders☆14Nov 16, 2022Updated 3 years ago
- a simple read-only sequence database, designed for short reads☆20Dec 19, 2016Updated 9 years ago
- ServiceFramework 示例项目☆10Apr 2, 2016Updated 10 years ago
- The Azure Integration Migrator Model repo contains the source and target modeling entities along with template configuration and Liquid r…☆10Jun 28, 2024Updated last year
- 3NF-normalize Yelp data on S3 with Spark and load it into Redshift - automate the whole thing with Apache Airflow☆13Aug 17, 2019Updated 6 years ago
- Ansible Role: oracle-database☆16Apr 28, 2016Updated 10 years ago
- Resources for teaching Python for Data Wrangling at NICAR 2016.☆12Mar 11, 2016Updated 10 years ago
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Jul 7, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Delta Lake Examples☆11Apr 24, 2020Updated 6 years ago
- Generative Art Experiments using Haskell, GHCJS, and Reflex (FRP)☆18Mar 16, 2019Updated 7 years ago
- Reagent interface to the Mafs interactive 2d math visualization library.☆15Jun 1, 2024Updated last year
- A project to develop a fully distributed MapReduce library for Haskell which makes using the MapReduce framework totally transparent for …☆20Nov 12, 2011Updated 14 years ago
- Manage Apache Atlas and Ranger configuration for your Hadoop environment.☆16May 4, 2021Updated 5 years ago
- Vagrant configuration for a base box for Wagtail site development☆18Apr 22, 2020Updated 6 years ago
- CHeSF is the Chrome Headless Scraping Framework, a very very alpha code to scrape javascript intensive web pages☆20Jan 26, 2018Updated 8 years ago