Extract, Transform, Load (ETL) is a process used with databases, and especially in data warehousing. This repository contains a starter kit of ETL-related work.
☆21 · Mar 20, 2017 · Updated 9 years ago
Alternatives and similar repositories for ETL-Starter-Kit
Users that are interested in ETL-Starter-Kit are comparing it to the libraries listed below.
- Data Brewery is an ETL (Extract-Transform-Load) program that connects to many data sources (cloud services, databases, ...) and manages dat… ☆16 · Jan 21, 2021 · Updated 5 years ago
- Big data smart alarm by SQL ☆12 · May 11, 2021 · Updated 4 years ago
- ☆11 · Nov 29, 2020 · Updated 5 years ago
- Implementation of java.time for Scala.js and Scala Native ☆16 · Updated this week
- Native Rust implementation of the Kafka protocol and API ☆14 · Jun 13, 2023 · Updated 2 years ago
- A big data computing platform driven by Flink SQL. In particular, it provides SQL programming. ☆21 · Jan 5, 2023 · Updated 3 years ago
- A GameBoy Emulator written in Rust, written as a learning project for both ☆10 · Jun 6, 2023 · Updated 2 years ago
- Scripts used to set up a Spark cluster on EC2 ☆21 · Mar 24, 2016 · Updated 10 years ago
- A versioned database inspired by Git ☆16 · Dec 16, 2017 · Updated 8 years ago
- A research and review of techniques to provide a natural language interface to RDMS. ☆10 · Dec 8, 2017 · Updated 8 years ago
- Companion project to my "Akka and JDBC to Services" blog post. ☆16 · Sep 29, 2016 · Updated 9 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code. ☆19 · Mar 9, 2017 · Updated 9 years ago
- Space Invaders-style game made with HTML Canvas and JavaScript! ☆16 · Sep 21, 2024 · Updated last year
- Generate big TPC-DS datasets with Databricks ☆21 · Jan 3, 2022 · Updated 4 years ago
- A Spark datasource for the HadoopCryptoLedger library ☆13 · Sep 29, 2025 · Updated 5 months ago
- Training ground for experiments with Akka framework. ☆11 · Jun 12, 2017 · Updated 8 years ago
- Simple command line application to read/write message to kafka topic using protobuf ☆14 · Mar 27, 2023 · Updated 3 years ago
- Data engineering interviews Q&A for data community by data community ☆66 · Jun 7, 2020 · Updated 5 years ago
- Example static schema registry for Iglu ☆15 · Jun 21, 2023 · Updated 2 years ago
- ☆11 · Sep 23, 2019 · Updated 6 years ago
- demo clients ☆20 · Jul 31, 2017 · Updated 8 years ago
- 👑 Fully on-chain auto-battler game owned by the community ☆18 · Jun 28, 2024 · Updated last year
- Common components used across the datamountaineer kafka connect connectors ☆21 · Feb 12, 2021 · Updated 5 years ago
- Algorithmic Trading Pipeline for Online Betting Markets ☆19 · Dec 7, 2022 · Updated 3 years ago
- "C" APIs for HBase ☆11 · Dec 17, 2014 · Updated 11 years ago
- Automatically exported from code.google.com/p/scrobblemapper ☆10 · May 16, 2016 · Updated 9 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark. ☆16 · Mar 16, 2016 · Updated 10 years ago
- Atomic Scala Book Solutions - for Beginners and first time Functional Programmers ☆12 · Mar 10, 2020 · Updated 6 years ago
- Bootstrap Themeroller is an application that lets you customize the look and feel of Twitter's Bootstrap. It also provides a real time pr… ☆58 · Aug 23, 2013 · Updated 12 years ago
- Genie Framework improves Spark Pool utilization by executing multiple Synapse notebooks on the same spark pool instance ☆28 · Dec 19, 2023 · Updated 2 years ago
- A set of tools that make working with the Scala ecosystem even better. ☆12 · Mar 16, 2026 · Updated last week
- The MessyBrainz project ☆11 · May 1, 2020 · Updated 5 years ago
- On-demand port forwarding to k8s. ☆23 · Feb 7, 2026 · Updated last month
- This is a simple Python daemon to monitor your Impala nodes. ☆10 · Apr 13, 2021 · Updated 4 years ago
- Docker image of webvirtmgr (https://hub.docker.com/r/odivlad/webvirtmgr/) ☆11 · Jan 11, 2018 · Updated 8 years ago
- Repository that showcases problems with Kafka rebalancing and explains how to fix them. Please visit our blog article to learn what Kafka… ☆12 · Aug 21, 2020 · Updated 5 years ago
- A project to develop a fully distributed MapReduce library for Haskell which makes using the MapReduce framework totally transparent for … ☆20 · Nov 12, 2011 · Updated 14 years ago
- The database manager for Apache Hive ☆21 · Jan 5, 2018 · Updated 8 years ago
- A zero-config OpenAI client with support for 20+ providers, API key rotation, rate limits, optional LangChain integration and more. ☆19 · Dec 11, 2025 · Updated 3 months ago