Delta Lake Examples
☆11Apr 24, 2020Updated 6 years ago
Alternatives and similar repositories for Spark_Delta_Lake
Users that are interested in Spark_Delta_Lake are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [student project] UI to run SQL on Delta Lake tables and visualize the variations of the result among tables versions☆12Apr 21, 2020Updated 6 years ago
- 蓝泰源大数据基础平台☆17Mar 7, 2018Updated 8 years ago
- Scripts to demonstrate VPC Service Controls between tenant and shared projects☆12Jun 11, 2019Updated 6 years ago
- A curated list of awesome Databricks resources, including Spark☆22Jun 28, 2024Updated last year
- A little crawler/sdk for retrieve in real time information about transport in Paris☆18Oct 24, 2015Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components☆10Oct 11, 2019Updated 6 years ago
- Demo for making use of RATP's real-time API☆13May 3, 2017Updated 9 years ago
- A project to design a fact and dimension star schema for optimizing queries on a flight booking database using PostgreSQL, a relational d…☆12Aug 15, 2021Updated 4 years ago
- Deploying a simple, customized Flask API in python via Google App Engine☆13Aug 20, 2017Updated 8 years ago
- HBase data access with SQL expressions and JDBC☆23Jan 29, 2011Updated 15 years ago
- Extract, Transform, Load (ETL) refers to a process in database usage and especially in data warehousing. This repository contains a s…☆21Mar 20, 2017Updated 9 years ago
- 基于PowerCenter的数据质量监控系统☆13Dec 27, 2017Updated 8 years ago
- ☆16Jan 20, 2019Updated 7 years ago
- 新零售大数据平台-运维监控平台的开发☆14Jan 14, 2019Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Version controlled immutable storage for Big Data☆11Apr 20, 2021Updated 5 years ago
- Resources used in the production of my "Managing Infrastructure With Terraform" course☆23Aug 12, 2020Updated 5 years ago
- Data encoding library for Haskell.☆12Aug 4, 2023Updated 2 years ago
- Data warehouse tech stack with PostgreSQL, DBT and Airflow☆20Dec 29, 2025Updated 4 months ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆10Feb 10, 2023Updated 3 years ago
- node.js开发的一套本地mock静态数据平台系统☆16Oct 20, 2017Updated 8 years ago
- Read Delta tables without any Spark☆47Mar 8, 2024Updated 2 years ago
- 在Docker容器中运行Hadoop大数据组件和机器学习平台☆11Apr 3, 2019Updated 7 years ago
- RATP SDK - Retrieve schedules for any given RER (train), Metro, or Tramway stop in real time☆23Oct 23, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Terraform script for launching multiple EMR clusters for training purposes.☆16Oct 30, 2025Updated 6 months ago
- yet another databus that transfer/transform pipeline data between plugins☆29Jun 29, 2017Updated 8 years ago
- 利用SpringBoot整合HBase,基于HBaseJavaAPI的二次封装,可以直接引用jar包使用,目前测试已支持HBase1.1.2和HBase1.4.6以及HBase2.0.2三个版本。☆15Sep 25, 2024Updated last year
- Example static schema registry for Iglu☆15Jun 21, 2023Updated 2 years ago
- A Singer.io tap for extracting data from the AppsFlyer API☆11Updated this week
- Use maven-assembly-plugin to package a spring boot project into a non-fat jar☆10Jul 24, 2017Updated 8 years ago
- Hi Spring fans! In this installment of Spring Tips we look at the new WebMvc.fn programming model now available for Spring MVC users☆11Mar 30, 2019Updated 7 years ago
- springboot项目使用脚手架,集成redis、mysql、pg,hbase、elasticsearch、kafka等常用组件功能☆20Jun 20, 2022Updated 3 years ago
- Hands-on Learning with KubeFlow + Keras/TensorFlow 2.0 + TF Extended (TFX) + Kubernetes + Airflow + Jupyter☆11Oct 28, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Simple ToDo app using window.ipfs☆17May 1, 2025Updated last year
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆18Jun 28, 2021Updated 4 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆74Mar 14, 2021Updated 5 years ago
- An ETL tool for converting untyped CSV to parquet. Also triggers data lake updates.☆15Oct 29, 2021Updated 4 years ago
- API to store Spring Batch (version 3+) job execution data in MongoDB☆12Jun 25, 2022Updated 3 years ago
- "C" APIs for HBase☆11Dec 17, 2014Updated 11 years ago
- Merge Dirty Data with Clean Reference Tables☆35Aug 3, 2021Updated 4 years ago