Delta Lake Examples
☆11Apr 24, 2020Updated 5 years ago
Alternatives and similar repositories for Spark_Delta_Lake
Users that are interested in Spark_Delta_Lake are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [student project] UI to run SQL on Delta Lake tables and visualize the variations of the result among tables versions☆12Apr 21, 2020Updated 5 years ago
- 数据处理平台☆16Feb 24, 2017Updated 9 years ago
- 蓝泰源大数据基础平台☆17Mar 7, 2018Updated 8 years ago
- ☆45Apr 27, 2020Updated 5 years ago
- Scripts to demonstrate VPC Service Controls between tenant and shared projects☆12Jun 11, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A curated list of awesome Databricks resources, including Spark☆22Jun 28, 2024Updated last year
- Mirror of Apache Beam☆10Jan 27, 2021Updated 5 years ago
- B19415 - The Definitive Guide to Data Integration☆11Apr 15, 2024Updated last year
- A little crawler/sdk for retrieve in real time information about transport in Paris☆18Oct 24, 2015Updated 10 years ago
- Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components☆10Oct 11, 2019Updated 6 years ago
- spark自学手册,包含了例如spark core、spark sql、spark streaming、spark-kafka、delta-lake,以及scala基础练习,还有一些例如master、shuffle源码分析,总结及翻译。☆18Jul 19, 2023Updated 2 years ago
- Demo for making use of RATP's real-time API☆13May 3, 2017Updated 8 years ago
- A project to design a fact and dimension star schema for optimizing queries on a flight booking database using PostgreSQL, a relational d…☆12Aug 15, 2021Updated 4 years ago
- Deploying a simple, customized Flask API in python via Google App Engine☆13Aug 20, 2017Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A modern API to get information from the RATP☆13Jul 12, 2023Updated 2 years ago
- ☆15Nov 30, 2023Updated 2 years ago
- HBase data access with SQL expressions and JDBC☆24Jan 29, 2011Updated 15 years ago
- Extract, Transform, Load (ETL) refers to a process in database usage and especially in data warehousing. This repository contains a s…☆21Mar 20, 2017Updated 9 years ago
- 基于PowerCenter的数据质量监控系统☆13Dec 27, 2017Updated 8 years ago
- Data quality tools for Big Data☆19Oct 10, 2019Updated 6 years ago
- Data Quality Monitoring Tool☆15Dec 5, 2017Updated 8 years ago
- ☆16Jan 20, 2019Updated 7 years ago
- A repo containing code for a modern Docker + Jenkins CI / CD System☆15Aug 17, 2019Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Spark混合推荐系统大数据监控平台☆11May 1, 2018Updated 7 years ago
- 新零售大数据平台-运 维监控平台的开发☆14Jan 14, 2019Updated 7 years ago
- 集中管理数据库备份平台☆29Jun 2, 2016Updated 9 years ago
- Version controlled immutable storage for Big Data☆11Apr 20, 2021Updated 4 years ago
- A media server configuration to run Plex, Sonarr, Radarr and Transmission in Docker☆11Mar 7, 2022Updated 4 years ago
- Hadoop-Unit is a project which allow testing projects which need hadoop ecosysteme like kafka, solr, hdfs, hive, hbase, ...☆52Oct 19, 2025Updated 5 months ago
- Resources used in the production of my "Managing Infrastructure With Terraform" course☆23Aug 12, 2020Updated 5 years ago
- This repository is for demonstrating the capability to do SQL-based UPDATES, DELETES, and INSERTS directly in the Data Lake using Amazon …☆18Aug 25, 2021Updated 4 years ago
- Data encoding library for Haskell.☆12Aug 4, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Data warehouse tech stack with PostgreSQL, DBT and Airflow☆20Dec 29, 2025Updated 3 months ago
- node.js开发的一套本地mock静态数据平台系统☆16Oct 20, 2017Updated 8 years ago
- 数据服务 —— 写个 SQL 即可发布成 API☆13Mar 8, 2021Updated 5 years ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆10Feb 10, 2023Updated 3 years ago
- ☆23Oct 11, 2019Updated 6 years ago
- Read Delta tables without any Spark☆47Mar 8, 2024Updated 2 years ago
- 在Docker容器中运行Hadoop大数据组件和机器学习平台☆11Apr 3, 2019Updated 7 years ago