Workshop Big Data en Español
☆23Nov 9, 2023Updated 2 years ago
Alternatives and similar repositories for bigdata-workshop-es
Users that are interested in bigdata-workshop-es are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Easy Scheduler是一个分布式工作流任务调度系统,主要解决数据研发ETL错综复杂的依赖关系,而不能直观监控任务健康状态等问题。Easy Scheduler以DAG流式的方式将Task组装起来,可实时监控任务的运行状态,同时支持重试、从指定节点恢复失败、暂停及Kil…☆10Apr 9, 2019Updated 7 years ago
- My dotfiles.☆12Oct 10, 2025Updated 6 months ago
- Spark Operations Research☆12Sep 21, 2016Updated 9 years ago
- This is a GitHub for all of my NiFi Templates☆48Mar 25, 2026Updated 2 weeks ago
- Google FSI Accelerator Pattern☆13Jun 18, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A Scala SDK for interfacing with HashiCorp's Nomad☆19Nov 30, 2022Updated 3 years ago
- ScalaTest plugin for Scala IDE☆41Jul 4, 2020Updated 5 years ago
- real time log event processing using spark, kafka & cassandra☆13Dec 4, 2014Updated 11 years ago
- A table-type dbt materialization for Snowflake to enable Time Travel☆22Jan 12, 2026Updated 3 months ago
- ☆43Feb 28, 2024Updated 2 years ago
- Data encoding library for Haskell.☆12Aug 4, 2023Updated 2 years ago
- Presto & Alluxio Dockers for blazing fast analytics☆13Nov 6, 2019Updated 6 years ago
- Example static schema registry for Iglu☆15Jun 21, 2023Updated 2 years ago
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da…☆114Sep 21, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Managing machine learning life-cycle with MLflow tutorial☆23May 1, 2023Updated 2 years ago
- Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple …☆30Aug 26, 2020Updated 5 years ago
- ☆11Sep 23, 2019Updated 6 years ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆18Jun 28, 2021Updated 4 years ago
- Slides and Demo Script for SparkRSQL Presentation☆11Mar 17, 2015Updated 11 years ago
- OpenTelemetry Demo with Azure Databricks and Azure Monitor☆27Jun 6, 2024Updated last year
- An ETL tool for converting untyped CSV to parquet. Also triggers data lake updates.☆15Oct 29, 2021Updated 4 years ago
- Examples of use cases for GraalVM☆17May 6, 2025Updated 11 months ago
- Connects Campaign Manager to the RTB4FREE bidders☆14Nov 16, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Project defining the docker image that will support examples of algorithms created in this organization☆13Oct 22, 2017Updated 8 years ago
- 3NF-normalize Yelp data on S3 with Spark and load it into Redshift - automate the whole thing with Apache Airflow☆12Aug 17, 2019Updated 6 years ago
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Jul 7, 2021Updated 4 years ago
- Delta Lake Examples☆11Apr 24, 2020Updated 5 years ago
- Generative Art Experiments using Haskell, GHCJS, and Reflex (FRP)☆18Mar 16, 2019Updated 7 years ago
- Hadoop Data Integration with various databases, ftp servers, salesforce. Incremental update, dedup, append, merge your data on Hadoop.☆92Apr 11, 2013Updated 13 years ago
- SchemaRegistry bindings with Avro scheme to use with kafka-client☆16Aug 6, 2025Updated 8 months ago
- A project to develop a fully distributed MapReduce library for Haskell which makes using the MapReduce framework totally transparent for …☆20Nov 12, 2011Updated 14 years ago
- A simple example of how to integrate drools into an Apache Spark job☆29Apr 21, 2016Updated 9 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- SQS-based Python SDK for streaming data in realtime to the Panoply platform☆17Jun 22, 2025Updated 9 months ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16May 11, 2019Updated 6 years ago
- dbt-github-workflow is a boilerplate that contains all the necessary configurations to set up a simple CI/CD pipeline for your data model…☆17Mar 27, 2022Updated 4 years ago
- OpenRTB v2.5 and OpenRTB Dynamic Native Ads v1.2 types for rust.☆21Feb 2, 2023Updated 3 years ago
- Minikube for big data with Scala and Spark☆15Oct 28, 2019Updated 6 years ago
- Data Brewery is an ETL (Extract-Transform-Load) program that connect to many data sources (cloud services, databases, ...) and manage dat…☆16Jan 21, 2021Updated 5 years ago
- Statistical and exploratory Analysis of Cricket Data☆12Oct 19, 2015Updated 10 years ago