Spyrk-cluster is a data mini-lab, considering the main technologies used these days. It's useful to either understand how to configure a cluster, or just to take it for granted to use for testing with submit or interactive jobs.
☆29Apr 7, 2021Updated 4 years ago
Alternatives and similar repositories for spyrk-cluster
Users that are interested in spyrk-cluster are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A micro cluster lab to experiment Dask and Spark (Python and Scala) based on Docker☆16Mar 7, 2023Updated 3 years ago
- ☆12Jan 7, 2023Updated 3 years ago
- CMP314 Optimizing NLP models with Amazon EC2 Inf1 instances in Amazon Sagemaker☆14Dec 20, 2023Updated 2 years ago
- Repositório da Trilha de Databricks☆46Feb 19, 2026Updated last month
- Repo with scripts and automation to help ensure best practices in Google Data Catalog☆13Feb 12, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Hands-on LAB - Databricks SQL☆26Jul 16, 2024Updated last year
- Azkaban Auror core for flow creation☆11Oct 6, 2020Updated 5 years ago
- A docker using the airflow with Hadoop ecosystem (hive, spark, and sqoop)☆12May 2, 2021Updated 4 years ago
- Recriação da pagina inicial do Facebook☆10Mar 27, 2021Updated 5 years ago
- ☆17May 26, 2023Updated 2 years ago
- An end-to-end workflow for processing streaming data on Azure.☆17Sep 20, 2024Updated last year
- IBGE - Censo 2010 - Localização e respectivo Código de Setor Censitário☆10Apr 3, 2021Updated 4 years ago
- País, cidades e estados. Com código IBGE Brasil. Com migrations, models, seeder, routes, config e views.☆15Feb 25, 2018Updated 8 years ago
- A unit test framework for Databricks notebooks☆12Dec 8, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- This repo contains a plugin for feast to run an offline store on Spark☆13Nov 17, 2022Updated 3 years ago
- Big Data Ecosystem Docker☆80May 17, 2022Updated 3 years ago
- Get files from ckan into the webstore.☆22Jan 6, 2022Updated 4 years ago
- scripts for performing logistic regression and benford analysis in brazil's declared donnations☆11Jun 30, 2017Updated 8 years ago
- Files Course LaraChat☆13Jan 16, 2023Updated 3 years ago
- A demo repository for my ADF Dev Factory☆10Jul 5, 2023Updated 2 years ago
- CLI tool to query column-level lineage information from the Discovery API and check alignment to best practices☆21Dec 17, 2025Updated 3 months ago
- ☆10Nov 12, 2016Updated 9 years ago
- ☆32Aug 18, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Terraform code to deploy a SageMaker domain in VPC-only mode that supports multiple Studio and Canvas features☆20Aug 16, 2023Updated 2 years ago
- PHP Design Patterns☆16Jul 7, 2022Updated 3 years ago
- This extension provides Hive SQL language support for Visual Studio Code☆11Feb 22, 2022Updated 4 years ago
- AWS Batch Demo☆18Jul 31, 2018Updated 7 years ago
- ☆12Dec 7, 2022Updated 3 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆22Feb 3, 2026Updated last month
- ☆21Oct 21, 2024Updated last year
- ☆15Aug 4, 2022Updated 3 years ago
- Cliente para o WebService de consulta de preços e prazos dos correios.☆14Mar 4, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆18Aug 4, 2022Updated 3 years ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Apr 2, 2022Updated 3 years ago
- Sample auto deploy an application combining Jenkins with Terraform and Ansible☆12Apr 17, 2022Updated 3 years ago
- Bootcamp Engenharia de Dados realizado pela IGTI☆29Feb 23, 2021Updated 5 years ago
- A benchmark for serverless analytic databases.☆26Jan 23, 2026Updated 2 months ago
- Modern Data Stack☆63Aug 8, 2025Updated 7 months ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆20Aug 12, 2025Updated 7 months ago