gbieul / spyrk-cluster
Spyrk-cluster is a data mini-lab built around the main technologies in use today. It is useful either for understanding how to configure a cluster or as a ready-made environment for testing with submitted or interactive jobs.
☆29Apr 7, 2021Updated 4 years ago
Alternatives and similar repositories for spyrk-cluster
Users that are interested in spyrk-cluster are comparing it to the libraries listed below
- Repository for the Databricks Track☆38Dec 1, 2025Updated 2 months ago
- Localization / Translation Portuguese - Brazil / PT-BR☆12Jul 1, 2024Updated last year
- 🥪💾 A sample of data from the `jaffle-shop-generator` that powers the Jaffle Shop spanning one year.☆14Jan 23, 2025Updated last year
- Demo Application with DataSUS death records and Streamlit☆11Dec 14, 2019Updated 6 years ago
- Sample automated deployment of an application combining Jenkins with Terraform and Ansible☆12Apr 17, 2022Updated 3 years ago
- IBGE - 2010 Census - Location and corresponding Census Tract Code☆10Apr 3, 2021Updated 4 years ago
- Get files from ckan into the webstore.☆22Jan 6, 2022Updated 4 years ago
- Scripts for performing logistic regression and Benford analysis on Brazil's declared donations☆11Jun 30, 2017Updated 8 years ago
- A Docker setup running Airflow with the Hadoop ecosystem (Hive, Spark, and Sqoop)☆13May 2, 2021Updated 4 years ago
- Client for the Correios web service for querying prices and delivery times.☆14Mar 4, 2018Updated 7 years ago
- Data Analysis Experiments☆12Nov 2, 2017Updated 8 years ago
- Generation, reading, and validation of QR Codes in the BR Code standard☆10Nov 19, 2020Updated 5 years ago
- Azkaban Auror core for flow creation☆11Oct 6, 2020Updated 5 years ago
- Data pipeline to extract and provide Brazil inflation data☆10May 19, 2025Updated 8 months ago
- CMP314 Optimizing NLP models with Amazon EC2 Inf1 instances in Amazon Sagemaker☆14Dec 20, 2023Updated 2 years ago
- Databases and dataframes☆11Oct 8, 2022Updated 3 years ago
- QRCode scanner via WebRTC☆15Mar 11, 2014Updated 11 years ago
- Steve's coffee shop recipe project for the Pluralsight Course "Git Fundamentals"☆19Mar 13, 2023Updated 2 years ago
- Elements sourced from Reddit for the Aurora Character Tool☆16Jul 1, 2021Updated 4 years ago
- ☆15Aug 4, 2022Updated 3 years ago
- Automate tracks download from Deezer☆11Jun 29, 2015Updated 10 years ago
- An end-to-end workflow for processing streaming data on Azure.☆17Sep 20, 2024Updated last year
- Repository of files and code used in the Hadoop MapReduce course from the blog http://www.codigofluente.com.br☆16Sep 19, 2018Updated 7 years ago
- ☆14May 11, 2025Updated 9 months ago
- Interstellar and Brachistochrone Solar System Calculators☆23Oct 19, 2025Updated 3 months ago
- Streaming data from a transactional database to a data warehouse using Kafka (Confluent Cloud), Snowflake, and PostgreSQL.☆16Aug 28, 2023Updated 2 years ago
- Modern Data Stack☆62Aug 8, 2025Updated 6 months ago
- Starter files☆14Jul 7, 2020Updated 5 years ago
- ☆16Feb 26, 2021Updated 4 years ago
- This repo contains a plugin for feast to run an offline store on Spark☆13Nov 17, 2022Updated 3 years ago
- CLI tool to query column-level lineage information from the Discovery API and check alignment to best practices☆21Dec 17, 2025Updated last month
- ☆12Jul 30, 2021Updated 4 years ago
- ☆15Apr 27, 2018Updated 7 years ago
- Basic Lambda tutorial from the Um Inventor Qualquer channel☆20Jan 9, 2022Updated 4 years ago
- AWS Batch Demo☆18Jul 31, 2018Updated 7 years ago
- Learn Python imports☆21Oct 7, 2021Updated 4 years ago
- ☆19Nov 8, 2022Updated 3 years ago
- Spark Databricks Notebooks☆14Dec 19, 2020Updated 5 years ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆20Aug 12, 2025Updated 6 months ago