☆18Aug 20, 2017Updated 8 years ago
Alternatives and similar repositories for learning-apache-spark
Users that are interested in learning-apache-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LearningApacheSpark☆250Jan 3, 2024Updated 2 years ago
- hadoop-tutorials☆12Sep 4, 2013Updated 12 years ago
- WIPE implementation☆13Nov 26, 2023Updated 2 years ago
- HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O o…☆21Feb 10, 2026Updated 2 months ago
- ☆14Jun 4, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆28May 24, 2023Updated 2 years ago
- Guía para setear un proyecto básico en VSCode en C, para la materia de Sistemas Operativos☆24Sep 7, 2020Updated 5 years ago
- COCCL: Compression and precision co-aware collective communication library☆31Mar 16, 2025Updated last year
- PawarBI☆31May 11, 2023Updated 2 years ago
- Source Code for 'Advanced Data Analytics Using Python' by Sayan Mukhopadhyay☆67May 23, 2018Updated 7 years ago
- On-demand port forwarding to k8s.☆25Apr 10, 2026Updated 3 weeks ago
- This a simple Python daemon to monitor your Impala nodes.☆10Apr 13, 2021Updated 5 years ago
- SC24 Deep Learning at Scale Tutorial Material☆34Feb 5, 2025Updated last year
- Repository that showcases problems with Kafka rebalancing and explains how to fix them. Please visit our blog article to learn what Kafka…☆12Aug 21, 2020Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- My Raspberry Pi installation at home.☆11Mar 16, 2024Updated 2 years ago
- A boilerplate project for Azure Big Data PaaS services☆14Dec 7, 2022Updated 3 years ago
- ☆10Dec 5, 2022Updated 3 years ago
- POC for all the stack of big data (kafka, spark, cassandra, hdfs, docker, springboot)☆12Dec 16, 2022Updated 3 years ago
- Distributed Data Systems with Azure Databricks, published by Packt☆12Jan 18, 2023Updated 3 years ago
- Hackerank Programming Challenges☆10May 8, 2021Updated 5 years ago
- High-performance Kafka backup and restore with point-in-time recovery. Supports Local file, S3, Azure, GCS. Open source (MIT).☆37Apr 27, 2026Updated last week
- Implementation of java.time for Scala.js and Scala Native☆16Apr 24, 2026Updated 2 weeks ago
- Azure Synapse Analytics Samples☆14Feb 15, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A clean, modern, and fully responsive HTML résumé (CV) template☆13Mar 13, 2026Updated last month
- A Guide to apache maven, httpclient, tomcat, ant and tiles.☆13Jul 23, 2018Updated 7 years ago
- A library to abstract between different lossless and lossy compressors☆37Feb 11, 2026Updated 2 months ago
- docs, codes and resources to prepare for the CRT020: Databricks Certified Associate Developer for Apache Spark 2.4 with Python 3 certific…☆10Sep 25, 2019Updated 6 years ago
- Java OutOfMemory Example☆11Jun 19, 2021Updated 4 years ago
- ☆13Jul 15, 2023Updated 2 years ago
- Two-day level 300 Azure Synapse Analytics workshop☆11Mar 16, 2021Updated 5 years ago
- Auto-fixing error due to version upgrade, good practice etc.☆11Sep 5, 2020Updated 5 years ago
- OpenEmbedding is an open source framework for Tensorflow distributed training acceleration.☆33Apr 13, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is a list of YAML file examples for Docker, Kubernetes, Ansible. Also includes a Python script.☆10Jan 12, 2021Updated 5 years ago
- powershell_profile.ps1☆14Feb 11, 2026Updated 2 months ago
- All my leet code solutions in Java☆11Aug 9, 2021Updated 4 years ago
- Reusable Python classes that extend open source PySpark capabilities. Examples of implementation is available under notebooks of repo htt…☆13Nov 1, 2024Updated last year
- Data pipeline project using Data Factory, Databricks and Cosmosdb Graph, deployed using Azure DevOps, secured using firewalls and Azure A…☆11Dec 14, 2022Updated 3 years ago
- Example project on how to do state recovery in Apache Flink using Apache Avro☆12May 7, 2018Updated 8 years ago
- This repo contains the source code for: Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs☆43Aug 14, 2024Updated last year