rustyrazorblade/spark-training

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rustyrazorblade/spark-training)

rustyrazorblade / spark-training

Spark Training Exercises

☆25

Alternatives and similar repositories for spark-training

Users that are interested in spark-training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AppliedInfrastructure / cassandra-snapshot-tools
View on GitHub
Create, move, and restore keyspace snapshots across Apache/Datastax Cassandra clusters
☆30Sep 6, 2018Updated 7 years ago
indigo-dc / ansible-role-hadoop
View on GitHub
Ansible Role to install a Hadoop Cluster
☆10Sep 21, 2020Updated 5 years ago
sciencebox / uboxed
View on GitHub
ScienceBox in docker-compose
☆14Apr 9, 2021Updated 5 years ago
Ericsson / ecchronos
View on GitHub
Ericsson distributed repair scheduler for Apache Cassandra
☆37Updated this week
flutrack / Flutrack.org_webapp_source_code
View on GitHub
Flutrack platform gathers flu related tweets from the entire world, with searching tag, words that are influenza synonyms and flu symptom…
☆13Apr 22, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
amazon-contributing / aurora-dsql-benchbase-benchmarking
View on GitHub
cmu/benchbase fork for Aurora DSQL
☆10Feb 3, 2026Updated 5 months ago
smallk / smallk.github.io
View on GitHub
SmallK: very fast data clustering tools
☆13Apr 3, 2019Updated 7 years ago
instaclustr / cassandra
View on GitHub
Mirror of Apache Cassandra
☆14Updated this week
tomekl007 / Packt_Publishing_courses_by_Tomasz_Lelek
View on GitHub
https://www.packtpub.com/books/info/authors/tomasz-lelek
☆13Oct 30, 2021Updated 4 years ago
GalvanizeDataScience / building-spark-applications-live-lessons
View on GitHub
Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…
☆68Jan 8, 2016Updated 10 years ago
instaclustr / cassandra-sstable-tools
View on GitHub
Tools for working with sstables
☆103Jul 14, 2026Updated 2 weeks ago
datastax-archive / java-framework-compare
View on GitHub
Samples comparing popular Java frameworks: Spring, Quarkus, Micronaut, Helidon
☆11Dec 2, 2020Updated 5 years ago
voxpupuli / puppet-cassandra
View on GitHub
Installs Cassandra & DataStax Agent on RHEL/Ubuntu/Debian.
☆13Apr 27, 2026Updated 3 months ago
datastax-labs / Montecristo
View on GitHub
Apache Cassandra Health Check Tooling
☆12Jun 11, 2026Updated last month
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
adrianulbona / borders
View on GitHub
☆17Jan 25, 2017Updated 9 years ago
MartijnVisser / flink-only-sql
View on GitHub
Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this …
☆12Updated this week
agazzarini / apache-solr-essentials
View on GitHub
Source code associated with "Apache Solr Essentials" book
☆12Jan 27, 2025Updated last year
ashrithr / storm-helloworld
View on GitHub
sample hello world topology with pom
☆13Aug 27, 2013Updated 12 years ago
cangermueller / cheat
View on GitHub
My personal cheat sheets
☆12May 5, 2026Updated 2 months ago
spodkowinski / cassandra-reaper-ui
View on GitHub
Web UI for Cassandra Reaper
☆22May 25, 2017Updated 9 years ago
BrianGallew / cassandra_tools
View on GitHub
"top"-like tool for Cassandra
☆21Apr 13, 2015Updated 11 years ago
avikivity / shardsim
View on GitHub
Sharding simulator
☆22Apr 23, 2026Updated 3 months ago
kamal-s-bisht / cassandra-monitoring-by-ELK
View on GitHub
Monitoring cassandra cluster by ELK (Elasticsearch , logstash and Kibana)
☆20Mar 16, 2017Updated 9 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
KeithYue / weibo-keywords-crawler
View on GitHub
Crawl the related sina weibo content using the keywords, and save the results to txt file for future use.
☆18Oct 20, 2016Updated 9 years ago
argussecurity / docker-kafka-prometheus
View on GitHub
Based on wurstmeister's kafka-docker, with Prometheus JMX Exporter included
☆12Nov 24, 2016Updated 9 years ago
hemslo / poky-engine
View on GitHub
A simple search engine in python using Tornado, Scrapy, Redis and MongoDB
☆24Jun 21, 2013Updated 13 years ago
claudiobsd / x49gp
View on GitHub
This is a fork of x49gp which compiles on Ubuntu 12.04
☆16Dec 2, 2021Updated 4 years ago
gavodachs / dachs-doc
View on GitHub
Documentation for the DaCHS VO server
☆11May 13, 2026Updated 2 months ago
cloudera / CML_AMP_Image_Analysis
View on GitHub
Build a semantic search application with deep learning models.
☆16Jun 29, 2026Updated last month
okinesio / okinesio_hardware
View on GitHub
Resources for okinesio boards
☆13Mar 27, 2018Updated 8 years ago
GOCDB / gocdb
View on GitHub
Grid Operations Configuration Management Database. A Repository, Portal and REST style API for managing Grid and Cloud topology objects i…
☆12Jul 15, 2026Updated 2 weeks ago
JeremyGrosser / tablesnap
View on GitHub
Uses inotify to monitor Cassandra SSTables and upload them to S3
☆178May 8, 2021Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
jeqo / post-scale-kafka-containers
View on GitHub
☆13Jan 15, 2017Updated 9 years ago
wxMaxima-developers / docker-wxmaxima
View on GitHub
Appimage build for wxmaxima
☆16Nov 24, 2025Updated 8 months ago
revolt-randy / Fritzing-Schematic-an-Inkscape-Extension.
View on GitHub
☆13Sep 30, 2024Updated last year
GoogleCloudPlatform / datacatalog-tag-history
View on GitHub
Historical metadata of your data warehouse is a treasure trove to discover not just insights about changing data patterns, but also quali…
☆13Jul 21, 2021Updated 5 years ago
MOON-CLJ / scrapy_weibo
View on GitHub
distributed crawler for weibo
☆22May 23, 2013Updated 13 years ago
lag-linaro / stm32
View on GitHub
STM32 related gubbins
☆15Feb 16, 2016Updated 10 years ago
sschatts / conference_talks
View on GitHub
☆13Oct 5, 2019Updated 6 years ago