Apache Spark on AWS Lambda
☆157Dec 5, 2022Updated 3 years ago
Alternatives and similar repositories for spark-on-lambda
Users that are interested in spark-on-lambda are comparing it to the libraries listed below
Sorting:
- Cache File System optimized for columnar formats and object stores☆187Aug 11, 2022Updated 3 years ago
- Spark-cloud is a set of scripts for starting spark clusters on ec2☆12Dec 21, 2015Updated 10 years ago
- Green Tunnel Alternative for JVM Languages☆17Updated this week
- Spark/Cassandra/Akka combo to visualize a cloud of words using d3.js☆11Dec 6, 2015Updated 10 years ago
- Serverless proxy for Spark cluster☆325Oct 29, 2020Updated 5 years ago
- Mesos Integration Tests on Docker/Ec2☆15May 25, 2023Updated 2 years ago
- Base classes to use when writing tests with Spark☆1,549Dec 22, 2025Updated 3 months ago
- This repository hold the Amazon Elastic MapReduce sample bootstrap actions☆613Jun 5, 2023Updated 2 years ago
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks☆360Jun 6, 2017Updated 8 years ago
- An Apache Mesos Framework that allows for replaying load over and over and over (and over) again☆10Aug 10, 2015Updated 10 years ago
- Qubole Sparklens tool for performance tuning Apache Spark☆590Jun 26, 2024Updated last year
- Spark Scala docker container sample for AWS testing - EKS & S3☆24Oct 4, 2018Updated 7 years ago
- An integration framework that allows you to run and manage CrateDB via Apache Mesos.☆23Jan 30, 2019Updated 7 years ago
- ☆11Oct 29, 2018Updated 7 years ago
- Demo for meetup 2018-03-08☆11Mar 8, 2018Updated 8 years ago
- Python library for working with ThoughtSpot Modeling Language (TML) files programmatically☆10Mar 13, 2026Updated last week
- Paper: A Zero-rename committer for object stores☆20Nov 7, 2025Updated 4 months ago
- @MissAmyTobey Writes☆49Feb 13, 2026Updated last month
- A Spark library for Amazon SageMaker.☆301Mar 8, 2025Updated last year
- Giraffa FileSystem (Slack: giraffa-fs.slack.com)☆18Mar 8, 2017Updated 9 years ago
- Essential Spark extensions and helper methods ✨😲☆766Sep 14, 2025Updated 6 months ago
- A JRuby binding for HBase☆38Jan 30, 2019Updated 7 years ago
- UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage☆14Sep 10, 2015Updated 10 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Oct 14, 2019Updated 6 years ago
- Redis constructs for cdk8s☆13Updated this week
- A conda-smithy repository for memory_profiler.☆12Updated this week
- ☆41Aug 17, 2016Updated 9 years ago
- A boilerplate for developing Mesos frameworks with JavaScript☆16Mar 7, 2017Updated 9 years ago
- This repo includes the source code used in the KubeCon NA 2020 Session.☆11Feb 17, 2023Updated 3 years ago
- Conway's Game of Life implemented in Scala.js☆10Mar 30, 2018Updated 7 years ago
- A tool to get better debug info on spark's memory usage☆42Aug 21, 2019Updated 6 years ago
- Cascading on Apache Flink®☆54Feb 5, 2024Updated 2 years ago
- Repo that will help you explore how to build a hybrid workflow using Apache Airflow and Amazon ECS Anywhere☆11Jul 12, 2022Updated 3 years ago
- GA Grid (Beta) is a distributive in memory Genetic Algorithm (GA) component for Apache Ignite. A GA is a method of solving complex optimi…☆11Nov 14, 2017Updated 8 years ago
- The Lightning Catalog is an open-source data catalog designed for preparing data at any scale in ad-hoc analytics, data virtualization, …☆37Feb 5, 2026Updated last month
- A model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.☆38Mar 5, 2026Updated 2 weeks ago
- Replication utility for AWS Glue Data Catalog☆79Aug 8, 2024Updated last year
- ☆110Apr 17, 2017Updated 8 years ago
- A JHipster app reporting to Spark Streaming☆14Dec 23, 2014Updated 11 years ago