Apache Spark - A unified analytics engine for large-scale data processing
☆16Jul 24, 2023Updated 2 years ago
Alternatives and similar repositories for spark
Users that are interested in spark are comparing it to the libraries listed below
Sorting:
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- Node.js SDK for Oracle NoSQL Database☆12Jan 5, 2026Updated last month
- Practical Byzantine Fault Tolerance Consensus and A Simple Distributed Ledger Application☆11Dec 15, 2017Updated 8 years ago
- Cloudera CDP SDK for Java☆16Updated this week
- SpyCore - Windows Malicious FIle Scanner (Distributes)☆14Jun 10, 2023Updated 2 years ago
- Llama - Low Latency Application MAster☆35Jun 27, 2022Updated 3 years ago
- Event Store implementation in Go☆14May 27, 2019Updated 6 years ago
- Spark* Shuffle plugin for support shuffling through remote persistent memory over fabrics, which leverages the RDMA network and remote pe…☆14Sep 18, 2023Updated 2 years ago
- Splunk app for archive management, including HDFS support.☆36Sep 3, 2014Updated 11 years ago
- Kafka Kubernetes Authenticator and Authorizer☆12Sep 5, 2023Updated 2 years ago
- Mirror of Apache Hadoop MapReduce☆21Feb 2, 2011Updated 15 years ago
- Remote shuffle service for Apache Spark to store shuffle data on remote servers.☆334Sep 29, 2023Updated 2 years ago
- This repository is deprecated. All of its content and history has been moved to googleapis/google-cloud-node.☆14Jul 13, 2023Updated 2 years ago
- gRPC clustered event sourcing docker tool☆13Feb 17, 2026Updated last week
- pyspark-shared-spark-session-helper☆14Aug 29, 2024Updated last year
- ☆11May 4, 2022Updated 3 years ago
- Software developers can use sample code and documentation to use athenahealth's athenaPractice/athenaFlow FHIR API Server.☆19May 1, 2024Updated last year
- Blockchain benchmarking framework☆13Nov 7, 2018Updated 7 years ago
- An implementation of ScalaLab for Scala 3 (Dotty)☆17Dec 19, 2022Updated 3 years ago
- Platform to build distributed, scalable, enterprise-wide business applications☆19Jun 21, 2024Updated last year
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆20Feb 10, 2025Updated last year
- This is the course taught by Prof.Jike Chong and Prof. Ian Lane from CMU☆11May 13, 2016Updated 9 years ago
- ViennaRNA Package consists of a C code library for the prediction and comparison of RNA secondary structures☆15May 20, 2022Updated 3 years ago
- Generate Solidity Code from its AST☆15Sep 1, 2016Updated 9 years ago
- ☆21Nov 24, 2017Updated 8 years ago
- Production-Grade Container Scheduling and Management☆18Jul 6, 2023Updated 2 years ago
- ☆17May 8, 2020Updated 5 years ago
- This library has moved to https://github.com/googleapis/google-cloud-java/tree/main/java-bigquerydatatransfer.☆18Jul 27, 2023Updated 2 years ago
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆17Aug 4, 2022Updated 3 years ago
- Microservice to retrieve stock quotes☆20Oct 31, 2025Updated 4 months ago
- A Python interface to gb-io, a fast GenBank parser written in Rust.☆24Updated this week
- Just me playing with kafka and kafka-streams☆19Jul 10, 2022Updated 3 years ago
- D2IQ Helm Chart Repository☆17Updated this week
- Eval library and patched Scala-3/Dotty compiler. Evaluating source code and trees at compile time hacking multi-staging programming☆20Nov 2, 2022Updated 3 years ago
- A collection of curated ratelimiter adaptors for the KrakenD framework☆25Feb 9, 2026Updated 2 weeks ago
- Miscellaneous functionality for manipulating Apache Spark RDDs.☆22Dec 29, 2018Updated 7 years ago
- This repository is deprecated. All of its content and history has been moved to googleapis/google-cloud-node.☆24Jul 13, 2023Updated 2 years ago
- 分布式数据库一致性组件☆19Oct 8, 2018Updated 7 years ago
- API endpoints for Millennium's HL7 FHIR implementation for patient access☆27Feb 20, 2026Updated last week