Java implementation of a mini Spark-like framework named MiniSpark that can run on top of a HDFS cluster. MiniSpark supports operators including Map, FlatMap, MapPair, Reduce, ReduceByKey, Collect, Count, Parallelize, Join and Filter.
☆37Jul 28, 2017Updated 8 years ago
Alternatives and similar repositories for MiniSpark
Users that are interested in MiniSpark are comparing it to the libraries listed below
Sorting:
- Implementation of a Big Data (batch and stream) distributed processing engine in Java using Akka actors.☆12Feb 20, 2023Updated 3 years ago
- A distributed in-memory key-value storage for billions of small objects.☆26Aug 23, 2019Updated 6 years ago
- [ACL 2019/AACL 2020] Second-Order Syntactic/Semantic Dependency Parsing With Mean Field Variational Inference (PyTorch)☆14Oct 22, 2020Updated 5 years ago
- Solitaire is a Faster Linearizability Checker Supporting Multiple Data Model☆18Aug 9, 2018Updated 7 years ago
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 3 years ago
- A tiny deep learning library written in Java☆27Feb 12, 2023Updated 3 years ago
- A high-performance, concurrent hash table☆25Dec 15, 2013Updated 12 years ago
- 分布式计算:分布式数据处理(流计算/批处理)、消息队列、数据仓库☆25Nov 8, 2025Updated 3 months ago
- ☆18Apr 25, 2017Updated 8 years ago
- Raft backend using LevelDB☆32Jun 11, 2022Updated 3 years ago
- A feature-rich concurrency kit, yet another DAG framework☆10Jan 18, 2026Updated last month
- Simple chatbot created using Rasa☆10Feb 20, 2021Updated 5 years ago
- Distributed KV Storage System based on Raft and RocksDB, can be use to store small files, like images.☆59Mar 12, 2020Updated 5 years ago
- High performance HBase / Spark SQL engine☆28Jul 7, 2022Updated 3 years ago
- The official implementation of ACL2022``Bottom-Up Constituency Parsing and Nested Named Entity Recognition with Pointer Networks''☆34Jan 12, 2023Updated 3 years ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- C++ network library☆10Apr 14, 2015Updated 10 years ago
- Paster core module using KiteX☆10Aug 30, 2023Updated 2 years ago
- MIT 6.824 Lab 2012(C++)☆30Mar 13, 2013Updated 12 years ago
- 《智能投顾》读书笔记☆12May 23, 2019Updated 6 years ago
- Kubenetes with SpringBoot demo☆10Feb 20, 2019Updated 7 years ago
- ☆10Aug 13, 2021Updated 4 years ago
- A timer module for Redis☆11Oct 16, 2019Updated 6 years ago
- Second generation of the ICGC DCC release ETL built on Spark☆10Apr 8, 2019Updated 6 years ago
- A Universal Binary JSON (UBJSON) parser, renderer and builder☆10Jul 6, 2013Updated 12 years ago
- hadoop中Map/Reduce使用示例,输入(DBInputFormat),输出(DBOutputFormat)为MySql数据库表、日志分析Grep、单词排序Sort...对HBase的基本操作,增、删、查、改,使用Map/Reduce批量导入数据到HBase表中..…☆14Apr 6, 2013Updated 12 years ago
- Code samples, summaries, cheatsheets and other study material for Hadoop MapReduce and Apache Spark☆10Aug 17, 2018Updated 7 years ago
- Java streams utility methods for memoization☆11Dec 12, 2024Updated last year
- How to write interpreters or dynamic compilers for dynamically typed languages on top of the JVM☆16Updated this week
- An exploration of Flink and change-data-capture via flink-cdc-connectors☆11Jul 7, 2021Updated 4 years ago
- My best Java class to compress any String, short or long, with any character of human history☆12Feb 16, 2026Updated last week
- seckill秒杀项目【PRC】☆10Apr 13, 2019Updated 6 years ago
- Integration of Iceberg table management into Spark SQL☆11Jan 21, 2020Updated 6 years ago
- A lite fast object pool☆49Dec 19, 2025Updated 2 months ago
- Configuration Space Exploration Framework☆17Oct 13, 2020Updated 5 years ago
- ClusterTech Parallel Filesystem☆12May 18, 2018Updated 7 years ago
- RockIt: A query engine for Markov logic☆11May 24, 2016Updated 9 years ago
- 一个简易的正则表达式引擎!☆10Apr 9, 2017Updated 8 years ago
- A job management system for python☆10Jan 16, 2026Updated last month