TeraSort for Spark and Flink which uses a range partitioner based on sampling
☆22Feb 5, 2016Updated 10 years ago
Alternatives and similar repositories for terasort
Users that are interested in terasort are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Dec 16, 2022Updated 3 years ago
- An AI agent to create short stories, using Gemini and Imagen for illustrations. The project is developed in Java 21 with LangChain4j, and…☆13Sep 4, 2025Updated 7 months ago
- A set of base classes in order to perfom training scripts for Neural Networs ( by means of SNNS) and SVM ( by means of SVM Light and SVM …☆14Jun 24, 2011Updated 14 years ago
- SSM框架构建商城+论坛☆15Jun 30, 2018Updated 7 years ago
- Dependency and data pipeline management framework for Spark and Scala☆15Apr 8, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Java's NIO APIs cache direct ByteBuffers, causing a native memory leak.☆21Jan 3, 2016Updated 10 years ago
- ☆13Apr 22, 2023Updated 2 years ago
- The Definitive Guide to Modernizing Applications on Google Cloud, published by Packt☆12Jan 30, 2023Updated 3 years ago
- dllib is a distributed deep learning library running on Apache Spark☆32Oct 26, 2017Updated 8 years ago
- This repository contains my MSc dissertation project. Iti s an implementation of a streaming GMM algorithm in Spark.☆11Aug 25, 2018Updated 7 years ago
- ☆26Sep 2, 2017Updated 8 years ago
- Spark Terasort☆122Apr 21, 2023Updated 2 years ago
- Parallel Particle Swarm Optimizer on the Spark Clustering Computing Platform.☆12Oct 29, 2018Updated 7 years ago
- Swimlane graphs for Hive, SparkSQL, and Presto based on Ganglia resource graphs☆13Feb 13, 2017Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Kira is an astronomy image processing toolkit implemented with Apache Spark.☆15Feb 9, 2016Updated 10 years ago
- The ISC Anomaly Detection and Classification Framework implemented for Apache Flink.☆13Dec 14, 2016Updated 9 years ago
- The cloudopting core manager☆10Nov 19, 2022Updated 3 years ago
- NOW PART OF SHOULD.JS DO NOT USE ANYMORE☆13Nov 19, 2015Updated 10 years ago
- Basic dynamically loadable extension for HHVM☆30Nov 30, 2016Updated 9 years ago
- Wiki☆12Sep 28, 2015Updated 10 years ago
- Helm Chart for lyft/flinkk8soperator☆11Mar 10, 2020Updated 6 years ago
- ☆17May 13, 2018Updated 7 years ago
- ☆17May 25, 2015Updated 10 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regr…☆12Apr 10, 2019Updated 7 years ago
- ☆12Sep 25, 2021Updated 4 years ago
- Scripts to analyze Spark's performance☆136May 20, 2018Updated 7 years ago
- Next-generation Cassandra Conference, September 26, 2017☆12Aug 23, 2018Updated 7 years ago
- 简易的模型监控界面:定期更新的用户信用分及特征分布☆16Jan 12, 2018Updated 8 years ago
- multi objective, single objective optimization, genetic algorithm for multi-objective optimization, particle swarm intelligence, ... impl…☆15May 17, 2020Updated 5 years ago
- A tool visualization of Tree(Query Plan) in Postgresql☆14May 15, 2023Updated 2 years ago
- Custom Service for deploying Apache Alluxio on a running HDP 2.3 / IOP 4.1 Ambari Managed Cluster☆13Jan 13, 2017Updated 9 years ago
- Examples for Java Concurrency Stress (jcstress) tests with gradle integration☆17Oct 8, 2017Updated 8 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- real time log event processing using spark, kafka & cassandra☆13Dec 4, 2014Updated 11 years ago
- Run Samza as a Spring Boot application☆18Mar 6, 2017Updated 9 years ago
- Multi-objective particle swarm optimization algorithm in .m☆12May 9, 2020Updated 5 years ago
- Projects from my Hadoop training sessions☆16Feb 22, 2018Updated 8 years ago
- Scripts for the Cassandra Monitoring blog miniseries☆10May 15, 2017Updated 8 years ago
- ☆14Jan 17, 2019Updated 7 years ago
- Port of TPC-DS data generator to Java☆13Aug 1, 2017Updated 8 years ago