TeraSort for Spark and Flink which uses a range partitioner based on sampling
☆22Feb 5, 2016Updated 10 years ago
Alternatives and similar repositories for terasort
Users that are interested in terasort are comparing it to the libraries listed below.
- Color scheme inspired by the art of Rubens LP☆11Jul 14, 2015Updated 10 years ago
- An AI agent to create short stories, using Gemini and Imagen for illustrations. The project is developed in Java 21 with LangChain4j, and…☆11Sep 4, 2025Updated 6 months ago
- A set of base classes in order to perform training scripts for Neural Networks (by means of SNNS) and SVM (by means of SVM Light and SVM …☆14Jun 24, 2011Updated 14 years ago
- A shopping mall and forum built with the SSM framework☆15Jun 30, 2018Updated 7 years ago
- A data generator for Apache Druid☆12Mar 26, 2025Updated last year
- A simple toy project for playing around with some implicit resolution tricks☆12May 6, 2021Updated 4 years ago
- The Definitive Guide to Modernizing Applications on Google Cloud, published by Packt☆12Jan 30, 2023Updated 3 years ago
- Source code for Spark MLlib machine learning in practice☆10Oct 28, 2016Updated 9 years ago
- dllib is a distributed deep learning library running on Apache Spark☆32Oct 26, 2017Updated 8 years ago
- Demo of the DuckDB Spark API implementation. Same PySpark code, but DuckDB under the hood☆15Nov 16, 2023Updated 2 years ago
- ☆80Mar 3, 2026Updated 3 weeks ago
- A Spring Boot quick starter for the mirai component under the simbot framework☆12Jan 1, 2022Updated 4 years ago
- ☆26Sep 2, 2017Updated 8 years ago
- a small simple slow serial FPGA core☆16Mar 11, 2021Updated 5 years ago
- Spark Terasort☆122Apr 21, 2023Updated 2 years ago
- Influence Maximization Paper List☆11May 11, 2022Updated 3 years ago
- Enhanced Jenkins with Docker, Mesos and Marathon☆10Jun 29, 2015Updated 10 years ago
- Swimlane graphs for Hive, SparkSQL, and Presto based on Ganglia resource graphs☆13Feb 13, 2017Updated 9 years ago
- Discover Flink clusters on Hadoop YARN for Prometheus☆23Aug 5, 2020Updated 5 years ago
- Kira is an astronomy image processing toolkit implemented with Apache Spark.☆15Feb 9, 2016Updated 10 years ago
- Influence Maximization in Near-Linear Time: A Martingale Approach Scala implementation☆14Sep 3, 2018Updated 7 years ago
- Dynamic handwritten digit recognition using a model trained with a deep-learning convolutional neural network, reaching 99.64% accuracy☆12Mar 23, 2020Updated 6 years ago
- NOW PART OF SHOULD.JS DO NOT USE ANYMORE☆13Nov 19, 2015Updated 10 years ago
- Helm Chart for lyft/flinkk8soperator☆11Mar 10, 2020Updated 6 years ago
- Using Google BERT to classify biomedical papers☆12Mar 22, 2019Updated 7 years ago
- ☆17May 25, 2015Updated 10 years ago
- Upstream eglibc + xilinx branches☆18Nov 7, 2013Updated 12 years ago
- MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regr…☆12Apr 10, 2019Updated 6 years ago
- Fabric8 Maven plugin to deploy Java applications to Kubernetes☆17Oct 16, 2025Updated 5 months ago
- this folder contains different algorithms implemented on FPGA☆12Dec 30, 2023Updated 2 years ago
- This package contains the code for executing clustering validity indices in Spark. The package includes BD-Silhouette, BD-Dunn, Davies-Bo…☆10Oct 29, 2018Updated 7 years ago
- Scripts to analyze Spark's performance☆136May 20, 2018Updated 7 years ago
- A simple model monitoring dashboard: periodically updated user credit scores and feature distributions☆16Jan 12, 2018Updated 8 years ago
- multi objective, single objective optimization, genetic algorithm for multi-objective optimization, particle swarm intelligence, ... impl…☆15May 17, 2020Updated 5 years ago
- ☆14Apr 27, 2021Updated 4 years ago
- Custom Service for deploying Apache Alluxio on a running HDP 2.3 / IOP 4.1 Ambari Managed Cluster☆13Jan 13, 2017Updated 9 years ago
- real time log event processing using spark, kafka & cassandra☆13Dec 4, 2014Updated 11 years ago
- SystemVerilog implementation of the AHB to TileLink UL (Uncached Lightweight) bridge☆13Sep 9, 2022Updated 3 years ago
- Temporal IMLinUCB - a solution for Online Influence Maximization problem in Temporal Networks (based on IMLinUCB)☆17May 3, 2024Updated last year