Getting started with Spark, Spark streaming, Spark SQL and DataFrame.
☆48May 15, 2018Updated 7 years ago
Alternatives and similar repositories for spark-in-practice-scala
Users that are interested in spark-in-practice-scala are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Getting started with Spark, Spark Streaming, Spark SQL, DataFrame☆35Apr 24, 2016Updated 9 years ago
- Learning Spark SQL, published by Packt☆43Jan 30, 2023Updated 3 years ago
- Some exercises to learn Spark. Solved in Python.☆21Oct 15, 2024Updated last year
- ☆10Aug 28, 2018Updated 7 years ago
- ☆14Aug 23, 2015Updated 10 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Example of reading/writing Excel files from Pandas/Python☆14Dec 10, 2014Updated 11 years ago
- an example of integrating Spark Streaming with Google Pub/Sub and Google Datastore☆16Mar 22, 2017Updated 9 years ago
- ☆16May 9, 2018Updated 7 years ago
- Example integration of Kafka, Avro & Spark-Streaming on live Twitter feed☆23Jan 23, 2015Updated 11 years ago
- Different entries to kaggle contests using Apache Spark☆13Jun 5, 2017Updated 8 years ago
- A simple Scala Based Project Template for Apache Spark☆21Oct 21, 2016Updated 9 years ago
- Self-contained examples of Apache Spark streaming integrated with Apache Kafka.☆198Apr 15, 2018Updated 7 years ago
- Scala examples for learning to use Spark☆445Sep 17, 2020Updated 5 years ago
- 学堂在线使用Cookiesfile登陆批量下载课程视频 ,不需要输入用户名和密码☆17May 4, 2017Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- An example of using Avro and Parquet in Spark SQL☆60Nov 16, 2015Updated 10 years ago
- ☆11Oct 8, 2015Updated 10 years ago
- A Spark Streaming App to analyze the popular hashtags based on keywords☆24Feb 26, 2017Updated 9 years ago
- ScalaCheck for Spark☆63Apr 2, 2018Updated 7 years ago
- SparkLearning_NoData, including code,pom and so on☆13Mar 21, 2017Updated 9 years ago
- My blogs☆47Apr 13, 2016Updated 9 years ago
- Spark, Spark Streaming and Spark SQL unit testing strategies☆215Oct 12, 2016Updated 9 years ago
- Spark 学习之路,包含 Spark Core,Spark SQL,Spark Streaming,Spark mllib 学习笔记☆146Jul 3, 2018Updated 7 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Mar 23, 2016Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A text classification model based on RNN(recurrent neural network)☆23Oct 10, 2017Updated 8 years ago
- An open source stream generator which generates reproducible and deterministic out-of-order streams, simulating arbitrary fractions of ou…☆12May 14, 2019Updated 6 years ago
- ☆35Jun 23, 2016Updated 9 years ago
- My submissions for the Coursera MOOC "Big Data Analysis with Scala and Spark" given by EPFL.☆52Mar 24, 2017Updated 9 years ago
- 主要解决ctr预估工程中的特征选择,特征编号(特征离散),单特征auc和logloss这3个问题.☆20Mar 30, 2017Updated 8 years ago
- Tensorflow Implementation of the paper "Topology Adaptive Graph Convolutional Networks" (Du et al., 2017)☆19Dec 3, 2025Updated 3 months ago
- Scala练习项目:包括scala基础知识,Spark RDD,DataFrame,Spark SQL,spark与HDFS、Phoenix、Hbase交互。☆11Nov 11, 2022Updated 3 years ago
- Coding interview questions with solutions and tests (Scala)☆26Sep 23, 2025Updated 6 months ago
- Enterprise SQL-on-Hadoop Solution [Season One]☆34Mar 3, 2018Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Examples of Spark 2.0☆212Aug 11, 2021Updated 4 years ago
- Sample files for Pinot tutorial☆18May 10, 2024Updated last year
- I implemented various ETL processes like loading the data using sqoop from mysql to hdfs, transform the data using Spark and Scala, perfo…☆10Oct 20, 2017Updated 8 years ago
- A free tutorial for Apache Spark.☆992Jan 5, 2026Updated 2 months ago
- Sample code for building a Python application for Apache Flink on Kinesis Data Analytics.☆14Aug 30, 2023Updated 2 years ago
- The ScaleOut Time Windowing Library for Java provides a set of windowing functions for time-ordered lists of events.☆21May 11, 2018Updated 7 years ago
- Optimizing Databricks Workload, published by Packt☆18Mar 2, 2026Updated 3 weeks ago