This is an introduction to Apache Spark DataFrames.
☆41 · Mar 12, 2015 · Updated 11 years ago
Alternatives and similar repositories for spark-dataframe-introduction
Users who are interested in spark-dataframe-introduction are comparing it to the libraries listed below.
- Another, hopefully better, implementation of ALS on Spark ☆14 · May 20, 2015 · Updated 11 years ago
- A simple tutorial application for working with Twitter4j using Scala. ☆14 · Feb 26, 2013 · Updated 13 years ago
- Efficient, distributed downloads of large files from S3 to HDFS using Spark. ☆17 · Apr 26, 2017 · Updated 9 years ago
- Spark GCE Script Helps you deploy Spark cluster on Google Cloud. ☆43 · May 30, 2015 · Updated 10 years ago
- ☆21 · Oct 1, 2015 · Updated 10 years ago
- A Scala library for locality sensitive hashing ☆14 · Aug 1, 2018 · Updated 7 years ago
- Source code of Blog at ☆51 · Sep 17, 2025 · Updated 8 months ago
- Will come later... ☆20 · Jul 1, 2022 · Updated 3 years ago
- Deeplearning4j Examples (DL4J, DL4J Spark, DataVec) ☆10 · Aug 16, 2018 · Updated 7 years ago
- Cucumber-based framework for defining and executing SQL unit, integration and acceptance tests (for AWS Redshift, PostgreSQL) ☆13 · Sep 30, 2020 · Updated 5 years ago
- An example PySpark project with pytest ☆18 · Oct 13, 2017 · Updated 8 years ago
- Example integration of Kafka, Avro & Spark-Streaming on live Twitter feed ☆22 · Jan 23, 2015 · Updated 11 years ago
- A gradle plugin that enables it to handle .thrift idl files and generate them with Thrift or Scrooge ☆13 · Jan 31, 2020 · Updated 6 years ago
- Scripts used to setup a Spark cluster on EC2 ☆21 · Mar 24, 2016 · Updated 10 years ago
- Sample App. Amazon Product Descriptions Wordcloud. Spark Streaming, Algebird, Storehaus, Redis, Scala Scraper, OpenNLP, Play Framework, D… ☆12 · Nov 9, 2015 · Updated 10 years ago
- Fork from python/cpython ☆12 · Dec 5, 2018 · Updated 7 years ago
- The released version of Astro(Spark SQL on HBase) has been moved to: ☆16 · Jul 23, 2015 · Updated 10 years ago
- Pandas Helper Library for reading and writing DataFrames from and to HBase. ☆10 · Mar 8, 2018 · Updated 8 years ago
- scala and spark examples project ☆14 · Feb 19, 2018 · Updated 8 years ago
- Single view demo ☆14 · Feb 13, 2016 · Updated 10 years ago
- Recommendation Web Service ☆17 · Apr 17, 2013 · Updated 13 years ago
- TodoMVC implementation for Widok ☆35 · Jul 2, 2015 · Updated 10 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services. ☆24 · Sep 25, 2014 · Updated 11 years ago
- ☆14 · Nov 3, 2016 · Updated 9 years ago
- ☆27 · Apr 15, 2017 · Updated 9 years ago
- Sprint Planning / Scrum Poker online tool (Akka/Socko Websockets) ☆19 · Dec 22, 2015 · Updated 10 years ago
- Additional useful algorithms that can be used with spark. ☆24 · Dec 24, 2014 · Updated 11 years ago
- ☆10 · Jun 7, 2020 · Updated 5 years ago
- Simple NLP Search - Dataset Generator ☆17 · Apr 29, 2016 · Updated 10 years ago
- ☆13 · Sep 19, 2022 · Updated 3 years ago
- Tools for spark which we use on the daily basis ☆65 · Jul 2, 2020 · Updated 5 years ago
- Scalable Consistency Adjustable Data Storage ☆45 · Oct 13, 2020 · Updated 5 years ago
- ☆13 · Nov 18, 2014 · Updated 11 years ago
- Maven archetype used to bootstrap a Spark Scala project ☆26 · Sep 1, 2015 · Updated 10 years ago
- Role which helps to manage ulimit configuration ☆11 · Apr 27, 2015 · Updated 11 years ago
- A Content Anomaly Detector based on n-Grams ☆24 · Jun 17, 2016 · Updated 9 years ago
- kafka-connect-jdbc system test based on testcontainers ☆13 · Sep 29, 2023 · Updated 2 years ago
- ☆12 · Feb 19, 2017 · Updated 9 years ago
- Scala framework for efficient sequential and data-parallel collections - ☆173 · Aug 4, 2014 · Updated 11 years ago