Example Spark project using Parquet as a columnar store with Thrift objects.
☆48Aug 14, 2014Updated 11 years ago
Alternatives and similar repositories for spark-parquet-thrift-example
Users that are interested in spark-parquet-thrift-example are comparing it to the libraries listed below.
- An example of bioinformatics and bigdata tools playing nicely together☆14May 17, 2016Updated 9 years ago
- Kaggle's click through rate prediction with Spark Pipeline API☆23Feb 10, 2016Updated 10 years ago
- Bucketing and partitioning system for Parquet☆30May 22, 2018Updated 7 years ago
- ☆21Oct 1, 2015Updated 10 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Aug 21, 2013Updated 12 years ago
- Google Spreadsheets datasource for SparkSQL and DataFrames☆57Jul 24, 2023Updated 2 years ago
- An R package for reading from and writing to a PostgreSQL database☆15Sep 5, 2019Updated 6 years ago
- An example of using Avro and Parquet in Spark SQL☆60Nov 16, 2015Updated 10 years ago
- Apache NiFi Custom Processor for working with Stanford CoreNLP for Sentiment Analysis in Java 8☆11May 23, 2018Updated 7 years ago
- My configuration and preloaded imports for Ammonite Scala REPL☆10Apr 29, 2021Updated 4 years ago
- SBT plugin building Clojure code☆31Sep 14, 2022Updated 3 years ago
- An HFile-backed Key-Value Server☆43Apr 26, 2019Updated 6 years ago
- Hands On Lab☆40Nov 16, 2022Updated 3 years ago
- Examples of spark-lucenerdd☆15Oct 6, 2023Updated 2 years ago
- Native non-blocking client for ZooKeeper with Finagle☆54May 11, 2015Updated 10 years ago
- Examples for "Property Based Testing for Better Code"☆16Aug 8, 2014Updated 11 years ago
- A cookbook for installing and configuring Apache Spark☆11Sep 6, 2018Updated 7 years ago
- Unicode goodness for Scala code by using vim's “conceal” feature☆18Dec 24, 2014Updated 11 years ago
- Fast Parser Combinators☆27Feb 1, 2015Updated 11 years ago
- A curated list of awesome Apache Spark packages and resources.☆40Mar 14, 2017Updated 9 years ago
- Sample code to help with Elastic Block Store automation with Elastic Volumes feature☆12Feb 24, 2017Updated 9 years ago
- Datasets for Hyperparameter Optimization of Neural Machine Translation☆10Aug 19, 2024Updated last year
- As this has moved to Databricks, please go to: https://github.com/databricks/spark-xml☆15Dec 16, 2015Updated 10 years ago
- ☆16Feb 14, 2026Updated last month
- sbt-web plugin for gzipping assets☆25Mar 5, 2026Updated 3 weeks ago
- Fast Fuzzy String matching dictionary for Scala☆10Mar 20, 2015Updated 11 years ago
- ☆13Mar 8, 2024Updated 2 years ago
- A better solution for building multiple Scala versions (cross compiling) in SBT☆51Apr 29, 2020Updated 5 years ago
- A fast, streaming-friendly, type-safe, pure-Scala MessagePack library. Supercharge your microservices today!☆61Jun 6, 2021Updated 4 years ago
- Examples using R packages which use htmlwidgets☆19May 20, 2020Updated 5 years ago
- Training models with Apache Spark, PySpark for Titanic Kaggle competition☆14Sep 23, 2016Updated 9 years ago
- Docker containerizer for Mesos☆44May 21, 2015Updated 10 years ago
- Unofficial scodec cheatsheet☆26Mar 17, 2017Updated 9 years ago
- front end to view akka cluster topography☆48Jun 27, 2017Updated 8 years ago
- Skeleton of a home lab for learning about DevOps from an infrastructure perspective☆10Mar 2, 2017Updated 9 years ago
- ☆17Jan 2, 2026Updated 2 months ago
- Wikidata processing with Akka streams Proof of Concept☆52Jun 26, 2015Updated 10 years ago
- Miscellaneous functionality for manipulating Apache Spark RDDs.☆22Dec 29, 2018Updated 7 years ago
- A command-line utility for generating optimum polygon label coordinates from GeoJSON☆12Mar 20, 2023Updated 3 years ago