Read SparkSQL parquet file as RDD[Protobuf]
☆93Oct 12, 2018Updated 7 years ago
Alternatives and similar repositories for sparksql-protobuf
Users that are interested in sparksql-protobuf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Low level integration of Spark and Kafka☆131Mar 15, 2018Updated 8 years ago
- DCOS Zeppelin package☆16May 2, 2019Updated 6 years ago
- Test for SparkSQL ScalaPB☆14Jun 28, 2022Updated 3 years ago
- Read druid segments from hadoop☆10Jan 18, 2017Updated 9 years ago
- Geospatial visualization for Apache Zeppelin using the Leaflet map library.☆12Dec 11, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Example finatra project to get you started☆25Jan 9, 2015Updated 11 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆281Aug 3, 2018Updated 7 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- Sangria monix integration☆10Apr 12, 2026Updated last week
- On demand presto cluster with mesos, marathon and docker.☆29Mar 7, 2018Updated 8 years ago
- An application to monitor and drive the Spark JobServer☆12Dec 12, 2014Updated 11 years ago
- The Scalding tutorial as a standalone SBT project☆51Oct 16, 2017Updated 8 years ago
- Approximate cardinality estimation with HyperLogLog, as a Hive function☆42Dec 17, 2012Updated 13 years ago
- Scala extensions for the Kryo serialization library☆620Aug 19, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆57Apr 12, 2017Updated 9 years ago
- Utilities for building distributed systems on top of mesos☆23Aug 25, 2018Updated 7 years ago
- Coursera Machine Learning class examples in Spark☆43Feb 14, 2014Updated 12 years ago
- Helm Chart for lyft/flinkk8soperator☆11Mar 10, 2020Updated 6 years ago
- Snappy Flows☆28Aug 16, 2023Updated 2 years ago
- A paper catalog on Data Management Area for last five years.☆26Sep 10, 2015Updated 10 years ago
- ☆18Apr 7, 2026Updated last week
- A library for reading and writing Protobuf3 data from Spark RDDs.☆11Jun 28, 2020Updated 5 years ago
- A Pelican plugin to generate PDF resumes automatically from a Pelican page in Markdown☆11Feb 8, 2016Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An example of using Avro and Parquet in Spark SQL☆60Nov 16, 2015Updated 10 years ago
- Mirror of Apache HCatalog☆59Apr 14, 2023Updated 3 years ago
- REST job server for Apache Spark☆2,845Mar 3, 2026Updated last month
- Use Cascading Taps and Scalding DSL with Spark☆49Dec 28, 2016Updated 9 years ago
- Scala library for fitting linear and generalised linear statistical models☆29Dec 29, 2024Updated last year
- A port of gears.c to Scala using Scala Native☆15Sep 26, 2018Updated 7 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,134Apr 10, 2023Updated 3 years ago
- A tool for running Spark on Google Compute Engine☆16Jan 20, 2017Updated 9 years ago
- Graphite reporter for Kafka Offset Monitor.☆44Sep 23, 2015Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Generate Scala case class definitions from Avro schemas☆204Apr 9, 2026Updated last week
- ☆33Mar 12, 2017Updated 9 years ago
- Spark Library for Bulk Loading into Cassandra☆12Apr 18, 2018Updated 8 years ago
- An Apache FreeMarker template resolver for the sbt new command☆12Aug 12, 2017Updated 8 years ago
- Spark Structured Streaming Kafka 0.8 Source Implementation☆35Apr 27, 2017Updated 8 years ago
- ☆92Apr 17, 2017Updated 9 years ago
- A TypeScript compiler written in Scala (wip)☆17Nov 20, 2016Updated 9 years ago