massie / spark-parquet-exampleView external linksLinks
Example project to show how to use Spark to read and write Avro/Parquet files
☆50Aug 21, 2013Updated 12 years ago
Alternatives and similar repositories for spark-parquet-example
Users that are interested in spark-parquet-example are comparing it to the libraries listed below
Sorting:
- An example of using Avro and Parquet in Spark SQL☆60Nov 16, 2015Updated 10 years ago
- This is an introduction of Apache Spark DataFrames.☆41Mar 12, 2015Updated 10 years ago
- Simple Spark app that reads and writes Avro data☆31Apr 13, 2015Updated 10 years ago
- ☆21Oct 1, 2015Updated 10 years ago
- Examples for Fast Data Processing with Spark☆59Sep 10, 2013Updated 12 years ago
- Materials and Jekyll website for the Wednesday software working group.☆10Feb 17, 2017Updated 8 years ago
- Code and setup information for Introduction to Machine Learning with Spark☆12Sep 4, 2015Updated 10 years ago
- ☆48Feb 4, 2018Updated 8 years ago
- Experiments made with Spark☆15Dec 9, 2014Updated 11 years ago
- Example integration of Kafka, Avro & Spark-Streaming on live Twitter feed☆23Jan 23, 2015Updated 11 years ago
- A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other…☆13Feb 23, 2015Updated 10 years ago
- Integrate the GA4GH schemas and probably a scala impl of the service.☆14May 20, 2016Updated 9 years ago
- Examples of Integrating Spark Streaming, Flume, and HBase to solve Streaming problems☆19Feb 27, 2014Updated 11 years ago
- Single view demo☆14Feb 13, 2016Updated 10 years ago
- Sample custom Nifi processor to process tcpdump☆18Nov 19, 2015Updated 10 years ago
- functionstest☆33Oct 25, 2016Updated 9 years ago
- Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.☆115Nov 12, 2015Updated 10 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆25Sep 25, 2014Updated 11 years ago
- Apache Zeppelin Service for Apache Ambari Service. Installation and management of Zeppelin via Ambari.☆14Jan 23, 2016Updated 10 years ago
- Scripts to launch cluster used for Strata☆33Feb 11, 2014Updated 12 years ago
- An example of bioinformatics and bigdata tools can playing nicely together☆14May 17, 2016Updated 9 years ago
- KDD Hands-On Tutorial (2018)☆29Dec 8, 2022Updated 3 years ago
- Beyond Piwik Analytics with Scala and Apache Spark☆46Nov 30, 2014Updated 11 years ago
- personal cheatsheets on various technologies☆25Sep 5, 2016Updated 9 years ago
- Vertica Machine Learning examples and example data.☆25Nov 15, 2023Updated 2 years ago
- Ambari service to deploy/manage Hortonworks IoT demo☆22Apr 27, 2017Updated 8 years ago
- SequenceIQ Hadoop examples☆115Oct 26, 2015Updated 10 years ago
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.☆50May 19, 2016Updated 9 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52May 13, 2016Updated 9 years ago
- Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet☆28May 15, 2014Updated 11 years ago
- Kaggle's click through rate prediction with Spark Pipeline API☆23Feb 10, 2016Updated 10 years ago
- Zeppelin notebook examples☆25Feb 18, 2016Updated 9 years ago
- Translation layer between the GDC Data Dictionary and psqlgraph☆28Jun 27, 2025Updated 7 months ago
- Apache Spark jobs such as Principal Coordinate Analysis.☆75Jan 30, 2017Updated 9 years ago
- ☆25Oct 12, 2016Updated 9 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- Tools for spark which we use on the daily basis☆65Jul 2, 2020Updated 5 years ago
- This project is for examples of how to use Zeppelin. https://github.com/apache/incubator-zeppelin☆25Jan 27, 2016Updated 10 years ago
- ☆24Jul 2, 2015Updated 10 years ago