Example project to show how to use Spark to read and write Avro/Parquet files
☆50Aug 21, 2013Updated 12 years ago
Alternatives and similar repositories for spark-parquet-example
Users that are interested in spark-parquet-example are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An example of using Avro and Parquet in Spark SQL☆60Nov 16, 2015Updated 10 years ago
- This is an introduction of Apache Spark DataFrames.☆41Mar 12, 2015Updated 11 years ago
- Simple Spark app that reads and writes Avro data☆31Apr 13, 2015Updated 11 years ago
- ☆21Oct 1, 2015Updated 10 years ago
- A Spark SQL HBase connector☆29May 4, 2015Updated 11 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other…☆13Feb 23, 2015Updated 11 years ago
- Single view demo☆14Feb 13, 2016Updated 10 years ago
- Example integration of Kafka, Avro & Spark-Streaming on live Twitter feed☆22Jan 23, 2015Updated 11 years ago
- ☆48Feb 4, 2018Updated 8 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆24Sep 25, 2014Updated 11 years ago
- Materials and Jekyll website for the Wednesday software working group.☆10Feb 17, 2017Updated 9 years ago
- Integrate the GA4GH schemas and probably a scala impl of the service.☆14May 20, 2016Updated 10 years ago
- Examples for Fast Data Processing with Spark☆59Sep 10, 2013Updated 12 years ago
- Scripts to launch cluster used for Strata☆33Feb 11, 2014Updated 12 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Feb 21, 2014Updated 12 years ago
- ☆17Oct 20, 2017Updated 8 years ago
- 基于flink1.12,使用java,flink sql的demo,包含Mylsql, flinkcdc内置的Mysqlcdc☆12May 27, 2021Updated 5 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- Code and setup information for Introduction to Machine Learning with Spark☆12Sep 4, 2015Updated 10 years ago
- Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.☆115Nov 12, 2015Updated 10 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR.☆44Aug 2, 2016Updated 9 years ago
- Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.☆27Jun 19, 2016Updated 9 years ago
- Cucumber-based framework for defining and executing SQL unit, integration and acceptance tests (for AWS Redshift, PostgreSQL)☆13Sep 30, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆241Mar 26, 2015Updated 11 years ago
- Experiments made with Spark☆15Dec 9, 2014Updated 11 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- API Gateway for *.ik.am☆15Jun 10, 2021Updated 4 years ago
- This sample demonstrates how to make a use of modules provided by Microsoft Azure File Service in Python.☆11Apr 21, 2021Updated 5 years ago
- Interactive web site for reviewing the results of the Achilles R package.☆18Dec 30, 2022Updated 3 years ago
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.☆50May 19, 2016Updated 10 years ago
- Pandas Helper Library for reading and writing DataFrames from and to HBase.☆10Mar 8, 2018Updated 8 years ago
- A Storm based web crawler with Cassandra backend☆29Nov 7, 2013Updated 12 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- My 2nd place submission (working with Kevin Goetsch) out of 28 teams at the Kaggle competition at PyCon2015.☆23Apr 17, 2015Updated 11 years ago
- SequenceIQ Hadoop examples☆115Oct 26, 2015Updated 10 years ago
- SparkStreaming中利用MySQL保存Kafka偏移量保证0数据丢失☆43Aug 2, 2017Updated 8 years ago
- ☆39Aug 19, 2015Updated 10 years ago
- ☆14Nov 3, 2016Updated 9 years ago
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents.☆43Dec 16, 2023Updated 2 years ago
- 实时数仓的一些数据处理(mysql、canal、kafka、flink、hbase、kudu等等),以及一堆Flink的练习☆11Jul 1, 2022Updated 3 years ago