Parsely / pyspark-cassandraLinks
Utilities and examples to asssist in working with PySpark and Cassandra.
☆36Updated 10 years ago
Alternatives and similar repositories for pyspark-cassandra
Users that are interested in pyspark-cassandra are comparing it to the libraries listed below
Sorting:
- Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS☆8Updated 10 years ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 9 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).☆15Updated 4 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆139Updated 8 years ago
- A scalable, distributed Time Series Database.☆28Updated 10 years ago
- Open source analytics platform powered by Apache Cassandra, Spark, and Kafka☆34Updated 10 years ago
- Automates Spark standalone cluster tasks with Puppet and Fabric.☆43Updated 10 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- from zero to storm cluster for realtime classification using sklearn☆12Updated 10 years ago
- Ferry lets you define, run, and deploy big data applications on AWS, OpenStack, and your local machine using Docker☆253Updated 10 years ago
- Deploy Dask on Marathon☆10Updated 8 years ago
- Task Orchestration Tool Based on SWF and boto3☆38Updated 6 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- ☆24Updated 9 years ago
- Luigi Plugin for Hubot☆36Updated 8 years ago
- On demand presto cluster with mesos, marathon and docker.☆30Updated 7 years ago
- Periscope brings SLA policy based autoscaling to Hadoop☆35Updated 9 years ago
- Exelixi is a distributed framework for running genetic algorithms at scale. The framework is based on Apache Mesos and the code is mostly…☆34Updated 11 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- Data Science Research Architecture, Data Center OS☆21Updated 9 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Updated 9 years ago
- functionstest☆33Updated 8 years ago
- An Ansible role for installing Apache Spark.☆58Updated 6 years ago
- A Cascading Workflow Visualizer☆83Updated 2 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- ☆41Updated 7 years ago
- Spark-cloud is a set of scripts for starting spark clusters on ec2☆12Updated 9 years ago
- Docker image for apache zeppelin☆38Updated 8 years ago
- ☆23Updated 8 years ago
- Apache Zeppelin on Kubernetes.☆28Updated 6 years ago