pyspark-cassandra is a Python port of the awesome @datastax Spark Cassandra connector. Compatible w/ Spark 2.0, 2.1, 2.2, 2.3 and 2.4
☆69May 14, 2026Updated this week
Alternatives and similar repositories for pyspark-cassandra
Users that are interested in pyspark-cassandra are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PySpark Cassandra brings back the fun in working with Cassandra data in PySpark.☆79Jul 20, 2017Updated 8 years ago
- Apache Hadoop - Docker distribution based on CentOS 7 and Oracle Java 8☆12Feb 20, 2018Updated 8 years ago
- Ansible playbooks for Apache Spark on kube☆27Jul 20, 2017Updated 8 years ago
- Hadoop Examples☆10Jul 1, 2022Updated 3 years ago
- Add gevent support to DataStax Python Driver for Apache Cassandra☆11Jun 10, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This project is mainly for learning and practicing simple HIVE commands in real time scenarios. Here we have taken some sample coffee sho…☆11Mar 1, 2018Updated 8 years ago
- All Certification and preparation, examples & others☆11Oct 18, 2018Updated 7 years ago
- Projects from my Hadoop training sessions☆16Feb 22, 2018Updated 8 years ago
- ☆14Aug 24, 2021Updated 4 years ago
- Some class materials for a data processing course using PySpark☆52Dec 3, 2022Updated 3 years ago
- ☆10Mar 12, 2021Updated 5 years ago
- Testbench for experimenting with Apache Hive at any data scale.☆64Jul 10, 2017Updated 8 years ago
- Python API for Informatica PowerCenter (pmrep, pmcmd)☆21Sep 17, 2017Updated 8 years ago
- Apache Spark to Apache Cassandra connector☆1,950Apr 29, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Notes from 100 days with Kubernetes☆31Jan 25, 2019Updated 7 years ago
- All my projects on Big Data are provided☆27Dec 5, 2016Updated 9 years ago
- Tools for creating Dataproc custom images☆35Mar 26, 2026Updated last month
- Tools for scraping of twitter data, conversion, text analysis and graph construction☆11Aug 1, 2016Updated 9 years ago
- Python Driver for Apache Cassandra®☆1,428Updated this week
- [NOT MAINTAINED] Create an ElasticSearch cluster with a simple single bash command. Config through environment variables: RAM, cluster na…☆59Jan 26, 2018Updated 8 years ago
- Lab environment based on vagrant to learn ex200/ex300 rhcsa/rhce☆40Mar 16, 2017Updated 9 years ago
- Spark and Python (PySpark) Examples☆39Jul 7, 2021Updated 4 years ago
- A Recurrent Neural Network for classifying the grammaticality of English sentences☆13Mar 15, 2014Updated 12 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- MultiOCR, an interface that connects multiple open-source OCR and various Cloud OCR.☆32Aug 19, 2023Updated 2 years ago
- 教師なし品詞タグ推定☆16Mar 22, 2018Updated 8 years ago
- AWS LocalStack + Spark Cluster + Zeppelin [Docker]☆10Jul 6, 2022Updated 3 years ago
- A collection of data analysis projects done using PySpark via Jupyter notebooks.☆10Oct 8, 2022Updated 3 years ago
- ☆14Dec 10, 2015Updated 10 years ago
- Rasa Chatbot using Django backend and Sockets for communication☆12Dec 8, 2022Updated 3 years ago
- Scripts used to setup a Spark cluster on EC2☆21Mar 24, 2016Updated 10 years ago
- FSelector R package☆12Aug 22, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆22Updated this week
- Educational notes,Hands on problems w/ solutions for hadoop ecosystem☆87Jan 22, 2019Updated 7 years ago
- Real-world Spark pipelines examples☆82Feb 27, 2018Updated 8 years ago
- Spring + Cloudant =☆13Apr 14, 2024Updated 2 years ago
- Weighted multiple-instance learning algorithm☆18Oct 9, 2018Updated 7 years ago
- ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin…☆11Mar 9, 2022Updated 4 years ago
- Power Plant ML Pipeline Application - Apache Spark☆12Dec 12, 2016Updated 9 years ago