anguenot / pyspark-cassandraView external linksLinks
pyspark-cassandra is a Python port of the awesome @datastax Spark Cassandra connector. Compatible w/ Spark 2.0, 2.1, 2.2, 2.3 and 2.4
☆69Oct 15, 2024Updated last year
Alternatives and similar repositories for pyspark-cassandra
Users that are interested in pyspark-cassandra are comparing it to the libraries listed below
Sorting:
- PySpark Cassandra brings back the fun in working with Cassandra data in PySpark.☆79Jul 20, 2017Updated 8 years ago
- This project is mainly for learning and practicing simple HIVE commands in real time scenarios. Here we have taken some sample coffee sho…☆11Mar 1, 2018Updated 7 years ago
- Create LAMP Stack using terraform with AWS☆11Feb 15, 2023Updated 3 years ago
- Ansible Playbook to create LAMP in CentOS 7 with Apache, MySQL, PHP.☆10Dec 28, 2018Updated 7 years ago
- Ansible playbooks for Apache Spark on kube☆27Jul 20, 2017Updated 8 years ago
- Projects from my Hadoop training sessions☆16Feb 22, 2018Updated 7 years ago
- Add gevent support to DataStax Python Driver for Apache Cassandra☆11Jun 10, 2020Updated 5 years ago
- All Certification and preparation, examples & others☆11Oct 18, 2018Updated 7 years ago
- Some class materials for a data processing course using PySpark☆52Dec 3, 2022Updated 3 years ago
- ☆15Aug 16, 2018Updated 7 years ago
- ☆14Aug 24, 2021Updated 4 years ago
- Apache Spark to Apache Cassandra connector☆1,949Apr 29, 2025Updated 9 months ago
- ☆21Feb 10, 2026Updated last week
- Testbench for experimenting with Apache Hive at any data scale.☆64Jul 10, 2017Updated 8 years ago
- Python API for Informatica PowerCenter (pmrep, pmcmd)☆21Sep 17, 2017Updated 8 years ago
- Specific correspondence analysis in R☆14Aug 25, 2025Updated 5 months ago
- An example project for running compute intensive Celery workers using AWS Batch.☆27Jun 3, 2025Updated 8 months ago
- Introduction to Data Science with Python☆12Jan 28, 2019Updated 7 years ago
- Python Driver for Apache Cassandra®☆1,423Feb 5, 2026Updated last week
- spark-sight: Spark performance at a glance☆10Apr 6, 2023Updated 2 years ago
- AWS LocalStack + Spark Cluster + Zeppelin [Docker]☆10Jul 6, 2022Updated 3 years ago
- ☆10Feb 13, 2024Updated 2 years ago
- MultiOCR, an interface that connects multiple open-source OCR and various Cloud OCR.☆32Aug 19, 2023Updated 2 years ago
- This repository contains my ML scripts in R☆33Jul 10, 2017Updated 8 years ago
- My Reusable Notes☆26Jun 25, 2020Updated 5 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆66Jan 15, 2019Updated 7 years ago
- Spark with Scala example projects☆34Apr 17, 2019Updated 6 years ago
- PredictorFinc is a scalable supervised machine learning model the predicts stock price change through Decision Tree Regressor using data …☆12Sep 5, 2023Updated 2 years ago
- DevOps☆16May 17, 2021Updated 4 years ago
- A POC of Google's Wide & Deep Learning models deployed on Google Cloud ML Engine for Kaggle's Outbrain Click Competition☆36Jun 19, 2018Updated 7 years ago
- ☆14Sep 14, 2021Updated 4 years ago
- Record videos of your animated SVG☆10Feb 16, 2024Updated 2 years ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆21Jan 19, 2026Updated 3 weeks ago
- Ansible crash course☆39May 3, 2019Updated 6 years ago
- Structural Topic Modeling of the Facebook posts of NC State Senators☆13Mar 17, 2017Updated 8 years ago
- Digit classification with Convolutional Neural Networks using Keras☆20May 12, 2018Updated 7 years ago
- Power Plant ML Pipeline Application - Apache Spark☆12Dec 12, 2016Updated 9 years ago
- ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin…☆11Mar 9, 2022Updated 3 years ago
- REDCap Electronic Data - I (Ingester/Integrator/Importer)☆10Oct 15, 2018Updated 7 years ago