Real-time log event processing using Spark, Kafka & Cassandra
☆13 · Dec 4, 2014 · Updated 11 years ago
Alternatives and similar repositories for LogEventsProcessingSpark
Users that are interested in LogEventsProcessingSpark are comparing it to the libraries listed below
- Synthetic data generators for simulating real-time data and workloads ☆12 · Nov 6, 2015 · Updated 10 years ago
- Scripts for the Cassandra Monitoring blog miniseries ☆10 · May 15, 2017 · Updated 8 years ago
- ☆10 · Nov 26, 2014 · Updated 11 years ago
- https://www.packtpub.com/books/info/authors/tomasz-lelek ☆12 · Oct 30, 2021 · Updated 4 years ago
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog… ☆13 · Jun 26, 2022 · Updated 3 years ago
- A simple toy project for playing around with some implicit resolution tricks ☆12 · May 6, 2021 · Updated 4 years ago
- These are some code examples ☆56 · Jan 12, 2020 · Updated 6 years ago
- A simple pipeline utilising cron, Postgres, AWS EC2, and Metabase ☆12 · Jul 9, 2024 · Updated last year
- A simple how-to for graphing iostats ☆19 · Apr 25, 2016 · Updated 9 years ago
- "top"-like tool for Cassandra ☆21 · Apr 13, 2015 · Updated 10 years ago
- Monitoring a Cassandra cluster with ELK (Elasticsearch, Logstash and Kibana) ☆20 · Mar 16, 2017 · Updated 9 years ago
- Data processing of OpenSky COVID-19 Flight Dataset ✈️ ☆14 · Apr 6, 2024 · Updated last year
- Docker Compose with Grafana and Prometheus for monitoring Cassandra ☆22 · Mar 26, 2018 · Updated 7 years ago
- Wide+Deep learning Neural Network Tensorflow ☆20 · Feb 4, 2018 · Updated 8 years ago
- The ISC Anomaly Detection and Classification Framework implemented for Apache Flink. ☆13 · Dec 14, 2016 · Updated 9 years ago
- With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset wh… ☆15 · Jan 4, 2026 · Updated 2 months ago
- ☆19 · Jun 22, 2022 · Updated 3 years ago
- ☆20 · Jun 12, 2020 · Updated 5 years ago
- Using Google BERT to classify biomedical papers ☆12 · Mar 22, 2019 · Updated 6 years ago
- Scripts to quickly measure system baseline performance ☆23 · Aug 1, 2025 · Updated 7 months ago
- Finance 🏦 Data Builder 🛠️ @ postgres 🐘 ☆22 · Feb 11, 2021 · Updated 5 years ago
- ☆17 · May 25, 2015 · Updated 10 years ago
- MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regr… ☆12 · Apr 10, 2019 · Updated 6 years ago
- Analyzing Twitter real time feed with Spark Streaming ☆32 · Feb 27, 2015 · Updated 11 years ago
- Demonstration of using Caffe2 inside an Android application. ☆10 · Dec 23, 2018 · Updated 7 years ago
- Python Script to Download Images from Instagram ☆23 · Apr 2, 2016 · Updated 9 years ago
- Example of use of Spark Streaming with Kafka ☆90 · Jul 11, 2014 · Updated 11 years ago
- Managing machine learning life-cycle with MLflow tutorial ☆23 · May 1, 2023 · Updated 2 years ago
- ☆11 · Mar 13, 2017 · Updated 9 years ago
- Run Samza as a Spring Boot application ☆18 · Mar 6, 2017 · Updated 9 years ago
- Projects from my Hadoop training sessions ☆16 · Feb 22, 2018 · Updated 8 years ago
- ☆14 · Mar 7, 2015 · Updated 11 years ago
- Notebooks for nlp-on-spark ☆13 · Jan 27, 2017 · Updated 9 years ago
- Linear Algebra for Machine Learning Book Exercises ☆13 · May 19, 2019 · Updated 6 years ago
- Consumes Kafka topics specified in the config, and outputs them in chunks as desired in an S3 Bucket. Keeps track of offsets via S3. ☆15 · Sep 6, 2013 · Updated 12 years ago
- ☆12 · Jul 27, 2015 · Updated 10 years ago
- ☆10 · Nov 3, 2016 · Updated 9 years ago
- Spark Training Exercises ☆25 · May 11, 2016 · Updated 9 years ago
- A Python Natural Language Processing Toolkit for Electronic Health Record Texts ☆13 · May 24, 2023 · Updated 2 years ago