Dukecat0613 / Big-Data
Big data technologies
☆11 Updated 8 years ago
Alternatives and similar repositories for Big-Data:
Users who are interested in Big-Data are comparing it to the repositories listed below
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc. ☆51 Updated 8 years ago
- This is a simple streaming application that utilises Kafka and Python ☆45 Updated 6 years ago
- ☆35 Updated 2 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag ☆29 Updated 9 years ago
- ☆24 Updated 9 years ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream ☆68 Updated 8 years ago
- ☆16 Updated 8 years ago
- TAC is an Airflow plugin that helps you extract, transform and load your data a bit more easily ☆9 Updated 7 years ago
- DataFlow GUI is a desktop application for constructing Big Data programs by building a DAG ☆12 Updated 7 years ago
- Docker Compose files for various Kafka stacks ☆32 Updated 7 years ago
- Twitter sentiment analysis using Spark and Stanford CoreNLP, with visualization using Elasticsearch and Kibana ☆20 Updated 7 years ago
- The purpose of this tiny project is to put things together with the know-how that I learned from the course big data expert from formacio… ☆62 Updated 6 years ago
- ☆53 Updated 2 years ago
- Simple demonstration of how to build a complex real-time machine learning visualization tool. ☆16 Updated 9 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark ☆41 Updated 7 years ago
- Spark in Kaggle competitions ☆10 Updated 9 years ago
- Real-time log event processing using Spark, Kafka & Cassandra ☆13 Updated 10 years ago
- A cookiecutter template for Apache Spark applications written in Scala ☆10 Updated 6 years ago
- Use Kafka and Apache Spark streaming to perform click stream analytics ☆76 Updated 5 years ago
- An example of integrating Spark Streaming with Google Pub/Sub and Google Datastore ☆17 Updated 8 years ago
- Slowly Changing Dimension type 2 in Hive query language, using the exclusive join technique with ORC Hive tables, partitioned and clustered… ☆16 Updated 5 years ago
- ☆8 Updated 8 years ago
- This repository contains code files, specifically IPython notebooks, for the assignments in the course "Scalable Machine Learning" by UC Be… ☆30 Updated 9 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming ☆55 Updated 6 years ago
- Flask app to push/pull on Kafka over HTTP ☆41 Updated 10 years ago
- Counting Tweets Per User in Real-Time ☆42 Updated 7 years ago
- ☆26 Updated last year
- ☆14 Updated 9 years ago
- Tutorial repo for the article "ML in Production" ☆30 Updated 2 years ago
- Data Science and Machine Learning with Python - Hands On from Udemy ☆14 Updated 7 years ago