Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc
☆52Aug 25, 2016Updated 9 years ago
Alternatives and similar repositories for Realtime-Data-Analytics-Using-Spark
Users that are interested in Realtime-Data-Analytics-Using-Spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Exercises for the semi-supervised summer school https://semisupervised-learning.compute.dtu.dk.☆11Aug 11, 2016Updated 9 years ago
- ☆14Nov 3, 2016Updated 9 years ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Jun 7, 2021Updated 5 years ago
- spark流数据处理,可以从flume-ng,kafka接收数据☆11Sep 16, 2015Updated 10 years ago
- ☆12May 11, 2016Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Examples of Integrating Spark Streaming, Flume, and HBase to solve Streaming problems☆18Feb 27, 2014Updated 12 years ago
- QA dashboard for DV360 advertisers☆13Jan 20, 2021Updated 5 years ago
- A maven project with codes of the book "Hadoop: The Definitive Guide(Fourth Edition)"☆10Jul 17, 2023Updated 2 years ago
- Mastering Spark for Data Science, published by Packt☆50Apr 22, 2026Updated 2 months ago
- Your first Apache Spark model :)☆21Jun 16, 2020Updated 6 years ago
- This is the reposiory for learning to code in Python. I will be uploading the files to this repository and I will be walking through thes…☆16Feb 13, 2019Updated 7 years ago
- Jupyter notebooks for pulling and analyzing data from social media during crises☆13May 25, 2017Updated 9 years ago
- Learning PySpark video series☆11Mar 5, 2018Updated 8 years ago
- Website of Question Answer Generation☆17Feb 2, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code to support Databases blog post - How to offload data from your transactional NoSQL database to Amazon S3, perform advanced analytics…☆15Mar 26, 2020Updated 6 years ago
- Multiple coding projects completed in Python☆11Jun 10, 2014Updated 12 years ago
- Unfinished work on a general-purpose Taskcluster worker☆19Feb 26, 2020Updated 6 years ago
- Classification problem to predict loan defaulters using Lending Club Dataset☆11Jan 26, 2019Updated 7 years ago
- Convert trained XGBoost model object in R to SQL script☆24Dec 12, 2025Updated 6 months ago
- conbine flume,spark-streaming and redis for real-time computing☆22Oct 20, 2014Updated 11 years ago
- This is the collection of some handy tips running Nexus Repository Manager OSS☆14Aug 20, 2016Updated 9 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆66Jan 15, 2019Updated 7 years ago
- Simple wrapper over SOLR to emulate Azure Search (for development only)☆12Jul 8, 2017Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆152Apr 4, 2018Updated 8 years ago
- 基于Spark的实时日志分析及异常检测系统 Flume + Kafka + Hbase + Spark-Streaming + Scala☆13Mar 12, 2019Updated 7 years ago
- Repository for code used in Kaggle competitions.☆22Oct 5, 2018Updated 7 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Jan 21, 2019Updated 7 years ago
- Data Engineering Project at Insight☆15Nov 17, 2015Updated 10 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆27Nov 21, 2016Updated 9 years ago
- Architecture of Streaming Twitter Data into Apache Kafka cluster, performing simple sentiment analysis with afinn module, storing the dat…☆20Jan 3, 2020Updated 6 years ago
- ☆25May 7, 2020Updated 6 years ago
- Decentralised Energy Market☆12Feb 19, 2018Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Create scalable machine learning applications to power a modern data-driven business using Spark☆61Jan 30, 2023Updated 3 years ago
- I developed this case study only in 7 days with Pyspark (Spark 1.6.0) SQL & MLlib. I used Databricks cluster and AWS. %90 AUC is achieved…☆17May 7, 2016Updated 10 years ago
- A copy of the source for Grinstead and Snell's lovely probability book☆13Dec 20, 2015Updated 10 years ago
- ☆10Jan 14, 2015Updated 11 years ago
- IPython Notebook for Sentiment Classification☆10Nov 12, 2014Updated 11 years ago
- Python code for listening to Streaming APIs☆15Mar 31, 2021Updated 5 years ago
- Modeling methods of System Dynamics – Supply Chain Simulation using the Anylogic software☆10Jan 8, 2026Updated 5 months ago