This repository hosts the code/projects/demos/slides for Big Data technologies under Apache Hadoop and Apache Spark umbrella.
☆42Aug 20, 2022Updated 3 years ago
Alternatives and similar repositories for bigdata
Users that are interested in bigdata are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Machine Learning Projects and Learning Content☆489Jan 21, 2024Updated 2 years ago
- ☆54Jul 1, 2022Updated 3 years ago
- Exploratory Data Analysis ; Deep Learning 3;☆12Aug 31, 2018Updated 7 years ago
- Understanding of POS tags and build a POS tagger from scratch☆11Jun 9, 2018Updated 7 years ago
- A tutorial for using Hadoop with Python and Hive☆10May 26, 2015Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Example project - how to implement JWT authentication in Dart☆13Mar 31, 2021Updated 5 years ago
- Machine Learning for Internet of Things☆12Jul 24, 2019Updated 6 years ago
- Richmond JUG Oct 18 2017☆16Aug 29, 2018Updated 7 years ago
- This Repo leverages all the things we can do with ADB commands.☆14Jun 1, 2021Updated 4 years ago
- ☆17Oct 8, 2018Updated 7 years ago
- Spark in Action, 2e - chapter 10 - Ingestion through structured streaming☆15Jan 4, 2022Updated 4 years ago
- Simple Recommender System for Viblo Website using LDA (Latent Dirichlet Allocation)☆15Apr 9, 2019Updated 7 years ago
- Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple …☆30Aug 26, 2020Updated 5 years ago
- Built with Gradle & Travis CI | Backend with Java Spring/Spring Boot, RESTful API and Heroku(PostgreSQL) + Web Frontend in Vue.js and Boo…☆16Jul 26, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- plan, design and implement enterprise data infrastructure solutions and create the blueprints for an organization’s data management syste…☆14Jun 25, 2023Updated 2 years ago
- Best Practices and Style Guides in BEEVA☆26Jun 22, 2018Updated 7 years ago
- Deep Learning Projects on TensorFlow and Keras☆20Jun 13, 2024Updated last year
- Project files for my Spring Core Dev Ops Course☆26Jun 3, 2025Updated 11 months ago
- A Genetic Algorithms framework for Hadoop MapReduce.☆10May 30, 2018Updated 7 years ago
- ☆22Jul 14, 2020Updated 5 years ago
- ☆18Nov 13, 2020Updated 5 years ago
- Arduino and Raspberry PI Development Kit☆10Jun 28, 2017Updated 8 years ago
- About Let The Data Confess☆10May 2, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A simple SOCKS v4a proxy server written in C# (.NET Standard 2.0) which forwards all traffic to a SOCKS v5 server.☆33Dec 31, 2022Updated 3 years ago
- PySpark Tutorials and Materials☆19Mar 1, 2021Updated 5 years ago
- Shanoir (SHAring iN vivO Imaging Resources)☆26May 12, 2026Updated last week
- This will contain study material for agile and scrum guide, nexus guide, manifesto etc☆12Jan 22, 2022Updated 4 years ago
- This repo provides the Kubernetes Helm chart for deploying Pyspark Notebook.☆17Nov 16, 2022Updated 3 years ago
- Learn to use the Unix command-line tools and Bash shell scripting☆27Apr 25, 2020Updated 6 years ago
- Free resources for learning data science☆22May 6, 2018Updated 8 years ago
- Command-line tool to generate Python applications and libraries☆11May 13, 2025Updated last year
- NLP Resources for Indian Languages☆10Nov 9, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Computer Science, Data Science and ML Fundamentals☆11May 30, 2025Updated 11 months ago
- ☆118Sep 21, 2020Updated 5 years ago
- Code that goes along with https://humansofdata.atlan.com/2018/06/apache-airflow-disease-outbreaks-india/☆23Jun 30, 2023Updated 2 years ago
- Code repo for Packt course I developed, "Beginning Data Wrangling with Python"☆32May 21, 2020Updated 5 years ago
- Deep Learning Specialization course by IIT Roorkee (Using python, numpy, pandas, sklearn,TensorFlow 2)☆26Apr 12, 2024Updated 2 years ago
- Overview of Bayesian Deep Learning☆11Apr 24, 2019Updated 7 years ago
- Build an RNN in Keras used for predicting stock prices.☆10May 8, 2018Updated 8 years ago