zenkay/bigdata-ecosystem

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zenkay/bigdata-ecosystem)

zenkay / bigdata-ecosystem

BigData Ecosystem Dataset

☆581

Alternatives and similar repositories for bigdata-ecosystem

Users that are interested in bigdata-ecosystem are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

youngwookim / awesome-hadoop
View on GitHub
A curated list of amazingly awesome Hadoop and Hadoop ecosystem resources
☆1,117May 7, 2024Updated 2 years ago
oxnr / awesome-bigdata
View on GitHub
A curated list of awesome big data frameworks, ressources and other awesomeness.
☆14,512May 19, 2026Updated 2 months ago
bahaaldine / scalable-big-data-architecture
View on GitHub
Assets used in Apress -- Scalable Big Data Architecture -- book
☆18Dec 11, 2015Updated 10 years ago
haifengl / bigdata
View on GitHub
Introduction to Big Data
☆397May 14, 2024Updated 2 years ago
jaibeermalik / searchanalytics-bigdata
View on GitHub
Customer Product search clicks analytics using big data Hadoop, Hive, Oozie, ElasticSearch, Akka, Spring Data
☆72Oct 5, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
TIBCOSoftware / snappydata
View on GitHub
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…
☆1,032Nov 21, 2022Updated 3 years ago
okulbilisim / awesome-big-o
View on GitHub
A curated list of awesome materials about Big O notation
☆108Jul 24, 2021Updated 5 years ago
manuzhang / awesome-streaming
View on GitHub
a curated list of awesome streaming frameworks, applications, etc
☆2,999Updated this week
apache / gobblin
View on GitHub
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…
☆2,269Updated this week
silky / awesome-open-science
View on GitHub
some links to projects/tools related to "open science".
☆157Nov 12, 2025Updated 8 months ago
mbonaci / spark-archetype-scala
View on GitHub
Maven archetype used to bootstrap a Spark Scala project
☆26Sep 1, 2015Updated 10 years ago
hadoopecosystemtable / hadoopecosystemtable.github.io
View on GitHub
This page is a summary to keep the track of Hadoop related projects, and relevant projects around Big Data scene focused on the open sour…
☆688Mar 4, 2021Updated 5 years ago
kmonsoor / data-must-watch
View on GitHub
Must-watch videos on data-science
☆47Nov 30, 2015Updated 10 years ago
OryxProject / oryx
View on GitHub
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
☆1,783Aug 16, 2021Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
oxnr / awesome-learning
View on GitHub
learning related projects
☆17Jan 26, 2015Updated 11 years ago
apache / incubator-heron
View on GitHub
Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter
☆3,629Mar 1, 2023Updated 3 years ago
igorbarinov / awesome-data-engineering
View on GitHub
A curated list of data engineering tools for software developers
☆8,902Jul 18, 2026Updated last week
apache / predictionio
View on GitHub
PredictionIO, a machine learning server for developers and ML engineers.
☆12,520Jan 9, 2021Updated 5 years ago
ibm-watson-data-lab / spark.samples
View on GitHub
tutorials and samples that show you how get the most out of IBM Analytics for Apache Spark
☆78Mar 16, 2018Updated 8 years ago
spark-notebook / spark-notebook
View on GitHub
Interactive and Reactive Data Science using Scala and Spark.
☆3,142May 16, 2023Updated 3 years ago
dcenergy / rflot
View on GitHub
[R] charting package using the JS Flot charting library
☆11Feb 12, 2015Updated 11 years ago
numetriclabz / awesome-db
View on GitHub
A curated list of amazingly awesome database libraries, resources and shiny things by https://www.numetriclabz.com/
☆1,375Mar 4, 2024Updated 2 years ago
filodb / FiloDB
View on GitHub
Distributed Prometheus time series database
☆1,468Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
felixcheung / spark-notebook-examples
View on GitHub
Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin
☆51May 13, 2016Updated 10 years ago
ironfish / presentations
View on GitHub
Presentations about Reactive Architecture Design
☆20Feb 13, 2019Updated 7 years ago
lauris / awesome-scala
View on GitHub
A community driven list of useful Scala libraries, frameworks and software.
☆9,226Sep 20, 2024Updated last year
Leemoonsoo / zeppelin-examples
View on GitHub
Zeppelin notebook examples
☆25Feb 18, 2016Updated 10 years ago
academic / awesome-datascience
View on GitHub
An awesome Data Science repository to learn and apply for real world problems.
☆29,701Updated this week
keeganhines / vivagRaph
View on GitHub
☆12Aug 29, 2015Updated 10 years ago
pedronveloso / awesome-android-release-notes
View on GitHub
Awesome Android Release Notes is a useful directory for a developer to keep up-to-date with all the things related with Android software …
☆21Jan 11, 2017Updated 9 years ago
Tapad / scaerospike
View on GitHub
Scala non-blocking Aerospike client (archived as unmaintained)
☆20Jan 25, 2019Updated 7 years ago
iNiKe / awesome-blockchain
View on GitHub
Awesome of Blockchain, ICO, ₿itcoin, Cryptocurrencies
☆24Jan 3, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
jadianes / spark-py-notebooks
View on GitHub
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
☆1,659Mar 16, 2024Updated 2 years ago
Alluxio / alluxio
View on GitHub
Alluxio, data orchestration for analytics and machine learning in the cloud
☆7,214Apr 29, 2025Updated last year
pachyderm / pachyderm
View on GitHub
Data-Centric Pipelines and Data Versioning
☆6,299Feb 3, 2025Updated last year
apache / zeppelin
View on GitHub
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
☆6,646Updated this week
hortonworks-gallery / zeppelin-notebooks
View on GitHub
Gallery of Apache Zeppelin notebooks
☆216Jun 19, 2019Updated 7 years ago
spark-jobserver / spark-jobserver
View on GitHub
REST job server for Apache Spark
☆2,836Mar 3, 2026Updated 4 months ago
daviddiazvico / scikit-datasets
View on GitHub
Scikit-learn-compatible datasets
☆16Jul 11, 2026Updated 2 weeks ago