hopshadoop/hops

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hopshadoop/hops)

hopshadoop / hops

Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.

☆323

Alternatives and similar repositories for hops

Users that are interested in hops are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dc-sics / hopsworks
View on GitHub
HopsWorks - Hadoop for Humans
☆117Apr 25, 2019Updated 7 years ago
logicalclocks / feature-store-api
View on GitHub
Python - Java/Scala API for the Hopsworks feature store
☆55Sep 24, 2025Updated 10 months ago
logicalclocks / hopsworks
View on GitHub
Hopsworks - Data-Intensive AI platform with a Feature Store
☆1,302Feb 10, 2025Updated last year
logicalclocks / hopsworks-chef
View on GitHub
Chef Cookbook for Hopsworks
☆11May 4, 2025Updated last year
linkedin / dynamometer
View on GitHub
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
☆135Jan 11, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
karamelchef / karamel
View on GitHub
Reproducing Distributed Systems and Experiments on Cloud
☆41Sep 11, 2023Updated 2 years ago
logicalclocks / hopsworks-api
View on GitHub
Python SDK to interact with the Hopsworks API
☆15Updated this week
logicalclocks / rondb
View on GitHub
This is RonDB, a distribution of NDB Cluster developed and used by Hopsworks AB. It also contains development branches of RonDB.
☆716Updated this week
logicalclocks / maggy
View on GitHub
Distribution transparent Machine Learning experiments on Apache Spark
☆91Feb 21, 2024Updated 2 years ago
logicalclocks / hops-examples
View on GitHub
Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
☆116Jan 28, 2026Updated 6 months ago
logicalclocks / hops-tensorflow
View on GitHub
HopsYARN Tensorflow Framework.
☆32Oct 22, 2019Updated 6 years ago
logicalclocks / hops-util-py
View on GitHub
Utility Library for Hopsworks. Issues can be posted at https://community.hopsworks.ai
☆27Feb 3, 2026Updated 5 months ago
ExpediaGroup / waggle-dance
View on GitHub
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
☆288Jun 25, 2026Updated last month
MileanCo / angular-material-meteor-dashboard
View on GitHub
Angular Material Meteor Dashboard template
☆14Oct 14, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Intel-bigdata / SSM
View on GitHub
Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution
☆139Jan 3, 2023Updated 3 years ago
apache / ozone
View on GitHub
Scalable, reliable, distributed storage system optimized for data analytics and object store workloads.
☆1,243Updated this week
linkedin / transport
View on GitHub
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…
☆306Updated this week
Alluxio / alluxio
View on GitHub
Alluxio, data orchestration for analytics and machine learning in the cloud
☆7,215Apr 29, 2025Updated last year
quantcast / qfs
View on GitHub
Quantcast File System
☆648Jul 1, 2026Updated 3 weeks ago
apache / incubator-crail
View on GitHub
Mirror of Apache crail (Incubating)
☆152Jul 3, 2022Updated 4 years ago
miaogecm / FlatFS
View on GitHub
☆23Feb 16, 2023Updated 3 years ago
pravega / pravega
View on GitHub
Pravega - Streaming as a new software defined storage primitive
☆1,998Mar 2, 2025Updated last year
lightbend / mesos-spark-integration-tests
View on GitHub
Mesos Integration Tests on Docker/Ec2
☆15May 25, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
qubole / rubix
View on GitHub
Cache File System optimized for columnar formats and object stores
☆188Aug 11, 2022Updated 3 years ago
SymbioticLab / Fluid
View on GitHub
A Generic Resource-Aware Hyperparameter Tuning Execution Engine
☆15Jan 8, 2022Updated 4 years ago
apache / ratis
View on GitHub
Open source Java implementation for Raft consensus protocol.
☆1,467Updated this week
flexgp / efs
View on GitHub
Evolutionary feature synthesis
☆18Oct 12, 2015Updated 10 years ago
bytedance / nnproxy
View on GitHub
Scalable NameNode RPC Proxy for HDFS Federation
☆89Apr 19, 2016Updated 10 years ago
ExpediaGroup / circus-train
View on GitHub
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
☆93Mar 5, 2024Updated 2 years ago
cerndb / hdfs-metadata
View on GitHub
Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an…
☆55May 9, 2017Updated 9 years ago
lightcopy / parquet-index
View on GitHub
Spark SQL index for Parquet tables
☆134May 6, 2021Updated 5 years ago
tony-framework / TonY
View on GitHub
TonY is a framework to natively run deep learning frameworks on Apache Hadoop.
☆707Oct 14, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Skopos-team / Skopos
View on GitHub
Easy to use Deep Reinforcement Learning Library
☆17Jan 12, 2018Updated 8 years ago
ExpediaGroup / datasqueeze
View on GitHub
Hadoop utility to compact small files
☆18Feb 16, 2026Updated 5 months ago
SWIMProjectUCB / SWIM
View on GitHub
Statistical Workload Injector for MapReduce - Project at UC Berkeley AMP Lab
☆128May 29, 2014Updated 12 years ago
ds2-lab / LambdaFS
View on GitHub
λFS: an elastic, high-performance, serverless-function-based metadata service for large-scale distributed file systems (ACM ASPLOS'23)
☆14Apr 2, 2025Updated last year
apache / kyuubi
View on GitHub
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,354Updated this week
jcrist / skein
View on GitHub
A tool and library for easily deploying applications on Apache YARN
☆145Mar 12, 2024Updated 2 years ago
apache / yunikorn-core
View on GitHub
Apache YuniKorn Core
☆1,023Updated this week