hopshadoop / hopsView external linksLinks
Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.
☆320Jan 22, 2026Updated 3 weeks ago
Alternatives and similar repositories for hops
Users that are interested in hops are comparing it to the libraries listed below
Sorting:
- HopsWorks - Hadoop for Humans☆117Apr 25, 2019Updated 6 years ago
- Python - Java/Scala API for the Hopsworks feature store☆55Sep 24, 2025Updated 4 months ago
- A tool for scale and performance testing of HDFS with a specific focus on the NameNode.☆134Jan 11, 2024Updated 2 years ago
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,281Feb 10, 2025Updated last year
- Chef Cookbook for Hopsworks☆11May 4, 2025Updated 9 months ago
- Reproducing Distributed Systems and Experiments on Cloud☆39Sep 11, 2023Updated 2 years ago
- Distribution transparent Machine Learning experiments on Apache Spark☆91Feb 21, 2024Updated last year
- Python SDK to interact with the Hopsworks API☆14Updated this week
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆303Oct 30, 2025Updated 3 months ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆285Nov 26, 2025Updated 2 months ago
- Mirror of Apache crail (Incubating)☆151Jul 3, 2022Updated 3 years ago
- This is RonDB, a distribution of NDB Cluster developed and used by Hopsworks AB. It also contains development branches of RonDB.☆701Updated this week
- Quantcast File System☆648Jan 1, 2026Updated last month
- Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution☆140Jan 3, 2023Updated 3 years ago
- Mesos Integration Tests on Docker/Ec2☆15May 25, 2023Updated 2 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆91Mar 5, 2024Updated last year
- An HDFS backed ContentsManager implementation for Jupyter☆12Apr 8, 2024Updated last year
- Alluxio, data orchestration for analytics and machine learning in the cloud☆7,157Apr 29, 2025Updated 9 months ago
- Pravega - Streaming as a new software defined storage primitive☆2,006Mar 2, 2025Updated 11 months ago
- Scalable, reliable, distributed storage system optimized for data analytics and object store workloads.☆1,157Updated this week
- Storage Engine for block and key/value stores.☆25Feb 7, 2026Updated last week
- A tool and library for easily deploying applications on Apache YARN☆146Mar 12, 2024Updated last year
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆118Jan 28, 2026Updated 2 weeks ago
- Flink Examples☆38Apr 27, 2016Updated 9 years ago
- Cache File System optimized for columnar formats and object stores☆187Aug 11, 2022Updated 3 years ago
- High performance data store solution☆1,446Jan 21, 2026Updated 3 weeks ago
- Apache YuniKorn Core☆1,002Feb 4, 2026Updated last week
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,575Feb 10, 2026Updated last week
- A simplified, lightweight ETL Framework based on Apache Spark☆589Jan 24, 2024Updated 2 years ago
- Jupyter Hub Support in VS Code☆15Feb 7, 2026Updated last week
- HBase Indexer - indexing HBase to Solr 5.x and higher☆13Oct 27, 2017Updated 8 years ago
- Spark SQL index for Parquet tables☆134May 6, 2021Updated 4 years ago
- Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an…☆55May 9, 2017Updated 8 years ago
- TonY is a framework to natively run deep learning frameworks on Apache Hadoop.☆713Oct 14, 2023Updated 2 years ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆886Feb 9, 2026Updated last week
- Cray Lustre is HPE's curated Lustre distro for HPE ClusterStor, Cray EX, and other HPE/Cray clients☆17Updated this week
- An application that records stats about consumer group offset commits and reports them as prometheus metrics☆14Apr 27, 2019Updated 6 years ago
- GeoTrellis PointCloud library to work with any pointcloud data on Spark☆27Oct 5, 2020Updated 5 years ago
- Utility Library for Hopsworks. Issues can be posted at https://community.hopsworks.ai☆27Feb 3, 2026Updated last week