FINRAOS/herd

Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabytes of data and make it accessible for data processing and analytical purposes by any cloud compute platform.

☆140

Alternatives and similar repositories for herd

Users that are interested in herd are comparing it to the libraries listed below.

FINRAOS / herd-mdl
Herd-MDL, a turnkey managed data lake in the cloud. See https://finraos.github.io/herd-mdl/ for more information.
☆16 · Jul 17, 2024 · Updated 2 years ago
dstreev / hdp-data-gen
Hortonworks Data Platform Data Generation Tool
☆13 · Nov 30, 2017 · Updated 8 years ago
tzolov / zeppelin-ambari-plugin
Apache Zeppelin Service for Apache Ambari Service. Installation and management of Zeppelin via Ambari.
☆14 · Jan 23, 2016 · Updated 10 years ago
Aloisius / hadoop-s3a
An AWS SDK-backed FileSystem driver for Hadoop
☆63 · Oct 13, 2020 · Updated 5 years ago
miguel10 / YARN-Memory-Calculator
Hadoop YARN & MapReduce Memory Calculator
☆13 · Nov 9, 2015 · Updated 10 years ago
awslabs / aws-lambda-kinesis-prewarming
Provides canary based prewarming of lambda functions for Kinesis Event Sources.
☆15 · Oct 13, 2020 · Updated 5 years ago
Cascading / tutorials
Tutorials for Cascading, Lingual, Pattern and other projects
☆18 · Aug 30, 2016 · Updated 9 years ago
youngwookim / awesome-presto
A curated list of awesome PrestoDB / Trino software, libraries, tools and resources
☆18 · Jun 28, 2021 · Updated 5 years ago
qubole / streamx
kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)
☆96 · Apr 4, 2019 · Updated 7 years ago
jxblum / spring-boot-gemfire-server-example
An example Spring Boot application demonstrating how to configure and bootstrap a Pivotal GemFire Server in a Spring context, JVM-based p…
☆12 · May 11, 2018 · Updated 8 years ago
chainpoint / chainpoint-binary
A Javascript library for converting between Chainpoint JSON-LD and binary proof formats
☆11 · Apr 20, 2022 · Updated 4 years ago
vigsterkr / marathonspawner
Spawns JupyterHub single user servers in Marathon
☆10 · Oct 8, 2017 · Updated 8 years ago
AbsaOSS / spark-hofs
Scala API for Apache Spark SQL high-order functions
☆15 · Aug 4, 2023 · Updated 2 years ago
randerzander / r-service
Ambari Service definition for deploying R & RHadoop libraries
☆18 · Aug 3, 2015 · Updated 10 years ago
randerzander / docker-ambari
Dockerfile and artifacts for running a self-contained HDP 2.3 "cluster" in a docker container
☆10 · Aug 30, 2016 · Updated 9 years ago
seanorama / ansible-ambari
Quickly deploy Hadoop with the help of Ansible and Apache Ambari
☆38 · Jul 15, 2015 · Updated 11 years ago
rbalamohan / tez-autobuild
A Tez dev-setup for HDP2 sandbox
☆21 · Mar 2, 2023 · Updated 3 years ago
viirya / grafana-presto
grafana with presto support
☆25 · Aug 11, 2019 · Updated 6 years ago
ianoc / SparkEMRBootstrap
Files to help make new spark EMR Bootstraps
☆15 · Aug 4, 2013 · Updated 12 years ago
codecentric / elasticsearch-shield-kerberos-realm
Kerberos/SPNEGO custom realm for Elasticsearch Shield 2.0
☆16 · Jan 19, 2018 · Updated 8 years ago
Netflix / inviso
☆205 · May 23, 2023 · Updated 3 years ago
tgrall / drill-workshop
Apache Drill Workshop
☆19 · Apr 4, 2016 · Updated 10 years ago
full360 / glue-sneaql-demo
☆12 · Mar 31, 2021 · Updated 5 years ago
VividCortex / lastseen
Last-seen sketch implementation in Go
☆16 · Dec 15, 2020 · Updated 5 years ago
FINRAOS / MLiy
MLiy (pronounced “Emily”) is a machine-learning platform that allows data scientists to provision and manage processing power in the clou…
☆11 · May 22, 2023 · Updated 3 years ago
ekesken / docker-rabbitmq
docker image to deploy rabbitmq cluster on mesos with one marathon app
☆10 · Oct 12, 2017 · Updated 8 years ago
ZEPL / z-manager
Simplify getting Zeppelin up and running
☆56 · Jul 20, 2016 · Updated 10 years ago
vivint-smarthome / ceph-on-mesos
Ceph on Mesos
☆20 · Apr 8, 2017 · Updated 9 years ago
jeoffreylim / maelstrom
Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …
☆21 · Feb 6, 2017 · Updated 9 years ago
dvergari / ambari-drill-service
Ambari service for Apache Drill
☆17 · Apr 15, 2016 · Updated 10 years ago
bluecolor / octopus
Open source task scheduler with dependency management
☆15 · Jul 1, 2018 · Updated 8 years ago
brightcove-archive / ooyala_scamr
A Hadoop map reduce framework for Scala.
☆15 · Apr 21, 2016 · Updated 10 years ago
cloudera-labs / envelope
Build configuration-driven ETL pipelines on Apache Spark
☆162 · Oct 4, 2022 · Updated 3 years ago
spirom / spark-data-sources
Developing Spark External Data Sources using the V2 API
☆49 · Apr 29, 2018 · Updated 8 years ago
hortonworks / cloudbreak-deployer
Cloudbreak Deployer Tool
☆34 · Jun 29, 2023 · Updated 3 years ago
portable-scala / sbt-crossproject.g8
Giter8 template for a simple project that uses sbt-crossproject.
☆11 · Jul 16, 2018 · Updated 8 years ago
abajwa-hw / solr-stack
Ambari stack service for easily installing and managing Solr on HDP cluster
☆37 · Jan 3, 2018 · Updated 8 years ago
milinda / samza-sql
SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka
☆30 · Jun 8, 2016 · Updated 10 years ago
dharmeshkakadia / Data-Infra-Projects
List of some interesting projects
☆32 · Dec 24, 2019 · Updated 6 years ago