shanyu/hadooplogparser

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shanyu/hadooplogparser)

shanyu / hadooplogparser

Hadoop Yarn aggregated log parser utility

☆23

Alternatives and similar repositories for hadooplogparser

Users that are interested in hadooplogparser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

eljefe6a / HBaseREST
View on GitHub
Sample Python code for working with the HBase REST interface
☆24Jul 25, 2013Updated 12 years ago
jdye64 / docker-hwx
View on GitHub
Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components
☆10Oct 11, 2019Updated 6 years ago
broxtronix / spark-gce
View on GitHub
A tool for running Spark on Google Compute Engine
☆16Jan 20, 2017Updated 9 years ago
linyiqun / open-source-patch
View on GitHub
项目中保留了向开源社区提交过的patch
☆16Oct 22, 2017Updated 8 years ago
CoxAutomotiveDataSolutions / spark-distcp
View on GitHub
A re-implementation of Hadoop DistCP in Apache Spark
☆47Dec 20, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
linkedin / Avro2TF
View on GitHub
Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks.
☆129May 9, 2020Updated 6 years ago
jparkie / Spark2Elasticsearch
View on GitHub
Spark Library for Bulk Loading into Elasticsearch
☆12Apr 25, 2016Updated 10 years ago
rayokota / stream-processing-kickstarter
View on GitHub
A comparison of stream-processing frameworks with Kafka integration
☆10Nov 30, 2018Updated 7 years ago
druid-io / druid-benchmark
View on GitHub
Druid Benchmark
☆20Jun 30, 2017Updated 9 years ago
paypal / NNAnalytics
View on GitHub
NameNodeAnalytics is a self-help utility for scouting and maintaining the namespace of an HDFS instance.
☆121Nov 25, 2025Updated 7 months ago
lukasz-antoniak / kafka
View on GitHub
Mirror of Apache Kafka without ZooKeeper dependency
☆12Feb 4, 2019Updated 7 years ago
CASP-Systems-BU / Gadget
View on GitHub
A Benchmark Harness for Systematic and Robust Evaluation of Streaming State Stores
☆17Apr 24, 2024Updated 2 years ago
getindata / kedro-airflow-k8s
View on GitHub
Kedro Plugin to support running pipelines on Kubernetes using Airflow.
☆27Mar 11, 2025Updated last year
hpgrahsl / wearedevs-2018
View on GitHub
Code for my talk "Stateful & Reactive Streaming Applications Without a Database" at WeAreDevelopers 2018
☆11May 20, 2018Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
pulumi / pulumi-kafka
View on GitHub
A Kafka Pulumi resource package, providing multi-language access to Kafka
☆18Updated this week
Azure / azure-data-lake-store-java
View on GitHub
Microsoft Azure Data Lake Store Filesystem Library for Java
☆21Mar 1, 2026Updated 4 months ago
streamthoughts / kafka-connect-transform-grok
View on GitHub
Grok Expression Transform for Kafka Connect.
☆16Jun 26, 2026Updated 3 weeks ago
farmdawgnation / kafka-hawk
View on GitHub
An application that records stats about consumer group offset commits and reports them as prometheus metrics
☆14Apr 27, 2019Updated 7 years ago
iheanyi / simple-canary
View on GitHub
Simple Canary Testing Framework
☆18Sep 28, 2018Updated 7 years ago
ysc / baby-typing-game
View on GitHub
适合2到6岁的宝宝打字游戏
☆10May 29, 2020Updated 6 years ago
manifoldco / healthz
View on GitHub
Easily add health checks to your go services
☆23Apr 1, 2026Updated 3 months ago
haskell-works / hw-kafka-avro
View on GitHub
SchemaRegistry bindings with Avro scheme to use with kafka-client
☆16Aug 6, 2025Updated 11 months ago
siddhi-io / siddhi-io-kafka
View on GitHub
Extension that can be used to receive events from a Kafka cluster and to publish events to a Kafka cluster
☆18May 13, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
jpzk / kafcache
View on GitHub
Kafka Streams + Memcached (e.g. AWS ElasticCache) for low-latency in-memory lookups
☆13Nov 4, 2019Updated 6 years ago
INFINITE-TECHNOLOGY / COBOL
View on GitHub
Groovy Cobol Transpiler, Runtime environment and API
☆18Nov 21, 2024Updated last year
Spratiher9 / JumpSpark
View on GitHub
JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.
☆10May 12, 2023Updated 3 years ago
eto-ai / spark-video
View on GitHub
Processing videos on Apache Spark
☆13Feb 14, 2022Updated 4 years ago
radanalyticsio / streaming-amqp
View on GitHub
AMQP data source for dstream (Spark Streaming)
☆26Mar 31, 2022Updated 4 years ago
shafiquejamal / kafka-zookeeper-kerberos
View on GitHub
Instructions for setting up Kerberos, Zookeeper, and Kafka with SASL
☆16Jan 22, 2018Updated 8 years ago
mapbox / xt
View on GitHub
Automatically convert a stream of tile coordinates to another format
☆13Jun 29, 2026Updated 3 weeks ago
allegro / camus-compressor
View on GitHub
Camus Compressor merges files created by Camus and saves them in a compressed format.
☆13Mar 20, 2023Updated 3 years ago
ZuInnoTe / spark-hadoopcryptoledger-ds
View on GitHub
A Spark datasource for the HadoopCryptoLedger library
☆13Sep 29, 2025Updated 9 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
cloudcheflabs / dataroaster
View on GitHub
☆42May 16, 2023Updated 3 years ago
cloud-bulldozer / scale-ci-pipeline
View on GitHub
Automation to install, configure, scale test OpenShift and onboard new workloads
☆17Oct 4, 2024Updated last year
dangoldin / poor-mans-data-pipeline
View on GitHub
A minimal and serverless data pipeline
☆13Feb 22, 2022Updated 4 years ago
adobe / koperator
View on GitHub
Oh no! Yet another Kafka operator for Kubernetes
☆21Updated this week
tresata / spark-kafka
View on GitHub
Low level integration of Spark and Kafka
☆129Mar 15, 2018Updated 8 years ago
sqlanywhere / sqlalchemy-sqlany
View on GitHub
SQLAlchemy driver for SAP Sybase SQL Anywhere
☆12Mar 9, 2023Updated 3 years ago
icolbert / upsampling
View on GitHub
Algorithmic solutions to optimize inference for convolution-based image upsampling. Coded for clarity, not speed.
☆10Aug 26, 2022Updated 3 years ago