drelu/webhdfs-py

Python Client for WebHDFS REST API

☆43

Alternatives and similar repositories for webhdfs-py

Users that are interested in webhdfs-py are comparing it to the libraries listed below.

sematext / jmxc
Simple JMX Console
☆17 · Dec 8, 2012 · Updated 13 years ago
wilbur / Piggybank
A repository of user-defined functions for Apache Pig
☆16 · Sep 20, 2010 · Updated 15 years ago
ProjectMeniscus / pywebhdfs
Python wrapper for the hadoop WebHDFS Rest API
☆32 · Apr 11, 2015 · Updated 11 years ago
alienrobotwizard / varaha
Machine learning and natural language processing with Apache Pig
☆53 · Dec 17, 2013 · Updated 12 years ago
trovit / hdfstree
A command line tool to display HDFS directories as a tree.
☆16 · Sep 3, 2013 · Updated 12 years ago
sriksun / Ivory
Data Management + Feed Processing Platform over Hadoop
☆27 · May 8, 2013 · Updated 13 years ago
corbt / pypeline
A tool for managing data processing in Python
☆23 · May 9, 2014 · Updated 12 years ago
tims / lasthbase
things last.fm uses with hbase
☆28 · Oct 28, 2011 · Updated 14 years ago
ThinkBigAnalytics / scalding-workshop
A half-day workshop on Scalding, the Scala API for Cascading
☆48 · Mar 21, 2016 · Updated 10 years ago
DonDebonair / flume-plugins
Some extensions to Flume to help with collecting logs and storing as Avro.
☆17 · Feb 22, 2014 · Updated 12 years ago
divolte / divolte-kafka-consumer
Helper for consuming Divolte events from Kafka queues and deserializing Avro records into Java objects using Avro's generated code.
☆15 · Nov 6, 2014 · Updated 11 years ago
hougs / scala-dataflow-dsl
A scala dsl for dataflow
☆11 · Dec 31, 2014 · Updated 11 years ago
CyberAgent / patriot-workflow-scheduler
☆24 · Jun 14, 2019 · Updated 7 years ago
jkleint / ansible-hadoop
THIS REPOSITORY IS VERY OUTDATED. See Ansible Galaxy instead.
☆28 · Oct 23, 2018 · Updated 7 years ago
livingsocial / HiveSwarm
Helpful user-defined functions / table-generating functions for Hive
☆102 · May 2, 2016 · Updated 10 years ago
klbostee / feathers
Java classes that can be useful for Dumbo programs that run on Hadoop Streaming.
☆26 · May 20, 2012 · Updated 14 years ago
tzolov / zeppelin-ambari-plugin
Apache Zeppelin Service for Apache Ambari Service. Installation and management of Zeppelin via Ambari.
☆14 · Jan 23, 2016 · Updated 10 years ago
LinkedInAttic / datafu
Hadoop library for large-scale data processing, now an Apache Incubator project
☆581 · Jul 8, 2014 · Updated 12 years ago
tdunning / pig-vector
Mahout vector encoding for pig
☆53 · Nov 20, 2022 · Updated 3 years ago
tmalaska / Spark.TableStatsExample
Simple Spark example of generating table stats for use of data quality checks
☆27 · Apr 28, 2017 · Updated 9 years ago
kevinweil / pig.tmbundle
Simple syntax highlighting for writing Pig scripts (http://hadoop.apache.org/pig) in Textmate.
☆35 · May 2, 2013 · Updated 13 years ago
mhausenblas / operator-101
A step-by-step walkthrough of bootstrapping a Kubernetes operator
☆21 · Nov 16, 2018 · Updated 7 years ago
laserson / impyla-old
OLD - impyla now developed at `cloudera/impyla`
☆23 · Apr 16, 2014 · Updated 12 years ago
rjurney / enron-python-flask-cassandra-pig
Hortonworks demo of Enron emails with Pig, Cassandra, Python and Flask
☆17 · Oct 1, 2012 · Updated 13 years ago
tomslabs / avro-utils
Utilities to use Avro files from Hadoop Map/Reduce jobs and Streaming
☆26 · Sep 10, 2013 · Updated 12 years ago
bwhite / hadoopy
Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials.
☆243 · Jan 8, 2016 · Updated 10 years ago
dgleich / matrix-hadoop-tutorial
A set of tutorial codes about matrix methods in Hadoop
☆32 · Apr 10, 2013 · Updated 13 years ago
mmay / PigJsonLoader
A Load UDF for loading JSON files with Pig
☆15 · Jul 6, 2011 · Updated 15 years ago
chimpler / hive-solr
Hive Storage Handler for SOLR
☆16 · Mar 17, 2014 · Updated 12 years ago
klbostee / ctypedbytes
A fast Python module for dealing with so-called "typed bytes".
☆15 · Mar 24, 2015 · Updated 11 years ago
pauldeschacht / impala-java-client
Java client to connect directly to Impala using thrift
☆31 · Apr 12, 2017 · Updated 9 years ago
Cascading / cascading-hive
Integration for Cascading and Apache Hive
☆25 · Oct 31, 2017 · Updated 8 years ago
eljefe6a / UnoExample
MapReduce/Hadoop example that uses regular playing cards to show mapping and reducing.
☆40 · Jun 13, 2014 · Updated 12 years ago
jatrost / hadoop-binary-analysis
Framework that makes processing arbitrary binary data in Hadoop easier
☆22 · Apr 8, 2013 · Updated 13 years ago
prb / bigbird
An attempt to create a Twitter-like beastie backed by big data storage.
☆29 · Oct 4, 2022 · Updated 3 years ago
snowplow-archive / kinesis-example-scala-consumer
Example Scala/SBT event consumer for Amazon Kinesis
☆22 · May 20, 2015 · Updated 11 years ago
julienledem / Pig-scripting-examples
Examples of use of pig scripting languages capabilities
☆39 · Aug 1, 2016 · Updated 9 years ago
msukmanowsky / omniture-data-tools
A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.
☆37 · May 14, 2019 · Updated 7 years ago
jaredwinick / Trendulo
Trending on Accumulo
☆40 · Oct 3, 2012 · Updated 13 years ago