RHadoop
☆762Nov 24, 2015Updated 10 years ago
Alternatives and similar repositories for RHadoop
Users that are interested in RHadoop are comparing it to the libraries listed below
Sorting:
- A package that allows R developers to use Hadoop HDFS☆64Mar 7, 2018Updated 7 years ago
- A package that allows R developer to use Hadoop MapReduce☆158Jul 21, 2020Updated 5 years ago
- ☆38Mar 25, 2015Updated 10 years ago
- A package that allows R developers to use Hadoop HBase☆48Jul 9, 2014Updated 11 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project☆582Jul 8, 2014Updated 11 years ago
- RHive is an R extension facilitating distributed computing via Apache Hive.☆123Jul 19, 2017Updated 8 years ago
- R frontend for Spark☆642Jun 10, 2016Updated 9 years ago
- Crux is a reporting application for HBase. Crux provides a simple web based graphical interface to access HBase, query data and create re…☆100Apr 9, 2013Updated 12 years ago
- R and Hadoop Integrated Programming Environment☆58Dec 18, 2023Updated 2 years ago
- Examples of use of pig scripting languages capabilities☆39Aug 1, 2016Updated 9 years ago
- Mahout vector encoding for pig☆53Nov 20, 2022Updated 3 years ago
- Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a …☆50Jul 4, 2011Updated 14 years ago
- Log processing system using Flume and Cassandra☆75Mar 4, 2011Updated 14 years ago
- GoldenOrb is an open-source implementation of Pregel, Google's graph processing framework☆294Jun 29, 2022Updated 3 years ago
- A web server interface for the R language☆52Jan 12, 2012Updated 14 years ago
- SQL Windowing Functions for Hadoop☆65Jun 20, 2022Updated 3 years ago
- A nicer deparse☆12Jun 22, 2017Updated 8 years ago
- Lightning-fast cluster computing in Java, Scala and Python.☆1,425Apr 8, 2014Updated 11 years ago
- R driver for MongoDB☆82Jan 7, 2013Updated 13 years ago
- Revolution R Open☆86Jul 22, 2016Updated 9 years ago
- spark backend for dplyr☆48Dec 30, 2015Updated 10 years ago
- Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more☆8,792Aug 16, 2017Updated 8 years ago
- NEW: see http://www.hops.io/. OLD: This work aims to re-engineer the Hadoop Distributed File System (HDFS) so that it can be 1) highly av…☆26Jan 2, 2012Updated 14 years ago
- Statistical analysis of tweets from members of the U.S. Congress☆24Sep 13, 2011Updated 14 years ago
- ZIA Code Repository☆97Aug 20, 2013Updated 12 years ago
- Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.☆353Apr 8, 2025Updated 10 months ago
- Hive + Avro. Serde for working with Avro in Hive☆59Dec 16, 2023Updated 2 years ago
- WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for effici…☆944May 26, 2021Updated 4 years ago
- Text clustering service for the web☆25Mar 30, 2019Updated 6 years ago
- Oozie - workflow engine for Hadoop☆374Jun 8, 2017Updated 8 years ago
- Turn KML Files into tidy data frames:☆12Jan 1, 2017Updated 9 years ago
- SnakeCharmR - R and Python Integration☆17Jan 2, 2020Updated 6 years ago
- Where 2.0 Workshop Code: Spatial Analysis of Tweets using Hadoop, Pig, Python & Mechanical Turk. Slides here: http://www.slideshare.net/…☆134Mar 31, 2010Updated 15 years ago
- A tutorial on R and Hadoop, using the RHadoop project☆41Aug 15, 2015Updated 10 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Sep 8, 2016Updated 9 years ago
- Python module that allows one to easily write and run Hadoop programs.☆1,032Jan 9, 2018Updated 8 years ago
- A web front end for R Twitter sentiment analysis☆18Sep 2, 2011Updated 14 years ago
- Load Avro data into Spark with sparklyr☆12Jun 4, 2020Updated 5 years ago
- Reusable code for Hive☆16Aug 19, 2014Updated 11 years ago