nathanmarz/dfs-datastores

Dead-simple vertical partitioning, compression, appends, and consolidation of data on a distributed filesystem.

☆215

Alternatives and similar repositories for dfs-datastores

Users interested in dfs-datastores are comparing it to the libraries listed below.

jameswarren / big-data-src
example source code for Big Data book (www.manning.com/marz)
☆20 · Nov 23, 2013 · Updated 12 years ago
nathanmarz / elephantdb
Distributed database specialized in exporting key/value data from Hadoop
☆558 · Jun 27, 2014 · Updated 12 years ago
Cascading / cascading-thrift
Serializer and comparator for using Thrift objects in Cascading or Cascalog
☆17 · Dec 31, 2014 · Updated 11 years ago
nathanmarz / elephantdb-cascalog
Seamless integration of ElephantDB with Cascalog
☆18 · Jan 3, 2012 · Updated 14 years ago
sritchie / midje-cascalog
Cascalog functions for Midje.
☆20 · Feb 16, 2013 · Updated 13 years ago
nathanmarz / cascalog
Data processing on Hadoop without the hassle.
☆1,373 · May 18, 2023 · Updated 3 years ago
revelytix / carbonite
Clojure library for serializing Clojure data using Kryo
☆59 · Jan 21, 2012 · Updated 14 years ago
sorenmacbeth / hbase-cascalog
A very simple wrapper around cascading.hbase for use in Cascalog
☆19 · Jan 16, 2017 · Updated 9 years ago
nathanmarz / cascalog-workshop
Materials for Cascalog workshop
☆18 · Sep 17, 2011 · Updated 14 years ago
nathanmarz / cascalog-contrib
☆45 · Feb 16, 2013 · Updated 13 years ago
miguno / replephant
A Clojure library to interactively analyze Hadoop cluster usage via REPL
☆34 · Dec 20, 2013 · Updated 12 years ago
Cascading / maple
All the Cascading taps you need and love.
☆39 · Mar 4, 2019 · Updated 7 years ago
pallet / pallet-hadoop
Hadoop Cluster Management with Intelligent Defaults
☆41 · Apr 18, 2014 · Updated 12 years ago
LiveRamp / cascading_ext
cascading_ext is a collection of tools built on top of the Cascading platform which make it easy to build, debug, and run simple and high…
☆58 · Feb 25, 2026 · Updated 4 months ago
sritchie / jackknife
Useful Clojure utilities!
☆16 · Oct 4, 2015 · Updated 10 years ago
jeroenvandijk / cascalog-graph
Graph implementation for Cascalog
☆26 · May 11, 2014 · Updated 12 years ago
schleyfox / storm-test
Testing utilities for storm
☆40 · Jan 5, 2012 · Updated 14 years ago
racehub / forms-bootstrap
Utility for creating web forms in Clojure using Twitter's Bootstrap CSS.
☆20 · Mar 23, 2015 · Updated 11 years ago
nathanmarz / cascalog-demo
A short Cascalog program that produces a simplified version of a Facebook-like news feed.
☆26 · Aug 25, 2015 · Updated 10 years ago
pallet / pallet
Automates controlling and provisioning cloud server instances. DevOps for the JVM.
☆807 · May 25, 2018 · Updated 8 years ago
damballa / parkour
Hadoop MapReduce in idiomatic Clojure.
☆255 · Mar 23, 2016 · Updated 10 years ago
coventry / troncle
Speed up repl debugging with tracing convenience functions
☆48 · Jan 12, 2014 · Updated 12 years ago
sritchie / carbonite
Clojure library for serializing Clojure data using Kryo
☆30 · Feb 12, 2016 · Updated 10 years ago
sorenmacbeth / marceline
A Clojure DSL for Storm/Trident
☆177 · Apr 3, 2017 · Updated 9 years ago
twitter / summingbird
Streaming MapReduce with Scalding and Storm
☆2,123 · Jan 19, 2022 · Updated 4 years ago
datasalt / splout-db
A web-latency SQL spout for Hadoop.
☆51 · Jan 25, 2021 · Updated 5 years ago
mmcgrana / clj-redis
Clojure Redis client library
☆79 · Dec 13, 2011 · Updated 14 years ago
razvan / kafka-s3-consumer
Store batched Kafka messages in S3.
☆39 · Apr 13, 2022 · Updated 4 years ago
abedra / accession
☆42 · Dec 5, 2018 · Updated 7 years ago
davidsantiago / clojure-hbase
A simple library for accessing HBase conveniently from Clojure.
☆68 · Jul 15, 2014 · Updated 12 years ago
nathanmarz / storm-contrib
A collection of spouts, bolts, serializers, DSLs, and other goodies to use with Storm
☆579 · Aug 23, 2022 · Updated 3 years ago
nathanmarz / storm-mesos
Run Storm on top of the Mesos cluster resource manager
☆68 · Jul 14, 2014 · Updated 12 years ago
ParallelAI / SpyGlass
Cascading and Scalding wrapper for HBase with advanced read features
☆54 · Feb 11, 2020 · Updated 6 years ago
sorenmacbeth / storm-redis-pubsub
A Redis PubSub Spout for Storm
☆37 · Feb 29, 2012 · Updated 14 years ago
rapportive-oss / storm-amqp-spout
Allows a Storm topology to consume an AMQP exchange as an input source.
☆55 · Oct 3, 2012 · Updated 13 years ago
twitter / elephant-bird
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
☆1,134 · Apr 10, 2023 · Updated 3 years ago
Cascading / cascading
All development now happens over here: https://github.com/cwensel/cascading. Cascading is a feature rich API for defining and executing c…
☆332 · Nov 29, 2018 · Updated 7 years ago
liebke / avout
Avout: Distributed State in Clojure
☆425 · Aug 29, 2019 · Updated 6 years ago
alexott / clojure-hadoop
Library to aid writing Hadoop jobs in Clojure.
☆98 · Nov 21, 2013 · Updated 12 years ago