Optimized joins using bloom filters on Hadoop via Cascading.
☆23Sep 25, 2009Updated 16 years ago
Alternatives and similar repositories for cascading-batch-query
Users who are interested in cascading-batch-query are comparing it to the libraries listed below.
- Open source framework for predictive modeling on Apache Hadoop☆34Aug 23, 2014Updated 11 years ago
- A demo project comparing two web scraping frameworks, Playwright and Selenium, using the new pipelining tool Dagster☆15Sep 9, 2021Updated 4 years ago
- A modern web-based admin interface for managing SpacetimeDB applications. It provides real-time database management with an intuitive UI …☆22Jun 23, 2025Updated 8 months ago
- A utility to manage HTTP requests from APL☆12Feb 11, 2026Updated 2 weeks ago
- A filter cascade implementation in Rust☆15Apr 5, 2023Updated 2 years ago
- Talks at the <Programming> 2022 Conference in Porto, Portugal☆11Mar 30, 2022Updated 3 years ago
- Reagent interface to the Mafs interactive 2d math visualization library.☆15Jun 1, 2024Updated last year
- Webpack loader that does nothing☆10Jul 30, 2015Updated 10 years ago
- A static single page app to allow easy use of books from librivox.org☆12Updated this week
- ☆11Apr 19, 2018Updated 7 years ago
- Stacky ☆ BEAM stack trace in Gleam: a stack trace of stack frames with module name, function name, arity, file name and line numb☆12Jun 6, 2024Updated last year
- A branch of the boilerpipe project☆15Mar 18, 2011Updated 14 years ago
- A fuse filesystem to access the contents of iOS devices☆11Apr 6, 2019Updated 6 years ago
- An ATProto feed to fetch Norwegian posts from the Bluesky firehose☆11Dec 27, 2025Updated 2 months ago
- Collection of Stylus mixins that help write code in BEM notation☆10May 23, 2019Updated 6 years ago
- This repository contains the reference architecture implementation for the AWS Serverless Developer Experience workshop in Java☆16Feb 19, 2026Updated last week
- Clusterless is a tool for scheduling decentralized, scalable, and secure data pipelines for continuously arriving data, across clouds.☆15Dec 22, 2025Updated 2 months ago
- ☆11Aug 15, 2016Updated 9 years ago
- A crowd-powered database system, with SQL-like query interface, multi-goal optimization☆11Sep 4, 2017Updated 8 years ago
- S2RDF (SPARQL on Spark for RDF) is a SPARQL query processor for Hadoop based on Spark SQL. It uses the relational interface of Spark for …☆13Apr 21, 2018Updated 7 years ago
- Rekube is a ReasonML toolkit for Kubernetes configuration.☆28Apr 27, 2020Updated 5 years ago
- Composable metric reporters in Python.☆14Jun 6, 2024Updated last year
- Stock Backtester and Analysis application for EOD (End-of-day) data.☆15Mar 11, 2018Updated 7 years ago
- The Polyglot LR parser generator☆13Dec 20, 2021Updated 4 years ago
- ☆14Dec 16, 2019Updated 6 years ago
- Export Haskell type and aeson serializations to OCaml BuckleScript☆17Dec 11, 2020Updated 5 years ago
- Example Demo of Python <-> Java IPC using Google Protocol Buffers☆12May 11, 2016Updated 9 years ago
- PEG library for the Go language☆68Nov 14, 2016Updated 9 years ago
- A sample repository for Terraform to run Amazon SageMaker notebook part 1☆13Jan 16, 2020Updated 6 years ago
- Demonstrates some of the features of gRPC☆14Dec 15, 2019Updated 6 years ago
- Metabase driver and plugin for Materialize☆14Jul 28, 2025Updated 7 months ago
- Haskell library for non-deterministic pattern matching☆17Dec 21, 2025Updated 2 months ago
- Keep your MobX state in sync with react-router☆13Apr 27, 2022Updated 3 years ago
- ☆15Dec 19, 2025Updated 2 months ago
- Translations for ImportNew☆18Feb 21, 2014Updated 12 years ago
- ☆12Jul 4, 2018Updated 7 years ago
- Reference architecture for building Event Driven Architectures (EDA) on Amazon EKS☆13Mar 26, 2024Updated last year
- A computational model for insanely complex functions☆40May 13, 2012Updated 13 years ago
- JSON Web Token implementation for Java according to RFC 7519. Easily create, parse and validate JSON Web Tokens using a fluent API.☆14Jul 17, 2025Updated 7 months ago