medined / D4M_Schema
This project describes the D4M 2.0 Schema used in many Accumulo systems.
☆21 Updated 4 years ago
Alternatives and similar repositories for D4M_Schema:
Users who are interested in D4M_Schema are comparing it to the repositories listed below:
- Recipes & cookbooks for Accumulo. ☆37 Updated 8 years ago
- Scala utilities for teaching computational linguistics and prototyping algorithms. ☆42 Updated 12 years ago
- Dynamic Distributed Dimensional Data Model ☆41 Updated 10 months ago
- Presto Accumulo Integration ☆24 Updated last year
- InsightEdge Core ☆20 Updated 11 months ago
- scalding powered machine learning ☆109 Updated 10 years ago
- A framework for scalable graph computing. ☆147 Updated 6 years ago
- Bucketing and partitioning system for Parquet ☆30 Updated 6 years ago
- Mirror of Apache MRQL (Incubating) ☆17 Updated 7 years ago
- Introducing D4M with Baseball analytics ☆17 Updated 10 years ago
- A compiler for Pig Latin to Spark and Flink. ☆23 Updated 5 years ago
- Ductile DB is a graph database based on Hadoop/HBase which provides a vast set of features. ☆13 Updated 7 years ago
- Use Cascading Taps and Scalding DSL with Spark ☆49 Updated 8 years ago
- Sample custom Nifi processor to process tcpdump ☆18 Updated 9 years ago
- Vizlinc ☆14 Updated 9 years ago
- Scala client for the Lightning data visualization server (WIP) ☆47 Updated 5 years ago
- A library of machine learning algorithms implemented using principles of functional programming. ☆23 Updated 8 years ago
- Sample migration from Titan 0.5.4 to Titan 1.0.0 ☆17 Updated 9 years ago
- Graphulo: Accumulo library of matrix math primitives and graph algorithms ☆78 Updated 10 months ago
- Cascading on Apache Flink® ☆54 Updated last year
- A collection of Scala graph libraries and adapters for graph databases. ☆14 Updated 8 years ago
- Alenka JDBC is a library for accessing and manipulating data with the open-source GPU database Alenka. ☆19 Updated 10 years ago
- analytics tool kit ☆43 Updated 8 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them. ☆37 Updated 11 months ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines ☆29 Updated 9 years ago
- Library for building reproducible data pipelines to support experimentation ☆20 Updated 9 years ago
- Grab 'em, Tag 'em, Graph 'em (GTG) algorithm ☆6 Updated 8 years ago
- An implementation of TinkerPop Blueprints using Accumulo ☆32 Updated 9 years ago
- Experiments with the GDELT dataset and Cassandra schemas. ☆25 Updated 9 years ago
- Secondary sort and streaming reduce for Apache Spark ☆78 Updated last year