twitter / GraphJet
GraphJet is a real-time graph processing library.
☆715 · Updated last year
Related projects
Alternatives and complementary repositories for GraphJet
- Cassovary is a simple big graph processing library for the JVM ☆1,045 · Updated 3 years ago
- BlinkDB: Sub-Second Approximate Queries on Very Large Data. ☆660 · Updated 10 years ago
- Streaming MapReduce with Scalding and Storm ☆2,137 · Updated 2 years ago
- A Scala API for Cascading ☆3,500 · Updated last year
- Simplifying robust end-to-end machine learning on Apache Spark. ☆469 · Updated 7 years ago
- A platform for visualization and real-time monitoring of data workflows ☆1,179 · Updated 4 years ago
- Mirror of Apache Giraph ☆618 · Updated last year
- Compact in-memory representation of directed graph data ☆563 · Updated last year
- Pig Visualization framework ☆464 · Updated last year
- A software library of stochastic streaming algorithms, a.k.a. sketches. ☆898 · Updated this week
- Library and tools for advanced feature engineering ☆568 · Updated 3 years ago
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines. ☆854 · Updated 3 years ago
- A library to implement asynchronous dependency graphs for services in Java ☆250 · Updated last year
- MacroBase: A Search Engine for Fast Data ☆661 · Updated last year
- Abstract Algebra for Scala ☆2,289 · Updated 3 months ago
- An embeddable write-once key-value store written in Java ☆939 · Updated 4 years ago
- A CPU and GPU-accelerated matrix library for data mining ☆265 · Updated 3 years ago
- Mirror of Apache Apex core ☆349 · Updated 3 years ago
- Java streaming parser/serializer for Ion. ☆866 · Updated this week
- PowerGraph: A framework for large-scale machine learning and graph computation. ☆346 · Updated 2 years ago
- Mirror of Apache Samza ☆819 · Updated last month
- A scalable machine learning library on Apache Spark ☆792 · Updated 3 years ago
- All development now happens over here: https://github.com/cwensel/cascading. Cascading is a feature rich API for defining and executing c… ☆331 · Updated 5 years ago
- A java library for stored queries ☆374 · Updated last year
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning ☆1,787 · Updated 3 years ago
- Hollow is a java library and toolset for disseminating in-memory datasets from a single producer to many consumers for high performance r… ☆1,205 · Updated this week
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in… ☆1,040 · Updated 2 years ago
- Distributed Prometheus time series database ☆1,428 · Updated this week