Reference implementations of data-intensive algorithms in MapReduce and Spark
☆82Sep 3, 2018Updated 7 years ago
Alternatives and similar repositories for bespin
Users that are interested in bespin are comparing it to the libraries listed below
Sorting:
- Lintools: tools by @lintool☆21Jan 26, 2025Updated last year
- Common web archive utility code.☆63Mar 2, 2026Updated 3 weeks ago
- A Hadoop toolkit for web-scale information retrieval research☆85Dec 12, 2014Updated 11 years ago
- A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.☆76Mar 31, 2014Updated 11 years ago
- Track app memory usage.☆11Jan 13, 2015Updated 11 years ago
- ↕️ Intuitive axiomatic retrieval experimentation.☆31Updated this week
- Cloud9 is a Hadoop toolkit for working with big data☆236Dec 15, 2015Updated 10 years ago
- Internet Archive's Sparkling Data Processing Library☆16Mar 3, 2026Updated 2 weeks ago
- A toolkit for simulating interactive information retrieval☆21Sep 7, 2018Updated 7 years ago
- C++ source code for the Dynamic Index algorithm proposed in "Efficient Similarity Computation for Collaborative Filtering in Dynamic Envi…☆16Jul 15, 2019Updated 6 years ago
- Hadoop streaming implementation of Li, et al: "PFP: Parallel FP-Growth for Query Recommendation", applied to the lastfm360k dataset☆11Sep 25, 2013Updated 12 years ago
- Predicting Political Instability and Social Conflicts Using Multimodal Data☆10Jun 6, 2016Updated 9 years ago
- Meta-Analysis of Robust04 Papers (Yang et al., SIGIR 2019)☆12May 25, 2019Updated 6 years ago
- Utility for cui2vec in Go☆13Feb 25, 2023Updated 3 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Feb 1, 2018Updated 8 years ago
- Exploiting SNP correlations within Random Forest for Genome-Wide Association Studies☆13Oct 20, 2014Updated 11 years ago
- Java 8 Factorization Machines Library☆28Feb 17, 2017Updated 9 years ago
- data amusement on the microsoft academic graph☆20Feb 7, 2017Updated 9 years ago
- A protovis visualization of the linked open data cloud.