lintool/Ivory

A Hadoop toolkit for web-scale information retrieval research

☆87

Alternatives and similar repositories for Ivory

Users that are interested in Ivory are comparing it to the libraries listed below.

lintool / Cloud9
Cloud9 is a Hadoop toolkit for working with big data
☆237 · Dec 15, 2015 · Updated 10 years ago
tdunning / pig-vector
Mahout vector encoding for pig
☆53 · Nov 20, 2022 · Updated 3 years ago
lintool / IR-Reproducibility
Open-Source Information Retrieval Reproducibility Challenge
☆51 · Jan 11, 2016 · Updated 10 years ago
turian / textrank
Java implementation of the TextRank algorithm by Mihalcea, et al. http://lit.csci.unt.edu/index.php/Graph-based_NLP
☆18 · May 25, 2010 · Updated 16 years ago
alad / Mekano
Building blocks for Information Retrieval & Machine Learning
☆16 · Oct 12, 2010 · Updated 15 years ago
alienrobotwizard / varaha
Machine learning and natural language processing with Apache Pig
☆53 · Dec 17, 2013 · Updated 12 years ago
josephreisinger / lvm-toolkit
UT Austin Machine Learning Group Latent Variable Modeling Toolkit
☆26 · Feb 2, 2012 · Updated 14 years ago
metzlerd / mavuno
Mavuno: A Hadoop-Based Text Mining Toolkit
☆48 · Feb 7, 2012 · Updated 14 years ago
lintool / clueweb
Hadoop tools for manipulating ClueWeb collections
☆26 · Jul 15, 2016 · Updated 10 years ago
aritter / LDA-SP
Includes Code for Inference and Evaluation of Topic Models for Selectional Preferences
☆16 · Mar 10, 2023 · Updated 3 years ago
teanalab / SWDM
SIGIR 2017: Embedding-based query expansion for weighted sequential dependence retrieval model
☆36 · Aug 2, 2017 · Updated 8 years ago
lintool / Mr.LDA
Scalable Topic Modeling using Variational Inference in MapReduce
☆149 · Oct 20, 2015 · Updated 10 years ago
jjfiv / galago-git
Experimental Git Mirror of "https://sourceforge.net/p/lemur/galago" using "https://github.com/felipec/git-remote-hg"
☆13 · Dec 17, 2020 · Updated 5 years ago
davidandrzej / chisel
Clojure wrapper for LDA topic modeling in MALLET
☆33 · Sep 6, 2011 · Updated 14 years ago
LanceNorskog / LSH-Hadoop
Implementation of Tyler Neylon's Locality-Specific Hash based on simplex tessellations
☆28 · Oct 15, 2011 · Updated 14 years ago
josephreisinger / dist_lda
distributed latent dirichlet allocation
☆29 · Dec 15, 2011 · Updated 14 years ago
agesmundo / HadoopPerceptron
http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en//pubs/archive/36266.pdf
☆14 · Apr 25, 2012 · Updated 14 years ago
bmuller / pymur
pymur is a Python interface to The Lemur Toolkit.
☆19 · Sep 17, 2018 · Updated 7 years ago
lenn0x / Scribe-log4j-Appender
Generic log4j appender that uses Scribe for sending log messages
☆32 · Aug 24, 2010 · Updated 15 years ago
DigitalPebble / behemoth
Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.
☆283 · Apr 25, 2018 · Updated 8 years ago
eXascaleInfolab / pytrec_eval
A library to evaluate TREC-like runs with TREC-like qrels. Implements similarity of rankings, t-test between runs etc…
☆19 · May 4, 2020 · Updated 6 years ago
infochimps-labs / wonderdog
Bulk loading for Elasticsearch
☆186 · Dec 16, 2023 · Updated 2 years ago
interllective / MongoReduce
Hadoop Input and Output formats for MongoDB
☆29 · Nov 15, 2011 · Updated 14 years ago
ewhauser / flume-kafka-plugin
☆23 · Oct 17, 2011 · Updated 14 years ago
s4 / core
S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily develop ap…
☆233 · Mar 4, 2011 · Updated 15 years ago
lintool / twitter-tools
Twitter Tools
☆222 · Feb 18, 2018 · Updated 8 years ago
Cue / lucene-interval-fields
Lucene fields and queries for interval fields.
☆39 · Dec 23, 2015 · Updated 10 years ago
tdunning / knn
Large scale k-nn experiments
☆69 · Jul 31, 2024 · Updated last year
FurongHuang / TensorDecomposition4TopicModeling
☆20 · Jun 26, 2017 · Updated 9 years ago
cvangysel / pyndri
pyndri is a Python interface to the Indri search engine.
☆89 · Jun 21, 2022 · Updated 4 years ago
faneshion / DRMM
CIKM 2016 paper
☆28 · Nov 29, 2019 · Updated 6 years ago
matpalm / common-crawl-quick-hacks
common crawl quick hack examples
☆19 · Feb 11, 2015 · Updated 11 years ago
lintool / MapReduceAlgorithms
Data-Intensive Text Processing with MapReduce
☆628 · Mar 3, 2021 · Updated 5 years ago
teh / ppjoin
Hacky implementation of ppjoin by Chuan Xiao et al.
☆19 · Aug 24, 2014 · Updated 11 years ago
bhaskar-mitra / Demos
A bag of miscellaneous demos!
☆13 · Feb 5, 2017 · Updated 9 years ago
laura-dietz / tutorial-kb4ir
Resources for the Tutorial on "Utilizing Knowledge Bases in Text-centric Information Retrieval"
☆25 · Sep 18, 2016 · Updated 9 years ago
kijiproject / kiji-express
☆16 · Sep 26, 2014 · Updated 11 years ago
howech / jruby-flume
JRuby plugin for flume (jRubySource, jRubySink, jRubyDecorator).
☆17 · Mar 18, 2011 · Updated 15 years ago
pranab / chargeur
CSV file loader for HBase and Cassandra
☆17 · May 12, 2021 · Updated 5 years ago