mnielsen/Pregel

Toy single-machine implementation of the Pregel graph-based framework

☆119

Alternatives and similar repositories for Pregel

Users that are interested in Pregel are comparing it to the libraries listed below.

mnielsen / cognitive_tools
Rough in-progress notes on cognitive tools
☆31 · May 14, 2013 · Updated 13 years ago
mnielsen / ec2_tools
Deprecated. Formerly: scripts to make it easier to set up and manipulate clusters at Amazon EC2
☆110 · Jul 26, 2012 · Updated 14 years ago
mnielsen / mini_qa
Toy question answering program. Aimed at "Who ....?" questions, e.g., "Who invented the C programming language?"
☆38 · Jan 8, 2017 · Updated 9 years ago
sujitpal / hia-examples
Hadoop In Action Examples
☆40 · Apr 26, 2021 · Updated 5 years ago
viatoriche / microservices
Microservices builder for Python
☆15 · Mar 5, 2026 · Updated 4 months ago
alienrobotwizard / varaha
Machine learning and natural language processing with Apache Pig
☆53 · Dec 17, 2013 · Updated 12 years ago
petewarden / MLloWorld
Shows how to write a simple data contest entry for Kaggle, using scikit-learn for machine learning algorithms
☆18 · Nov 18, 2011 · Updated 14 years ago
apache / hama
Mirror of Apache Hama
☆133 · Feb 11, 2020 · Updated 6 years ago
pierre / hfind
Find implementation for Hadoop
☆17 · Sep 9, 2015 · Updated 10 years ago
shilad / PyVowpal
Python wrapper for the Vowpal Wabbit machine learning library.
☆52 · Jul 19, 2013 · Updated 13 years ago
apache / giraph
Mirror of Apache Giraph
☆620 · Apr 14, 2023 · Updated 3 years ago
tomslabs / avro-utils
Utilities to use Avro files from Hadoop Map/Reduce jobs and Streaming
☆26 · Sep 10, 2013 · Updated 12 years ago
gparker / vowpal_wabbit
John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm
☆57 · Aug 1, 2024 · Updated last year
DataLucence / images
Material for the DataLucence:Images course
☆10 · Jun 14, 2017 · Updated 9 years ago
wilbur / Piggybank
A repository of user-defined functions for Apache Pig
☆16 · Sep 20, 2010 · Updated 15 years ago
farr / mcmc-clojure
A library for MCMC computations in Clojure.
☆20 · Jul 13, 2012 · Updated 14 years ago
tdunning / pig-vector
Mahout vector encoding for Pig
☆53 · Nov 20, 2022 · Updated 3 years ago
cjdd3b / citizen-quotes
A project to demonstrate maximum entropy models for extracting quotes from news articles in Python.
☆26 · Aug 27, 2012 · Updated 13 years ago
ogrisel / pignlproc
Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.
☆163 · Nov 8, 2022 · Updated 3 years ago
decultured / Python-Language-Detector
Python Language Detector
☆16 · Jul 19, 2013 · Updated 13 years ago
Yelp / mrjob
Run MapReduce jobs on Hadoop or Amazon Web Services
☆2,611 · Apr 2, 2026 · Updated 3 months ago
simplegeo / tablesnap
Uses inotify to monitor Cassandra SSTables and upload them to S3
☆29 · Nov 17, 2011 · Updated 14 years ago
sd4324530 / webChat
Multi-user web chat implemented with WebSocket
☆17 · Oct 19, 2018 · Updated 7 years ago
googlearchive / solutions-apache-hive-and-pig-on-google-compute-engine
This sample app will get up and running quickly with Hive and/or Pig on a Hadoop cluster on Google Compute Engine. For more information …
☆19 · Jan 9, 2018 · Updated 8 years ago
allenai / brat
brat rapid annotation tool (brat) - for all your textual annotation needs
☆10 · Feb 3, 2018 · Updated 8 years ago
klbostee / dumbo
Python module that allows one to easily write and run Hadoop programs.
☆1,030 · Jan 9, 2018 · Updated 8 years ago
saffsd / geniatagger
Part-of-speech tagging, shallow parsing, and named entity recognition for biomedical text
☆23 · Sep 7, 2010 · Updated 15 years ago
revsys / django-tracer
Generate a UUID on all Django requests for traceability
☆14 · Jul 31, 2018 · Updated 7 years ago
utcompling / OpenNLP-Models
A project for code to create models from existing corpora and distribute models.
☆42 · Apr 11, 2012 · Updated 14 years ago
bwmcadams / mongodb_beaker
Beaker caching / session plugin for MongoDB
☆21 · Jan 21, 2011 · Updated 15 years ago
igrigorik / language_detector
Ruby language detection library using n-grams
☆61 · Oct 26, 2016 · Updated 9 years ago
alexbowe / keyphrase
Key phrase extraction using Hadoop + Dumbo + NLTK
☆15 · Mar 10, 2024 · Updated 2 years ago
maet3608 / nuts-flow
A simple dataflow framework in Python
☆18 · Mar 4, 2021 · Updated 5 years ago
endpnt / andoc
Collaborative web tool to enrich content
☆11 · Nov 13, 2011 · Updated 14 years ago
echen / scaldingale
Movie recommendations and more in MapReduce and Scalding
☆117 · Feb 11, 2013 · Updated 13 years ago
MrChrisJohnson / CollabStream
Parallelized Online Matrix Factorization for Collaborative Filtering using Stochastic Gradient Descent
☆44 · May 6, 2016 · Updated 10 years ago
electrum / hive-serde
JSON Serde for Hive
☆22 · Oct 13, 2011 · Updated 14 years ago
Cascading / CoPA
Cascading plus City of Palo Alto open data
☆29 · Mar 3, 2013 · Updated 13 years ago
michaelfairley / mincemeatpy
Lightweight MapReduce in Python
☆481 · May 6, 2021 · Updated 5 years ago