pranab/visitante

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pranab/visitante)

pranab / visitante

Set of Hadoop, Spark and Storm based tools for web and customer analytic

☆34

Alternatives and similar repositories for visitante

Users that are interested in visitante are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kevinweil / FileSetInputFormat
View on GitHub
A Hadoop input format for sending lists of files as keys to a mapper. Set the list of files, and an input split will be created per file…
☆16Apr 7, 2010Updated 16 years ago
mmay / PigJsonLoader
View on GitHub
A Load UDF for loading JSON files with Pig
☆15Jul 6, 2011Updated 15 years ago
pranab / beymani
View on GitHub
Hadoop, Spark and Storm based anomaly detection implementations for data quality, cyber security, fraud detection etc.
☆129Jan 22, 2024Updated 2 years ago
dpalbrecht / job-search-engine
View on GitHub
A job search engine and site created for the Relevance & Matching Tech community on Slack.
☆13Sep 29, 2023Updated 2 years ago
brusic / elasticsearch-hello-world-plugin
View on GitHub
Tutorial on how to create a new REST endpoint in elasticsearch
☆21Sep 9, 2011Updated 14 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
metzlerd / mavuno
View on GitHub
Mavuno: A Hadoop-Based Text Mining Toolkit
☆48Feb 7, 2012Updated 14 years ago
3scale / lua-resty-env
View on GitHub
OpenResty ENV cache
☆12Nov 16, 2017Updated 8 years ago
trulia / thoth-ml
View on GitHub
☆15Jan 3, 2015Updated 11 years ago
Kong / lua-resty-mediador
View on GitHub
Mediador, determine address of proxied request
☆11Sep 30, 2021Updated 4 years ago
RedisLabs / ReSearch
View on GitHub
Redis search and indexing in Java
☆16Sep 26, 2016Updated 9 years ago
julienledem / Pig-scripting-examples
View on GitHub
Examples of use of pig scripting languages capabilities
☆39Aug 1, 2016Updated 9 years ago
dbr / stravathings
View on GitHub
A random bunch of tools using old V1 API for www.strava.com (cycle-mapping thing)
☆17Oct 3, 2021Updated 4 years ago
tdunning / knn
View on GitHub
Large scale k-nn experiments
☆69Jul 31, 2024Updated last year
markovianhq / bonspy
View on GitHub
Convert real-time bidding (RTB) models to the AppNexus Bonsai language
☆15Oct 17, 2017Updated 8 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
tdunning / pig-vector
View on GitHub
Mahout vector encoding for pig
☆53Nov 20, 2022Updated 3 years ago
wilbur / Piggybank
View on GitHub
A reporistory of User-defined functions for Apache Pig
☆16Sep 20, 2010Updated 15 years ago
metamx / tranquility
View on GitHub
Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover…
☆13May 3, 2019Updated 7 years ago
seratch / inputvalidator
View on GitHub
Scala Input Validator with quite readable DSL
☆15Dec 31, 2013Updated 12 years ago
neo4j-contrib / training-v2
View on GitHub
☆18Feb 14, 2026Updated 5 months ago
tweetmagik / spark-yarn
View on GitHub
Launch Spark clusters on YARN
☆24Aug 29, 2011Updated 14 years ago
milesegan / scala-hadoop-example
View on GitHub
A translation of the WordCount example from the Hadoop tutorial from Java to Scala.
☆32Jul 6, 2010Updated 16 years ago
metamx / bytebuffer-collections
View on GitHub
ByteBuffer collection classes for java and jvm-based languages.
☆34Apr 9, 2018Updated 8 years ago
mozilla-metrics / akela
View on GitHub
A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.
☆77Mar 31, 2014Updated 12 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
mbseid / play-mongo-securesocial.g8
View on GitHub
Play Framework Bootstraped with Mongo Storage and SecureSocial Authentication
☆15Sep 13, 2013Updated 12 years ago
cbrew / Insults
View on GitHub
Code for the Kaggle insult competition
☆30Apr 25, 2015Updated 11 years ago
elastic / elasticsearch-hdfs
View on GitHub
Hadoop Plugin for ElasticSearch
☆63Aug 8, 2024Updated last year
kijiproject / kiji-bento
View on GitHub
Kiji BentoBox: Developer SDK for Kiji including a standalone zero-configuration HBase micro-cluster
☆25Sep 26, 2014Updated 11 years ago
hanborq / rockstor
View on GitHub
An Object Storage System implementation based on Hadoop and HBase, with similar features like S3 (Amazon Simple Storage Service).
☆19Apr 1, 2013Updated 13 years ago
uber-archive / kafka-spraynozzle
View on GitHub
A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.
☆49Dec 3, 2019Updated 6 years ago
tellapart / TellApart-Hadoop-Utils
View on GitHub
Utilities for working with Hadoop and Cascading
☆19Feb 8, 2011Updated 15 years ago
maloninc / Churros
View on GitHub
Churros is a Javascript library for sycing a local HTML 5 Web SQL Database with your server's database.
☆13Feb 27, 2011Updated 15 years ago
LinkedInAttic / white-elephant
View on GitHub
Hadoop log aggregator and dashboard
☆190Oct 29, 2013Updated 12 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
twitter / hraven
View on GitHub
hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format
☆129Jan 14, 2022Updated 4 years ago
edwardcapriolo / hive-geoip
View on GitHub
GeoIP Functions for hive
☆49Oct 13, 2020Updated 5 years ago
ManuelB / facebook-recommender-demo
View on GitHub
This project is the basis for a BedCon talk and should make it possible for the listener to build an own recommender.
☆74Nov 6, 2020Updated 5 years ago
pranab / sifarish
View on GitHub
Content based and collaborative filtering based recommendation and personalization engine implementation on Hadoop and Storm
☆335Nov 1, 2019Updated 6 years ago
hamishforbes / lua-resty-tlc
View on GitHub
General two level cache (lrucache + shared dict)
☆19Apr 19, 2017Updated 9 years ago
sequenceiq / yarn-monitoring
View on GitHub
Hadoop YARN monitoring with R
☆19Sep 16, 2014Updated 11 years ago
pranab / hoidla
View on GitHub
Set of real time stream processing algorithms that can be used by big data streaming platform
☆74Jul 12, 2025Updated last year