simple simhashing in hadoop with cascading
☆33May 9, 2011Updated 14 years ago
Alternatives and similar repositories for cascading-simhash
Users that are interested in cascading-simhash are comparing it to the libraries listed below
Sorting:
- Postfix Redis Lookup Table Support / Postfix Redis Map☆14Jan 4, 2023Updated 3 years ago
- A library that adds some NLP capabilities to the Lucene search engine☆50Jul 16, 2013Updated 12 years ago
- [not maintained] Example using ElasticSearch and a UI☆28Feb 17, 2011Updated 15 years ago
- Parser for KAF NAF files written in Python☆16Jul 1, 2021Updated 4 years ago
- Supervised learning of morphology☆28Jan 17, 2017Updated 9 years ago
- Ubiflux Vigor ventilation system RS485 Modbus communications with Python☆11Feb 20, 2026Updated 2 weeks ago
- Compute association strength over semantic networks in a dimensionality-reduced form.☆32Aug 14, 2015Updated 10 years ago
- Generator of rule-based lemmatizers (based on examples) for serveral European languages.☆29Oct 5, 2021Updated 4 years ago
- The Powerful Python CMS☆11Nov 20, 2021Updated 4 years ago
- A fast, simple, multilingual tokenizer☆29May 24, 2017Updated 8 years ago
- A framework, data and configs for generating and building Tesseract OCR lang.traineddata model files, specifically for Japanese☆10Dec 9, 2013Updated 12 years ago
- Lightweight, multilingual natural language processing☆63Apr 8, 2013Updated 12 years ago
- A map of Durham Neighborhoods made with GeoJSON☆12Jul 6, 2024Updated last year
- Fast implementation of Gradient Boosting Machine (GBM) training algorithm.☆10Aug 26, 2019Updated 6 years ago
- A self-contained morphological analyzer (including dictionary data).☆33Jul 30, 2015Updated 10 years ago
- ☆11Nov 18, 2024Updated last year
- Exemplo de alguns design patterns implementados com a linguagem Lua.☆12Dec 30, 2010Updated 15 years ago
- Focused Crawler for VT's CTRNet☆10May 13, 2013Updated 12 years ago
- Camera streaming on Android using ffmpeg, x264, live555, forked from https://github.com/parizene/android-streamer ,but some function re…☆11Aug 26, 2018Updated 7 years ago
- (Labeled) Latent Dirichlet Allocation on a sentence level with Gibbs Sampling☆10Mar 27, 2014Updated 11 years ago
- Madek main web interface☆21Updated this week
- ARticated; An augmented reality application for Android☆10Apr 10, 2023Updated 2 years ago
- Speech ANDroid Apps☆20Jan 22, 2014Updated 12 years ago
- A golang package that implements a distributed tracing capability inspired by Google's Dapper☆12Jan 20, 2017Updated 9 years ago
- Experimental pure Java revised simplex linear program solver (Apache 2.0 license)☆15Jun 22, 2020Updated 5 years ago
- "Save as DAISY" add-in for Microsoft Word☆10Dec 22, 2025Updated 2 months ago
- ☆10Jan 28, 2013Updated 13 years ago
- Code-Implementation-of-Super-Resolution-ZOO (image & video)☆10Jul 6, 2020Updated 5 years ago
- Hungarian tokenizer.☆14Mar 15, 2022Updated 3 years ago
- COVID19 Healthcare Chatbot☆11Sep 1, 2021Updated 4 years ago
- Grecka is a python script to convert Greek to Greeklish based on ELOT 743☆12Aug 4, 2018Updated 7 years ago
- A small personal project to learn Clojure by implementing some simple machine learning algorithms☆29Oct 12, 2009Updated 16 years ago
- Ansible role to install beehive https://github.com/muesli/beehive☆12Jun 30, 2017Updated 8 years ago
- Stream based PDF library☆15Aug 20, 2015Updated 10 years ago
- Show common areas of bike accidents to help prevent future accidents☆11Oct 18, 2017Updated 8 years ago
- Some Geb page object examples for blogging/presenting☆15Feb 6, 2011Updated 15 years ago
- Native gettext package for Go☆22Nov 8, 2017Updated 8 years ago
- 練習と実益を兼ねて製作中の C++11 向け汎用ライブラリです。 対応コンパイラ: gcc 4.5.0 以降☆21Jun 11, 2015Updated 10 years ago
- Web content transformation proxies for open data API's☆16Dec 14, 2022Updated 3 years ago