Similarity join and search algorithms for edit distance and Jaccard
☆19Dec 17, 2017Updated 8 years ago
Alternatives and similar repositories for Similarity-Search-and-Join
Users that are interested in Similarity-Search-and-Join are comparing it to the libraries listed below.
- C++ Library implementing Compressed String Dictionaries☆47Apr 25, 2022Updated 3 years ago
- An experiment based on the KDD 2009 paper "Beyond Blacklists: Learning to Detect Malicious Web Sites from Suspicious URLs"☆10Jan 8, 2019Updated 7 years ago
- Implementation of the data structures described in the paper "Fast Compressed Tries using Path Decomposition".☆58Jan 27, 2023Updated 3 years ago
- A continuously updated list of cloud database papers☆83May 22, 2024Updated last year
- Stanford Sentiment Treebank machine learning & sentiment analysis library☆40Sep 23, 2013Updated 12 years ago
- A fuzzy matching & clustering library for Python.☆26Jul 17, 2025Updated 9 months ago
- A C++ library for summarizing data streams☆23Jul 26, 2019Updated 6 years ago
- C++ implementations of indexing mechanisms, including a Hilbert-curve geohash based spatial index and a linear hashing table, for disk or…☆78Nov 28, 2020Updated 5 years ago
- Learning problem-solving, logic/set, math, physics, economics through functional programming using Haskell☆19Oct 16, 2015Updated 10 years ago
- Data structures and algorithms that might be useful for ACM training.☆13Sep 2, 2015Updated 10 years ago
- V-gram indexing for PostgreSQL☆12Jul 30, 2025Updated 8 months ago
- A C++ library providing fast language model queries in compressed space.☆132Feb 25, 2023Updated 3 years ago
- PKU NLP toolkit☆10Jun 5, 2018Updated 7 years ago
- Programming the MSHTML Web Browser Control with C++☆12Dec 7, 2018Updated 7 years ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated last year
- This is the repository for the CMU course 67-300: Search Engines☆11Nov 8, 2023Updated 2 years ago
- An interactive red black tree application to demonstrate node insertion cases.☆13Jan 24, 2018Updated 8 years ago
- Cplusplus_sdk for TencentYoutuyun-person-face-service☆11Jan 2, 2017Updated 9 years ago
- Streaming Graph Server with partitioning☆15Aug 17, 2023Updated 2 years ago
- UCSD CSE231 Advanced Compiler - LLVM project☆12Mar 28, 2017Updated 9 years ago
- Greek treebank from the Perseus Digital Library☆12May 8, 2016Updated 9 years ago
- 🌳 A compressed rank/select dictionary exploiting approximate linearity and repetitiveness.☆15Jun 28, 2022Updated 3 years ago
- String distances in Rust☆14Nov 21, 2022Updated 3 years ago
- Artificial Intelligence and Deep Learning in Practice: Natural Language Processing☆18Apr 7, 2026Updated last week
- Implementation of eBWT using Prefix-free parse (PFP)☆14Jul 14, 2025Updated 9 months ago
- Python implementation of Hidden Markov Model, with demo of Chinese Part-of-Speech tagging☆16Jan 25, 2014Updated 12 years ago
- ☆13Nov 15, 2017Updated 8 years ago
- Includes a file with zstd compression in Rust☆13Feb 17, 2023Updated 3 years ago
- C++ implementation of Constant Database (CDB++)☆23Nov 12, 2010Updated 15 years ago
- Tokyo Metropolitan University Paraphrase Corpus (TMUP)☆11Jun 12, 2017Updated 8 years ago
- A blend of the compact and sparse hash table implementations.☆15Aug 20, 2021Updated 4 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆12Sep 12, 2016Updated 9 years ago
- A Vim configuration☆14Oct 26, 2016Updated 9 years ago
- Question Dependent Recurrent Entity Network☆13Sep 21, 2017Updated 8 years ago
- Short Text Classification with Deep Neural Networks: An Experimental Analysis☆18Nov 30, 2018Updated 7 years ago
- A tool for visualizing the internal structures of morphological analyzer Sudachi☆18Jun 9, 2022Updated 3 years ago
- Suffix array construction and searching algorithms for in-memory binary data.☆12Sep 10, 2022Updated 3 years ago
- A variety of content chunking algorithms with a common API in Rust