Python library to perform Locality-Sensitive Hashing for faster nearest-neighbor search in high-dimensional data
☆19Aug 15, 2024Updated last year
Alternatives and similar repositories for lshashing
Users that are interested in lshashing are comparing it to the libraries listed below
- Entity resolution, also known as data matching or record linkage, is the task of finding records in a data set that refer to the same or similar real…☆32Apr 8, 2025Updated 10 months ago
- Alfa Battle 2.0☆27Jan 18, 2021Updated 5 years ago
- A tutorial on locality sensitive hashing, using MinHashing for document similarity and CosineSimilarity for Euclidean space similarity.☆34Feb 2, 2021Updated 5 years ago
- Problem A of the 2nd "Teddy Cup" Data Analysis Vocational Skills Competition☆10Sep 15, 2020Updated 5 years ago
- Automated Continuous Data Quality Measurement☆12Nov 15, 2023Updated 2 years ago
- Causality in Knowledge Graphs☆11Oct 12, 2022Updated 3 years ago
- Simple Python script that converts all Excel files (xls, xlsx, xlsm, csv) in a directory into xlsb files.☆10Mar 13, 2023Updated 2 years ago
- Python library for the simulation of probabilistic circuits.☆11Feb 1, 2026Updated last month
- Includes sample datasets for machine learning☆10Apr 1, 2017Updated 8 years ago
- A companion extension to PgBouncer that can be used to manage and run PgBouncer from Postgres☆15May 29, 2024Updated last year
- A maximum-strength name parser for record linkage.☆39Sep 3, 2025Updated 6 months ago
- Some takeaways from the 8th "Teddy Cup" Data Mining Challenge☆10Nov 26, 2020Updated 5 years ago
- A fast TUI application (with optional webui) to visually navigate and inspect JSON and JSONL data. Easily localize parse errors in large …☆15Sep 30, 2024Updated last year
- OSX Plex Media Server autostart plist☆10Jun 23, 2021Updated 4 years ago
- RAG + Semantic Search for Apple Notes☆13Feb 27, 2025Updated last year
- The official Python library for the Writer API☆11Feb 24, 2026Updated last week
- 🪐 A framework for distributed load testing experiments☆11Mar 18, 2024Updated last year
- Acoustic-prosodic entrainment measurement in spoken dialogue and approximation of the evolution of a speaker’s a/p features.☆12Feb 26, 2024Updated 2 years ago
- CSC 424 Advanced Database Management Systems☆16Jan 1, 2020Updated 6 years ago
- Classification of human emotion using multi-modal models☆12Jun 27, 2020Updated 5 years ago
- Automate Mailchimp reports using Google Apps Script and Google spreadsheets☆10Mar 9, 2014Updated 11 years ago
- CDbw Index For Cluster Validation☆10Mar 26, 2019Updated 6 years ago
- Tool to pack Node.js module files and module system emulator within one source file.☆12Mar 26, 2014Updated 11 years ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
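- Source code of the Excellence Award entry in the machine learning track of the 2020 Tencent Game Security Technology Competition☆10Apr 16, 2020Updated 5 years ago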
- Some Monte Carlo algorithms for the estimation of small probabilities associated with rare events☆11Aug 16, 2023Updated 2 years ago
- Example of a Semantic Search Engine Using ChatGPT☆10Apr 6, 2023Updated 2 years ago
- Generic API clients based on Pydantic and protocols☆13Feb 26, 2026Updated last week
- A subset of the SQL dialect for the ClickHouse database.☆13Jan 9, 2023Updated 3 years ago
- ☆10Sep 2, 2023Updated 2 years ago
- Enhancing virtual KG access over tabular data with RML and CSVW☆12Jan 7, 2023Updated 3 years ago
- Agentic coding framework powered by AGENTS.md — systematic, test-first workflows with quality gates for Cursor, Codex, Gemini CLI, and AI…☆34Feb 21, 2026Updated last week
- Research code and scripts used in the Silburt et al. (2021) EMNLP 2021 paper 'FANATIC: FAst Noise-Aware TopIc Clustering'☆11Jul 6, 2023Updated 2 years ago
- Swarming behaviour is based on aggregation of simple drones exhibiting basic instinctive reactions to stimuli. However, to achieve overal…☆12Dec 2, 2019Updated 6 years ago
- Code from Robert C. Martin's Agile Software Development, Principles, Patterns, and Practices, in Java☆10Dec 12, 2017Updated 8 years ago
- Character recognition, text-line recognition☆10Aug 31, 2019Updated 6 years ago
- Code for the paper "ZHEClean: Cleaning Dirty Knowledge Graphs using Zero Human-labeled Examples"☆10Jul 23, 2021Updated 4 years ago
- ☆10Apr 11, 2022Updated 3 years ago
- Ant Financial natural language processing competition.☆10Sep 3, 2018Updated 7 years ago