semantic-sh is a SimHash implementation that detects and groups similar texts by leveraging word vectors and transformer-based language models (BERT).
☆28 · Jul 25, 2024 · Updated last year
Alternatives and similar repositories for semantic-sh
Users who are interested in semantic-sh are comparing it to the libraries listed below.
- Visual Hash for matching copies of visually similar images. ☆16 · Mar 17, 2025 · Updated last year
- My own playground for PLP (Programming Language Processing) using DeepLearning techniques ☆19 · Apr 12, 2023 · Updated 2 years ago
- SimHash algorithm implementation for duplicate detection across massive amounts of content ☆14 · Apr 23, 2016 · Updated 9 years ago
- Dashboard is a collection of pre-made Grafana dashboards that let you track and analyze data from Twitter, GitHub, Docker, and Packagist. ☆11 · Dec 27, 2022 · Updated 3 years ago
- A fast python implementation of the SimHash algorithm. ☆27 · Oct 27, 2021 · Updated 4 years ago
- Repology bulk exporter ☆13 · Jul 4, 2019 · Updated 6 years ago
- This is a document concerning Data Readiness in the context of machine learning and Natural Language Processing. ☆13 · Oct 14, 2021 · Updated 4 years ago
- ETL jobs that DoltHub maintained that load public data into DoltHub. ☆20 · Mar 7, 2023 · Updated 3 years ago
- This repository will contain a demo using Weaviate with data and metadata from the arXiv dataset. ☆15 · Mar 8, 2022 · Updated 4 years ago
- NAYN Machine Learning Projects ☆10 · Aug 20, 2024 · Updated last year
- Plots a word association graph between the nouns in a given text with the adjectives and verbs in the text ☆11 · Jul 19, 2019 · Updated 6 years ago
- A data processing module implemented with numpy ☆10 · Aug 16, 2022 · Updated 3 years ago
- Use GPT-3 to generate git commands from English-language descriptions of what you want to do. ☆11 · Sep 4, 2020 · Updated 5 years ago
- Turkish-Sentence Encoder with Quick-Thought Vectors ☆11 · Dec 15, 2019 · Updated 6 years ago
- Telegram bot that posts new hot stories from Designer News, shots from Dribbble and projects from Behance to telegram channel ☆12 · Mar 4, 2023 · Updated 3 years ago
- Handle scheduled tasks associated with Eloquent models. ☆18 · Mar 27, 2023 · Updated 3 years ago
- 🔥 Wrapper for LinkedIn's RESTful API written in Golang. ☆16 · Aug 10, 2020 · Updated 5 years ago
- Fine-tuned KoGPT2 chatbot demo with translated PersonaChat (ongoing) ☆13 · Apr 17, 2022 · Updated 3 years ago
- A PHP library that provides international ISO codes. ☆17 · Apr 1, 2025 · Updated 11 months ago
- Framework for creating CLI apps using Python ☆16 · May 29, 2020 · Updated 5 years ago
- Translation files generator for Laravel ☆17 · Mar 17, 2026 · Updated last week
- A small library that wraps Keras models to pickle them. ☆14 · Jul 17, 2018 · Updated 7 years ago
- Incorporate Image, Text and Tabular Data with HuggingFace Transformers ☆12 · Mar 1, 2022 · Updated 4 years ago
- Implementation of COO, CSR, CSC, SSS and TJDS sparse matrix formats. ☆11 · Jul 15, 2015 · Updated 10 years ago
- Cool links, research papers, and open source projects related to Machine Learning applied to Soccer (MLonSoccer) ☆17 · Jun 16, 2020 · Updated 5 years ago
- A practical starter for cross-platform OpenGL applications ☆12 · Aug 3, 2020 · Updated 5 years ago
- simple kv store for streams ☆36 · Mar 14, 2013 · Updated 13 years ago
- Laravel Sliding Window Rate Limiter ☆20 · Feb 5, 2024 · Updated 2 years ago
- Instructions for how to convert a BERT Tensorflow model to work with HuggingFace's pytorch-transformers, and spaCy. This walk-through use… ☆27 · Mar 24, 2023 · Updated 3 years ago
- ☆13 · Feb 11, 2019 · Updated 7 years ago
- Dynamic Hashed Blocks (DHB) data structure for dynamic graphs ☆12 · Sep 8, 2025 · Updated 6 months ago
- ☆13 · Oct 5, 2025 · Updated 5 months ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024) ☆10 · May 31, 2024 · Updated last year
- Cyclical Curriculum Learning Code ☆12 · Jul 29, 2023 · Updated 2 years ago
- Allow a User Model to block another User Model ☆20 · Jan 5, 2026 · Updated 2 months ago
- statically generated weekly digest of articles read in Pocket ☆10 · May 14, 2019 · Updated 6 years ago
- An end-to-end event extraction and summarization system. ☆22 · Oct 13, 2020 · Updated 5 years ago
- Maximum weight matching in general graphs ☆11 · Oct 3, 2016 · Updated 9 years ago
- A nuxt module to expose Vuex state in the browser URL for easy sharing ☆12 · Aug 28, 2017 · Updated 8 years ago