A text similarity computation using minhashing and Jaccard distance on reuters dataset
☆17Jun 11, 2018Updated 7 years ago
Alternatives and similar repositories for Text-Similarity
Users that are interested in Text-Similarity are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- Transformer based Trigram Blocking implementation in Tensorflow☆11Feb 26, 2020Updated 6 years ago
- Creative Commons Media-Fingerprint Library☆12Sep 23, 2013Updated 12 years ago
- Free programming language books☆10Jun 4, 2020Updated 5 years ago
- Extracting Entities with Limited Evidence☆16Dec 26, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- CodFS: An Erasure-Coded Clustered Storage System for Efficient Updates and Recovery☆10Mar 31, 2015Updated 11 years ago
- Going Deeper: Infinite Deep Neural Networks☆96Nov 6, 2018Updated 7 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- Library for Character/Word n-gram Analysis☆23Mar 2, 2017Updated 9 years ago
- ☆17Feb 21, 2026Updated last month
- Code from http://www.ark.cs.cmu.edu/mheilman/questions/☆12Apr 23, 2013Updated 12 years ago
- CLI for rendering text with headless chrome.☆11Jul 11, 2020Updated 5 years ago
- A plugin that connects Thermaltake controllers to FanControl☆13Mar 4, 2023Updated 3 years ago
- Validates MD5/SHA1 CheckSums on the command line.☆16Dec 22, 2025Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- cs231n learning notes☆14Oct 28, 2017Updated 8 years ago
- Demonstration of using Caffe2 inside an Android application.☆10Dec 23, 2018Updated 7 years ago
- Tables☆11May 14, 2024Updated last year
- Facial-Expression Recognition with Deep Neural Networks☆10Mar 6, 2016Updated 10 years ago
- Tool for sentiment analysis annotation☆13Mar 26, 2025Updated last year
- ☆10Jun 23, 2018Updated 7 years ago
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.☆15Jun 9, 2019Updated 6 years ago
- Slides from my talk on spaCy IRL, regarding sparse attention.☆12Jul 9, 2019Updated 6 years ago
- This is a minimal acyclic finite-state automata algorithm in Java based on the paper, "Incremental Construction of Minimal Acyclic Finite…☆20Dec 31, 2013Updated 12 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Annotated corpus of Arabic tweets which mention a violence act.☆10Jun 6, 2018Updated 7 years ago
- ☆15Jan 6, 2022Updated 4 years ago
- Recurrent versus Recursive Approaches Towards Compositionality in Semantic Vector Spaces.☆13Sep 22, 2021Updated 4 years ago
- Python script to assemble individual Tweets from a public Twitter stream (either Gnip activity-streams format or original Twitter API for…☆12Aug 30, 2016Updated 9 years ago
- A Python module to provide software abstractions to ease accessing hyperknowledge graphs☆11Dec 19, 2024Updated last year
- Slides and code for the PyData Berlin 2018 tutorial☆16Nov 21, 2022Updated 3 years ago
- < 80 LOC Implementing Writer Pro's syntax control (with NSLinguisticTagger) that iA tried to patent☆106Dec 24, 2013Updated 12 years ago
- Resources, articles, thoughts, datasets, papers on TI tradecraft☆11Aug 24, 2018Updated 7 years ago
- Posterior inference in topic models with provably guaranteed algorithms☆21Feb 27, 2017Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆16May 6, 2021Updated 4 years ago
- ☆12Aug 26, 2021Updated 4 years ago
- qdapTools is an R package that contains tools associated with the qdap package that may be useful outside of the context of text analysis…☆15May 10, 2023Updated 2 years ago
- Find duplicate text files.☆14Jan 14, 2025Updated last year
- Helper tooling for parking PyPI namespaces to combat typosquatting.☆18Jun 22, 2025Updated 9 months ago
- Integration between Reaction ECommerce and Accelerated Text to provide product descriptions for an e-shop.☆13Feb 22, 2021Updated 5 years ago
- Simple HTML cleanup utilities☆27May 10, 2016Updated 9 years ago