neomoha / python-lsi-similarity
A small Python program to compute semantic similarity between documents (or items) using Latent Semantic Indexing
☆13 Updated 10 years ago
Related projects
Alternatives and complementary repositories for python-lsi-similarity
- A browser extension providing Open Access bibliographical services ☆14 Updated last year
- An index data structure for approximate string search. ☆23 Updated 5 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆42 Updated 5 years ago
- Trying to generate name synonyms from wikidata ☆33 Updated 4 years ago
- The OpenCitations metadata model: documents and other material. ☆12 Updated 3 months ago
- Knowledge extraction from web data ☆92 Updated 6 years ago
- A deep learning model for extracting references from text ☆25 Updated last year
- Functions for analysing public patenting data. ☆15 Updated 6 years ago
- A PDF classifier ensemble with REST API service ☆23 Updated 3 years ago
- Crawling and analyzing data on Wikipedia ☆16 Updated 8 months ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a… ☆95 Updated 2 years ago
- Reading legal authority for the last time ☆34 Updated 6 months ago
- Terminology Extraction Program (English Version) ☆41 Updated 4 months ago
- Data Server for Topic Models ☆121 Updated last year
- Jupyter notebook + Code for reproducing Reddit Subreddit graphs ☆16 Updated 8 years ago
- 💡✏️⬇️ JSON to Markdown converter - Generate Markdown from format independent JSON ☆67 Updated 5 years ago
- Open Access PDF harvester ☆35 Updated 6 months ago
- Finds linguistic patterns effortlessly ☆33 Updated last year
- how hard is it to get a list of all local news sites in the United States (LOL) ☆8 Updated 4 years ago
- A Python library for defining rule-based overrides on messy data ☆12 Updated this week
- Semantic Technologies for the AIDA project ☆38 Updated 4 years ago
- Tools for tracking stories on news homepages ☆48 Updated 5 years ago
- Link Wikidata items to large catalogs ☆96 Updated 8 months ago
- The OpenCitations RDF Resource Browser ☆11 Updated this week
- Processing OpenCitations Data ☆17 Updated 7 years ago
- A visual timeline authoring tool that extracts temporal information from freeform text ☆64 Updated last year
- Extraction Toolkit ☆81 Updated 3 years ago
- A natural language date parser. (Python version of chrono.js) ☆25 Updated 6 months ago
- Sort-friendly URI Reordering Transform (SURT) python module ☆40 Updated 3 months ago
- TAXI: a Taxonomy Induction Method based on Lexico-Syntactic Patterns, Substrings and Focused Crawling ☆29 Updated last year