neomoha / python-lsi-similarityLinks
A small code in python to compute semantic similarity between documents (or items) using Latent Semantic Indexing
β14Updated 11 years ago
Alternatives and similar repositories for python-lsi-similarity
Users that are interested in python-lsi-similarity are comparing it to the libraries listed below
Sorting:
- π‘βοΈοΈ β¬οΈοΈ JSON to Markdown converter - Generate Markdown from format independent JSONβ78Updated 6 years ago
- This repository provides various Python methods for finding and aggregating synonyms for an individual word or a list of words.β35Updated 2 years ago
- An index data structure for approximate string search.β23Updated 6 years ago
- Poor man's simple harvester for arXiv resourcesβ13Updated 2 years ago
- Datasette plugin providing instructions for exporting data to Jupyter or Observableβ13Updated 2 years ago
- A helper library full of URL-related heuristics.β73Updated 2 months ago
- Tools for running enrichments against data stored in Datasetteβ25Updated last month
- Export/access your Hypothes.is data: annotations and profile infoβ46Updated 5 months ago
- Get the scholarly citation for any research product: software, preprint, paper, or datasetβ81Updated 2 years ago
- A Python module to discover the etymology of wordsβ151Updated last year
- Now included in rigourβ152Updated 3 weeks ago
- Taupe takes a downloaded Twitter archive ZIP file, extracts the URLs corresponding to tweets, retweets, replies, quote tweets, and liked β¦β33Updated 2 years ago
- Find and explore the shortest path between any two pages on Wikipediaβ82Updated 10 years ago
- Finds linguistic patterns effortlesslyβ39Updated 2 years ago
- Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscoveryβ58Updated last year
- Datasette plugin for inserting and updating dataβ20Updated last year
- A python package to simulate typographical errors.β38Updated 2 years ago
- Parse government documents into well formed JSONβ75Updated this week
- tool for collectively summarizing large discussionsβ145Updated 3 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (incluβ¦β66Updated last week
- a tool to snapshot sqlite databases you don't ownβ23Updated 3 months ago
- Export contacts from the macOS Contacts app in vCard format to Markdown files with structured data.β17Updated last year
- Python wrapper library for the Datamuse APIβ82Updated 2 years ago
- Python search module for fast approximate string matchingβ54Updated 2 years ago
- A natural language date parser. (Python version of chrono.js)β25Updated 6 months ago
- Non-linear, non-hierarchical knowledge management: Helper scripts for your Zettelkasten.β19Updated 2 months ago
- Data cleaning and validation functions for names, languages, identifiers, etc.β50Updated this week
- Analyze and extract Wikipedia article text and attributes and store them into an ElasticSearch index or to json files (multilingual suppoβ¦β47Updated 2 years ago
- Checks and fixes URLs in code and documentation.β173Updated this week
- Extraction Toolkitβ83Updated 4 years ago