ngrams-dev / general
NGRAMS is a search engine for the Google Books Ngram Dataset. This repository contains documentation, discussions, announcements, and issues.
☆16Updated last year
Alternatives and similar repositories for general:
Users that are interested in general are comparing it to the libraries listed below
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction☆32Updated 3 months ago
- This repository provides various Python methods for finding and aggregating synonyms for an individual word or a list of words.☆33Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆151Updated last month
- 📑 Python Package to reconstruct the original continuous text from PDFs with language models☆32Updated last year
- A free, fast, community-focused transcription tool to transcribe texts in Latin, French, German, and Italian into IPA.☆11Updated 2 years ago
- Gutenberg cache and query library☆35Updated 5 months ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆111Updated this week
- 🦦 weasel: A small and easy workflow system☆71Updated 6 months ago
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆20Updated last month
- ☆18Updated 2 years ago
- OWL2 representation in Rust☆15Updated last year
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆17Updated this week
- Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.☆72Updated last year
- Context-sensitive word embeddings with subwords. In Rust.☆86Updated last year
- A sentence segmentation library with wide language support optimized for speed and utility.☆55Updated 4 months ago
- 🧭 Resolve, visualize and browse the content of any SPARQL endpoint☆14Updated last year
- RDF library implemented in Rust☆28Updated 4 years ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆17Updated this week
- The curation repository for the data behind Concepticon.☆37Updated this week
- Multi Tier Annotation Search☆12Updated 8 months ago
- The NLP Bias Identification Toolkit☆36Updated last year
- Manifests of the public domain images uploaded to Flickr Commons, with descriptive information about the books they were taken from.☆74Updated 10 years ago
- Coquery is a free corpus query tool for linguists, lexicographers, translators, and anybody who wishes to search and analyse a text corpu…☆19Updated 2 years ago
- Python Multilingual Ucrel Semantic Analysis System☆31Updated 5 months ago
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Updated 4 years ago
- Framework for creating and accessing UBY resources – sense-linked lexical resources in standard UBY-LMF format☆22Updated 6 years ago
- Python Finite-State Toolkit☆47Updated last week
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆93Updated last year
- an interactive visual tool for exploring ideologies of political parties from up to date WikiData, using SPARQL, D3js, and PixiJS☆16Updated 3 years ago