lexibank / lexibank-analysed
Study on lexibank data (presenting the lexibank dataset).
☆14Updated 4 months ago
Alternatives and similar repositories for lexibank-analysed
Users that are interested in lexibank-analysed are comparing it to the libraries listed below
- Public domain corpus of Catalan text☆18Updated 3 years ago
- Semantic spaces in python☆14Updated 2 years ago
- Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your com…☆131Updated 5 months ago
- Markdown template for Datasheets for Datasets☆63Updated 3 years ago
- Notebooks and data associated to constructing and exploring a map of subreddits.☆55Updated 8 years ago
- A PDF classifier ensemble with REST API service☆23Updated 4 years ago
- The RadioTalk dataset of talk radio transcripts☆60Updated 4 years ago
- Finds linguistic patterns effortlessly☆38Updated 2 years ago
- Repository for the allofplos project.☆65Updated 3 months ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- bin files☆13Updated 7 months ago
- A simple interface to the Project Gutenberg corpus.☆17Updated 9 years ago
- Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefi…☆36Updated 2 months ago
- Featurize words into orthographic and phonological vectors.☆41Updated 2 years ago
- Python port for IWNLP.Lemmatizer☆17Updated last year
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆16Updated last week
- Tool that extracts the text from web article URLs and provides word frequencies, entity recognition, automatic summaries and more☆20Updated 6 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 7 years ago
- A clean and easy interface for performing nearest-neighbor lookups☆50Updated 5 years ago
- Automatically exported from code.google.com/p/guess-language☆52Updated last year
- German lemmatization with IWNLP as extension for spaCy☆24Updated 2 years ago
- The python curation library for lexibank☆20Updated 11 months ago
- A pipeline for detecting novel information about entities from a stream of text, updating a knowledge base about the entities, and genera…☆32Updated 6 years ago
- Treex NLP framework☆32Updated last month
- A lemmatizer for Icelandic text☆17Updated 7 years ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation.☆21Updated 8 years ago
- A tool for analyzing the word histories of a text.☆34Updated 9 months ago
- Hexatomic is an extensible software for deep multi-layer annotation of linguistic corpora☆18Updated 9 months ago
- New York Times Word Innovation Types dataset☆21Updated 4 years ago
- The curation repository for the data behind Concepticon.☆39Updated this week