lexibank / lexibank-analysedLinks
Study on lexibank data (presenting the lexibank dataset).
☆14Updated 4 months ago
Alternatives and similar repositories for lexibank-analysed
Users that are interested in lexibank-analysed are comparing it to the libraries listed below
Sorting:
- Public domain corpus of Catalan text☆18Updated 3 years ago
- Semantic spaces in python☆14Updated 2 years ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆49Updated 2 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- Python package for converting xml and epubs to text files☆34Updated 5 years ago
- Analyse rhyme scheme, metre and form of poems☆132Updated 4 years ago
- Finds linguistic patterns effortlessly☆37Updated last year
- bin files☆13Updated 6 months ago
- Scansion tool for Spanish texts☆12Updated last year
- A workflow system for Natural Language Processing.☆22Updated 5 years ago
- Featurize words into orthographic and phonological vectors.☆41Updated 2 years ago
- Course in Natural Language Processing and Applications☆10Updated 2 years ago
- A language evolution simulator, using realistic phonetic changes.☆38Updated 2 years ago
- The NLG tool for Finnish☆23Updated last year
- Automatically exported from code.google.com/p/guess-language☆52Updated last year
- An expandable and scalable OCR pipeline☆87Updated 7 years ago
- Extract, parse and populate templates from strings☆27Updated 6 years ago
- Dataset used to analyze user preferences of podcast summaries☆8Updated 2 years ago
- Calculates the word error rate of two strings, and the result is written into beautify HTML.☆20Updated 5 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆16Updated this week
- A PDF classifier ensemble with REST API service☆23Updated 4 years ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Updated 6 years ago
- The curation repository for the data behind Concepticon.☆39Updated 2 weeks ago
- A simple interface to the Project Gutenberg corpus.☆17Updated 9 years ago
- Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)☆14Updated 5 years ago
- a latex cheat sheet with ipython commands and shortcuts☆10Updated 11 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 7 years ago
- Feature set algebra for linguistics☆17Updated 2 months ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆65Updated last year
- ☆16Updated 3 years ago