Softcatala / ca-text-corpus
Public domain corpus of Catalan text
☆16Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for ca-text-corpus
- Deepspeech ASR Model for the Catalan Language☆17Updated 3 years ago
- Catalan bert model☆12Updated 4 years ago
- Cross-Linguistic Transcription Systems☆14Updated 7 months ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆15Updated 2 weeks ago
- The curation repository for the data behind Concepticon.☆34Updated this week
- Python Finite-State Toolkit☆45Updated last week
- Wiktionary parser tool for many language editions.☆53Updated 2 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- ☆10Updated 3 years ago
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆12Updated last year
- Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques.☆23Updated last year
- A repository containing links to useful phonological software☆11Updated last year
- Gamma Agreement in Python☆43Updated 8 months ago
- A lexicon compiler for non-suffixational morphologies☆11Updated 4 months ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆86Updated 10 months ago
- 🐍🍑 Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, …☆18Updated 4 months ago
- ☆22Updated 2 years ago
- phone inventory library☆15Updated last year
- Austronesian Comparative Dictionary☆11Updated last year
- Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)☆13Updated 4 years ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆43Updated last year
- My public domain speech index☆10Updated 5 years ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- ☆27Updated this week
- BurrMill core☆21Updated 3 years ago
- Python for Linguists – a Gentle Introduction to Programming☆44Updated 8 years ago
- Python library to parse Apertium stream format☆13Updated last year
- linguistics tree drawing to SVG in python, aimed at Jupyter☆62Updated 3 months ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated 11 months ago