Softcatala / ca-text-corpus
Public domain corpus of Catalan text
☆17 Updated 3 years ago
Alternatives and similar repositories for ca-text-corpus
Users who are interested in ca-text-corpus are comparing it to the repositories listed below.
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language ☆16 Updated last month
- Apertium linguistic data for Catalan ☆11 Updated last week
- Study on lexibank data (presenting the lexibank dataset). ☆13 Updated 2 months ago
- Jason Riggle's chart of phonological features in JSON format + extras ☆53 Updated 11 months ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 Updated 2 years ago
- Deepspeech ASR Model for the Catalan Language ☆17 Updated 4 years ago
- ☆28 Updated 3 weeks ago
- Catalan ALBERT (A Lite BERT for self-supervised learning of language representations) ☆14 Updated 4 years ago
- The curation repository for the data behind Concepticon. ☆39 Updated 3 weeks ago
- A repository containing links to useful phonological software ☆12 Updated 2 years ago
- Official source for Catalan Language Models and resources made within Aina project. ☆24 Updated last year
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project ☆51 Updated 7 months ago
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 Updated last year
- universal syllabification algorithms ☆44 Updated 2 years ago
- VoxAngeles Corpus ☆12 Updated last year
- SegBo: A database of borrowed sounds in the world’s languages ☆16 Updated last year
- Explore RTVE's Telediario news broadcasts since 2014 ☆33 Updated 2 months ago
- The Unicode Cookbook for Linguists ☆54 Updated 4 years ago
- linguistic data on the Yongning Na language ☆8 Updated last week
- Icelandic Treebank ☆23 Updated last year
- bilingual dictionary extractor from parallel corpora ☆22 Updated 10 years ago
- Text-to-Speech converter for Basque and Spanish. It includes linguistic processing and built voices for the aforementioned languages. Its… ☆17 Updated last year
- Breaks a word into syllables using an LSTM-based neural network. ☆20 Updated last year
- Cross-Linguistic Transcription Systems ☆15 Updated 6 months ago
- Scansion tool for Spanish texts ☆12 Updated last year
- ☆32 Updated 3 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests ☆41 Updated 2 years ago
- Python for Linguists – a Gentle Introduction to Programming ☆45 Updated 9 years ago
- Python Finite-State Toolkit ☆56 Updated last week
- Simple CORPORA list crawler ☆10 Updated 8 years ago