Softcatala / ca-text-corpus
Public domain corpus of Catalan text
☆16Updated 3 years ago
Alternatives and similar repositories for ca-text-corpus:
Users that are interested in ca-text-corpus are comparing it to the libraries listed below
- Cross-Linguistic Transcription Systems☆14Updated 2 months ago
- universal syllabification algorithms☆43Updated 2 years ago
- Deepspeech ASR Model for the Catalan Language☆17Updated 4 years ago
- Official source for Catalan Language Models and resources made within Aina project.☆23Updated last year
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Updated last year
- Study on lexibank data (presenting the lexibank dataset).☆12Updated last month
- The Unicode Cookbook for Linguists☆53Updated 4 years ago
- Catalan bert model☆12Updated 4 years ago
- R package for phonetic research and experimenting☆20Updated 7 months ago
- Scansion tool for Spanish texts☆11Updated last year
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- SuggestBot is an article recommender for Wikipedia☆21Updated 2 months ago
- Tools and scripts for working with ELAN☆10Updated 2 years ago
- Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)☆14Updated 4 years ago
- The curation repository for the data behind Concepticon.☆38Updated last week
- ☆31Updated 3 years ago
- ☆28Updated this week
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Breaks a word into syllables using an LSTM-based neural network.☆19Updated last year
- linguistics backend☆41Updated last year
- AUTOTYP data export☆41Updated last year
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆47Updated 4 months ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated last year
- CLDF: Cross-Linguistic Data Formats - the specification☆57Updated 10 months ago
- A repository containing links to useful phonological software☆11Updated 2 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆15Updated last week
- American English Pronunciation Dictionary☆34Updated 6 years ago
- Jason Riggle's chart of phonological features in JSON format + extras☆51Updated 8 months ago
- A tool for automatic spelling normalization☆20Updated 4 years ago