Softcatala / ca-text-corpus
Public domain corpus of Catalan text
☆16Updated 3 years ago
Alternatives and similar repositories for ca-text-corpus:
Users that are interested in ca-text-corpus are comparing it to the libraries listed below
- Apertium linguistic data for Catalan☆10Updated this week
- Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)☆14Updated 4 years ago
- Study on lexibank data (presenting the lexibank dataset).☆12Updated last week
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆16Updated this week
- VoxAngeles Corpus☆11Updated last year
- Cross-Linguistic Transcription Systems☆14Updated 3 months ago
- Deepspeech ASR Model for the Catalan Language☆17Updated 4 years ago
- Official source for Catalan Language Models and resources made within Aina project.☆23Updated last year
- Catalan bert model☆12Updated 4 years ago
- Austronesian Comparative Dictionary☆12Updated 3 months ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆47Updated last year
- Jason Riggle's chart of phonological features in JSON format + extras☆53Updated 9 months ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)☆11Updated 4 years ago
- Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques.☆23Updated last year
- linguistic data on the Yongning Na language☆7Updated this week
- The curation repository for the data behind Concepticon.☆38Updated last month
- Tools for managing Catalan dictionaries☆52Updated this week
- Language Acquisition Research Tools☆41Updated last year
- A web interface for viewing ELAN and FLEx files:☆20Updated last year
- PHOIBLE Online☆42Updated 2 years ago
- Scansion tool for Spanish texts☆12Updated last year
- Python Finite-State Toolkit☆54Updated last month
- A lexicon compiler for non-suffixational morphologies☆12Updated 2 months ago
- The Unicode Cookbook for Linguists☆53Updated 4 years ago
- Tools and scripts for working with ELAN☆10Updated 2 years ago
- ☆22Updated 2 years ago
- Breaks a word into syllables using an LSTM-based neural network.☆19Updated last year
- A living document for all things Common Voice.☆14Updated 9 months ago
- Pre-production releases for Spacy in Catalan☆14Updated 3 years ago
- Bunachar Náisiúnta Moirfeolaíochta | Irish National Morphology Database☆26Updated 9 months ago