Softcatala / ca-text-corpusLinks
Public domain corpus of Catalan text
☆18Updated 3 years ago
Alternatives and similar repositories for ca-text-corpus
Users that are interested in ca-text-corpus are comparing it to the libraries listed below
Sorting:
- Study on lexibank data (presenting the lexibank dataset).☆15Updated 6 months ago
- Official source for Catalan Language Models and resources made within Aina project.☆25Updated 2 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11Updated 3 years ago
- VoxAngeles Corpus☆13Updated last month
- Cross-Linguistic Transcription Systems☆16Updated 10 months ago
- A Python package for processing research with Minimalist grammars☆21Updated 3 years ago
- Jason Riggle's chart of phonological features in JSON format + extras☆54Updated last year
- Deepspeech ASR Model for the Catalan Language☆17Updated 4 years ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Updated 6 years ago
- Breaks a word into syllables using an LSTM-based neural network.☆20Updated 2 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 3 years ago
- Apertium linguistic data for Catalan☆11Updated last month
- Markdown template for Dataseets for Datasets☆63Updated 3 years ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)☆13Updated 5 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆16Updated last week
- Proposed splits for the LREC Wikipron paper☆15Updated 5 years ago
- Neural based model for automatic diacritics restoration.☆25Updated 6 years ago
- phone inventory library☆17Updated 2 years ago
- ☆13Updated 2 years ago
- A lexicon compiler for non-suffixational morphologies☆13Updated 2 months ago
- Acoustic and language models for minorised languages.☆26Updated 5 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆25Updated last year
- ☆10Updated 4 years ago
- Expected edit distance implementation using OpenFst tools☆11Updated 10 years ago
- Grapheme to phoneme converter for Estonian☆14Updated 4 years ago
- The Unicode Cookbook for Linguists☆56Updated 4 years ago
- A repository containing links to useful phonological software☆12Updated 2 years ago
- Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)☆14Updated 5 years ago
- Gamma Agreement in Python☆45Updated last year
- Faster, modernized fork of the language identification tool langid.py☆59Updated 10 months ago