The ScriptBase Corpus
☆47May 17, 2018Updated 7 years ago
Alternatives and similar repositories for scriptbase
Users that are interested in scriptbase are comparing it to the libraries listed below
Sorting:
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer☆21Jan 17, 2020Updated 6 years ago
- MG top-down beam parsing☆13Jul 2, 2018Updated 7 years ago
- A Parallel Russian-Simple Russian Dataset☆15Mar 30, 2023Updated 2 years ago
- code☆14Jun 21, 2020Updated 5 years ago
- A lexicon compiler for non-suffixational morphologies☆13Jan 29, 2026Updated last month
- Datasets for the task of tracing diachronic semantic shifts in Russian for two large-scale time period pairs (from pre-Soviet to Soviet t…☆14Feb 21, 2025Updated last year
- Arabic data☆15Nov 18, 2025Updated 3 months ago
- Codebase for public release of the plug-and-blend framework.☆23Mar 29, 2022Updated 3 years ago
- RWebPPL, an R interface to Webppl http://webppl.org☆22Oct 23, 2023Updated 2 years ago
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆24Aug 23, 2019Updated 6 years ago
- Code for the paper 'Weighting Finite State Transductions with Neural Context', Pushpendre Rastogi, Ryan Cotterell, Jason Eisner☆29May 11, 2019Updated 6 years ago
- Probing suite for evaluation of Russian embedding and language models☆33Oct 1, 2024Updated last year
- ☆33Nov 25, 2020Updated 5 years ago
- This repository is about how to build an SQLite version of the Arabic WordNet database.☆10Mar 19, 2019Updated 6 years ago
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch☆10Aug 7, 2024Updated last year
- A CardDAV to IP phones converter for Node.js (AVM FRITZ!Box, Snom XCAP, Yealink)☆14Sep 30, 2025Updated 5 months ago
- Shell script to manage multiple Microsoft Teams profiles on Linux.☆12Mar 3, 2021Updated 4 years ago
- Named Entity (NER) annotations of the Hebrew Treebank (Haaretz newspaper) corpus, including: morpheme and token level NER labels, nested …☆10Dec 27, 2021Updated 4 years ago
- maps are everything.☆10Jul 3, 2025Updated 7 months ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- ☆13Feb 17, 2026Updated last week
- Old book pages (with groundtruth), formerly used for OCR studies. There are several versions of the set (concerning resolution and binari…☆15Aug 25, 2017Updated 8 years ago
- Dutch abusive language data☆11Sep 23, 2023Updated 2 years ago
- Trains small LMs. Designed for training on SimpleStories☆12Sep 15, 2025Updated 5 months ago
- Repository for the "Computational Cognitive Modeling and Linguistic Theory" (Brasoveanu & Dotlacil 2020) book☆48Aug 1, 2021Updated 4 years ago
- Statistical discontinuous constituent parsing☆11Feb 15, 2018Updated 8 years ago
- ☆39Updated this week
- Phonetically balanced text to speech sentences☆10Aug 16, 2021Updated 4 years ago
- Patterns in NYT production from 1987 to 2007☆11Nov 6, 2017Updated 8 years ago
- Code and data associated with our LREC 2018 and COLING 2018 papers on converting between emotion formats☆10Dec 15, 2022Updated 3 years ago
- Random forests for longitudinal data using stochastic semiparametric miced-model☆11May 15, 2022Updated 3 years ago
- Python Module implementing SRP☆12Jul 29, 2022Updated 3 years ago
- Library to easy handle Djvu files in swift iOS☆11Sep 1, 2023Updated 2 years ago
- VoxAngeles Corpus☆13Aug 23, 2025Updated 6 months ago
- OCaml PPX extension for automatically generating Irmin types☆11Jan 14, 2020Updated 6 years ago
- A python library for easily querying morphological inflection models trained on Unimorph☆13Oct 23, 2022Updated 3 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- The grapheme to phoneme model converts Kazakh(Arab|Cyrillic) characters to phonemes.☆12Sep 30, 2019Updated 6 years ago
- Pytorch implementation of standard metrics for clustering☆10Mar 21, 2023Updated 2 years ago