Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"
☆18Aug 17, 2021Updated 4 years ago
Alternatives and similar repositories for superbizarre
Users that are interested in superbizarre are comparing it to the libraries listed below
Sorting:
- ☆13Nov 28, 2025Updated 3 months ago
- Python Finite-State Toolkit☆60Updated this week
- Code for "Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding" (EMNLP 2020).☆11May 1, 2025Updated 10 months ago
- ☆18Feb 25, 2025Updated last year
- Cross-Linguistic Norms, Ratings, and Relations for Words and Concepts☆16Mar 5, 2026Updated 2 weeks ago
- ☆14May 24, 2022Updated 3 years ago
- Simple command line XML Merge tool☆12Aug 9, 2024Updated last year
- Simple-to-use scoring function for arbitrarily tokenized texts.☆48Feb 19, 2025Updated last year
- ☆19Oct 14, 2021Updated 4 years ago
- ☆16Oct 27, 2025Updated 4 months ago
- Code for pre-training BabyLM baseline models.☆16Jun 19, 2023Updated 2 years ago
- Camel Morph’s goal is to build large open-source morphological models for Arabic and its dialects across many genres and domains.☆15Dec 8, 2024Updated last year
- running LayoutLMv2☆11Apr 27, 2022Updated 3 years ago
- Codebase for probing and visualizing multilingual models.☆49May 13, 2020Updated 5 years ago
- Morfessor EM+Prune☆10Jul 22, 2020Updated 5 years ago
- Runnable morphological analysis tools from the UniMorph project☆16Nov 19, 2018Updated 7 years ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆17Jun 14, 2024Updated last year
- The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation.☆11Oct 28, 2022Updated 3 years ago
- ☆13Aug 29, 2020Updated 5 years ago
- ALBERT trained on Mongolian text corpus☆18Jan 10, 2021Updated 5 years ago
- C++ code of "Learning to Parse and Translate Improves Neural Machine Translation"☆21May 8, 2017Updated 8 years ago
- Potnia is an open-source Python library designed to convert Romanized transliterations of ancient texts into Unicode representations of t…☆24Nov 22, 2025Updated 3 months ago
- ☆22Apr 14, 2025Updated 11 months ago
- Listwise Learning to Rank by Exploring Unique Ratings (WSDM 2020)☆13Nov 2, 2025Updated 4 months ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆76Sep 13, 2023Updated 2 years ago
- A comparison of human attention with computational attention mechanisms☆12Jul 3, 2020Updated 5 years ago
- Modeling the Historical Arabic Hijazi Script☆21May 28, 2022Updated 3 years ago
- decontamination☆27Mar 4, 2026Updated 2 weeks ago
- A library for computing diverse text characteristics and using them to analyze data sets and models with ease.☆41Aug 18, 2022Updated 3 years ago
- ☆20Feb 4, 2024Updated 2 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- Child-Sum Tree-LSTM Implementation in PyTorch☆19Sep 19, 2017Updated 8 years ago
- ARMA cell: a modular and effective approach for neural autoregressive modeling☆17May 29, 2024Updated last year
- ☆17Jun 17, 2025Updated 9 months ago
- Code for "Monocular Depth Estimation via Listwise Ranking using the Plackett-Luce Model" as published at CVPR 2021.☆14Feb 3, 2024Updated 2 years ago
- ☆10Dec 17, 2020Updated 5 years ago
- ☆10Oct 2, 2024Updated last year
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Jan 16, 2022Updated 4 years ago
- senselab is a Python package that simplifies building pipelines for biometric (e.g. speech, voice, video, etc) analysis.☆35Mar 13, 2026Updated last week