babylm / evaluation-pipeline-2025Links
☆12Updated this week
Alternatives and similar repositories for evaluation-pipeline-2025
Users that are interested in evaluation-pipeline-2025 are comparing it to the libraries listed below
Sorting:
- Morfessor EM+Prune☆10Updated 5 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆26Updated 2 years ago
- Utility for behavioral and representational analyses of Language Models☆157Updated this week
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆47Updated 2 years ago
- Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)☆311Updated 2 years ago
- Implementation of https://srush.github.io/annotated-s4☆500Updated last month
- Interpretability for sequence generation models 🐛 🔍☆432Updated 3 months ago
- Python Finite-State Toolkit☆57Updated last week
- This repository contains the Potsdam Textbook Corpus (PoTeC) which is a natural reading eye-tracking corpus.☆12Updated last month
- The central repo for Creole based NLU and NLG work☆18Updated 3 months ago
- The Benchmark of Linguistic Minimal Pairs☆151Updated 2 years ago
- Simple-to-use scoring function for arbitrarily tokenized texts.☆45Updated 5 months ago
- Catalan bert model☆12Updated 4 years ago
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer☆20Updated 5 years ago
- SIGTYP 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages☆9Updated last year
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning☆33Updated last week
- Massively multilingual pronunciation mining☆347Updated 2 months ago
- A neural dependency parser that does its best☆16Updated this week
- Natural Language Processing Research in North American Linguistics Departments☆21Updated 4 months ago
- A repository for the 2022 Inflection Shared Task☆9Updated 3 years ago
- The CODWOE shared task invites you to compare two types of semantic descriptions: dictionary glosses and word embedding representations. …☆11Updated 3 years ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆271Updated last month
- Bicleaner fork that uses neural networks☆40Updated last month
- ☆35Updated 2 months ago
- Universal Romanizer that can convert any unicode script to roman (latin) script☆214Updated last year
- A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.☆103Updated last year
- Wikipedia text corpus for self-supervised NLP model training☆44Updated 3 years ago
- 🖋 Resource and Tool for Writing System Identification -- LREC 2024☆19Updated last year
- A survey of corpora for Germanic low-resource languages and dialects☆25Updated 8 months ago
- explainable Siamese sentence transformers☆12Updated last year