English web corpus with 4M tokens and several annotation types
☆26 · Jun 12, 2023 · Updated 2 years ago
Alternatives and similar repositories for amalgum
Users that are interested in amalgum are comparing it to the libraries listed below.
- Repository for DISRPT2019 shared task ☆12 · Sep 5, 2022 · Updated 3 years ago
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory ☆48 · Aug 15, 2025 · Updated 7 months ago
- A tool for automatic comparison and evaluation of RST trees ☆12 · Apr 10, 2025 · Updated 11 months ago
- Repository for DISRPT2021 shared task ☆16 · Sep 5, 2022 · Updated 3 years ago
- ☆36 · Mar 3, 2026 · Updated 2 weeks ago
- Named Entity (NER) annotations of the Hebrew Treebank (Haaretz newspaper) corpus, including: morpheme and token level NER labels, nested … ☆11 · Dec 27, 2021 · Updated 4 years ago
- A multilingual linked idioms data set. ☆21 · May 24, 2018 · Updated 7 years ago
- Pedalion trees ☆12 · Jan 24, 2023 · Updated 3 years ago
- Course Content for PSCI 3300 Political Science Research Methods, Spring 2023 ☆12 · Mar 9, 2023 · Updated 3 years ago
- The Arborator software is aimed at collaboratively annotating dependency corpora. ☆26 · Nov 5, 2019 · Updated 6 years ago
- Princeton WordNet Interface based on Angular.js and Rust ☆15 · Jan 23, 2026 · Updated 2 months ago
- Towards a consolidated LOD vocabulary for linguistic annotations ☆16 · Feb 14, 2026 · Updated last month
- Dependency Grammar Induction ☆18 · Feb 11, 2019 · Updated 7 years ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system. ☆18 · Jan 13, 2026 · Updated 2 months ago
- Planning Seminar and 2016-2017 WS and SS Courses ☆10 · Mar 20, 2019 · Updated 7 years ago
- eXternally configurable REference and Non Named Entity Recognizer ☆17 · Jun 17, 2024 · Updated last year
- ☆11 · Mar 11, 2021 · Updated 5 years ago
- Direct Attentive Dependency Parser ☆55 · Mar 11, 2024 · Updated 2 years ago
- Turn CTS TEI corpora into CEX collection files ☆12 · Jun 16, 2021 · Updated 4 years ago
- All the sources and documentation for Oracc ☆17 · Feb 16, 2026 · Updated last month
- All ontologies used in NIF 2.0 (NIF-Core + vocabulary modules + helper modules) ☆37 · Jun 22, 2017 · Updated 8 years ago
- A PyTorch Reimplementation of https://github.com/kentonl/e2e-coref. ☆12 · May 4, 2019 · Updated 6 years ago
- ☆22 · Mar 29, 2025 · Updated 11 months ago
- data and code for Revisiting Challenges in Data-to-Text Generation with Fact Grounding [INLG19] ☆11 · Nov 17, 2020 · Updated 5 years ago
- Setup for Octo and some experiments with the model ☆12 · Apr 11, 2024 · Updated last year
- ☆11 · Jun 20, 2024 · Updated last year
- MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE written in Perl. It is extended to include se… ☆29 · Feb 21, 2026 · Updated last month
- Tools for scraping, annotating, and parsing morphological information from Wiktionary ☆15 · Oct 19, 2019 · Updated 6 years ago
- Deep Multi-Sensory Object Category Recognition Using Interactive Behavioral Exploration ☆16 · Sep 12, 2019 · Updated 6 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi. ☆26 · Jul 25, 2024 · Updated last year
- various utilities for processing Ancient Greek ☆20 · Mar 14, 2017 · Updated 9 years ago
- This projects hosts an annotated dataset of 39 transcripts of United States presidential election debates annotated with argument compone… ☆12 · Jun 3, 2019 · Updated 6 years ago
- utilities for validating and normalising Ancient Greek text ☆23 · Jul 8, 2020 · Updated 5 years ago
- A character-wise tokenizer for morphologically rich languages ☆31 · Sep 28, 2025 · Updated 5 months ago
- Text collections made available by the CLiGS group. ☆24 · Mar 22, 2022 · Updated 4 years ago
- ☆15 · Mar 16, 2026 · Updated last week
- EVEVALB is a python version of Evalb which is used to score the bracket tree banks. ☆16 · Apr 22, 2019 · Updated 6 years ago
- The dictionary comprised of the Coptic lexicon created by the BBAW and interface by Coptic SCRIPTORIUM. Currently deployed at https://co… ☆32 · Jan 9, 2025 · Updated last year
- ☆10 · Dec 6, 2022 · Updated 3 years ago