Babelscape / echoes-from-alexandriaLinks
This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Echoes from Alexandria: A Large Resource for Multilingual Book Summarization".
☆11Updated 3 months ago
Alternatives and similar repositories for echoes-from-alexandria
Users that are interested in echoes-from-alexandria are comparing it to the libraries listed below
Sorting:
- Find informative examples to efficiently (human)-evaluate NLG models.☆12Updated 2 weeks ago
- Poetry Corpora Annotated on Aesthetic Emotions☆11Updated 2 years ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".☆13Updated 3 years ago
- A software for transferring pre-trained English models to foreign languages☆18Updated 2 years ago
- ☆15Updated 2 years ago
- ParaNames: A multilingual resource for parallel names☆34Updated last year
- Data for the HIPE 2022 shared task.☆20Updated last year
- ☆13Updated 3 years ago
- Text Extraction Formulation + Feedback Loop for state-of-the-art WSD (EMNLP 2021)☆53Updated 3 years ago
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)☆76Updated 2 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆82Updated 10 months ago
- Official Implementation for Seq2seq is All You Need For Coreference Resolution Paper☆16Updated last year
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Updated 2 years ago
- A package for handy processing of semantic graphs such as AMR, with a special focus on standardized evaluation☆24Updated 2 months ago
- A survey of corpora for Germanic low-resource languages and dialects☆25Updated 7 months ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated 2 years ago
- Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022☆13Updated 2 years ago
- X-SRL Dataset. Including the code for the SRL annotation projection tool and an out-of-the-box word alignment tool based on Multilingual …☆15Updated 4 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆27Updated 3 years ago
- ☆16Updated 2 years ago
- Controllable Sentence Simplification with T5☆17Updated 2 years ago
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆13Updated last year
- GSRL is a seq2seq model for end-to-end dependency- and span-based SRL (IJCAI2021).☆18Updated 3 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆63Updated 3 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 3 years ago
- This is official code for the NAACL 2021 paper: "MelBERT: Metaphor Detection via Contextualized Late Interaction usingMetaphorical Identi…☆51Updated 2 years ago
- ☆30Updated 2 years ago
- A library of translation-based text similarity measures☆25Updated last year
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆89Updated 2 years ago
- MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models. (EMNLP 2024 Findings)☆13Updated 9 months ago