This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Echoes from Alexandria: A Large Resource for Multilingual Book Summarization".
☆10Apr 14, 2025Updated 10 months ago
Alternatives and similar repositories for echoes-from-alexandria
Users that are interested in echoes-from-alexandria are comparing it to the libraries listed below
Sorting:
- Word Sense Linking model is designed to identify and disambiguate spans of text to their most suitable senses from a reference inventory.☆13Aug 23, 2024Updated last year
- ☆15Dec 26, 2024Updated last year
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…☆25Oct 15, 2025Updated 4 months ago
- Poetry Corpora Annotated on Aesthetic Emotions☆12Aug 2, 2022Updated 3 years ago
- FENICE (Factuality Evaluation of Summarization based on Natural Language Inference and Claim Extraction) is a factuality-oriented metric …☆29Nov 29, 2024Updated last year
- ☆64Jun 10, 2025Updated 8 months ago
- Find informative examples to efficiently (human)-evaluate NLG models.☆18Feb 27, 2026Updated last week
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆26Feb 16, 2026Updated 2 weeks ago
- <혼자 만들면서 공부하는 파이썬> 책의 깃허브 자료실☆15Jan 14, 2026Updated last month
- Code and data supporting "NovelTM Data Sets for English-Language Fiction."☆26Dec 22, 2020Updated 5 years ago
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"☆26Jun 3, 2025Updated 9 months ago
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆26May 1, 2022Updated 3 years ago
- A Word Level Transformer layer based on PyTorch and 🤗 Transformers.☆34Jan 31, 2024Updated 2 years ago
- Linear Attention for Efficient Bidirectional Sequence Modeling☆15May 13, 2025Updated 9 months ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 7 months ago
- 깃헙에 NLP 잔디심기 시즌 5☆10Aug 19, 2024Updated last year
- 한국어 소설 텍스트를 위한 자연어처리 라이브러리입니다. Natural Language Processing Library for Korean Literary Text. (Will be open in February, 2024)☆11Jan 16, 2024Updated 2 years ago
- ☆14Feb 19, 2024Updated 2 years ago
- Mastering-NLP-from-Foundations-to-LLMs☆10Apr 11, 2025Updated 10 months ago
- ☆10Oct 2, 2024Updated last year
- Long-context pretrained encoder-decoder models☆96Oct 28, 2022Updated 3 years ago
- Collection of description of concepts, procedures, and simple XSLT files for text processing, e.g. simplify InDesign documents (.idml) to…☆12Jan 9, 2020Updated 6 years ago
- Within-book topic modeling on HTRC feature extraction files☆23May 3, 2016Updated 9 years ago
- 0-Shot Tokenizer Transplant☆14May 16, 2025Updated 9 months ago
- pydistinto - a Python implementation of different measures of distinctiveness for contrastive text analysis☆11May 15, 2025Updated 9 months ago
- ☆11Oct 12, 2023Updated 2 years ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- Digitale Geisteswissenschaften rund um Graphentechnologien☆10Feb 12, 2026Updated 3 weeks ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets☆10Dec 12, 2023Updated 2 years ago
- Word embeddings from PPMI-weighted and dirichlet-smoothed co-occurrence matrices☆10Aug 3, 2020Updated 5 years ago
- Literary Language Toolkit: code, models, corpora, and web tools☆11Mar 28, 2024Updated last year
- NLP Preprocessing Pipeline Wrappers☆11May 12, 2023Updated 2 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- [TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis☆11Nov 14, 2024Updated last year
- ☆10Sep 9, 2024Updated last year
- 《7가지 프로젝트로 배우는 LLM AI 에이전트 개발》 추가 지원 저장소☆15Apr 1, 2025Updated 11 months ago
- ☆13Nov 28, 2025Updated 3 months ago