Tixierae / OrangeSumLinks
The French summarization dataset introduced in "BARThez: a Skilled Pretrained French Sequence-to-Sequence Model".
☆22Updated 4 years ago
Alternatives and similar repositories for OrangeSum
Users that are interested in OrangeSum are comparing it to the libraries listed below
Sorting:
- A french sequence to sequence pretrained model☆63Updated 3 years ago
- ☆75Updated 4 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- State of the art Semantic Sentence Embeddings☆100Updated 3 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 3 years ago
- Question-answers, collected from Google☆131Updated 4 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆69Updated 4 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆85Updated last week
- Explainable Zero-Shot Topic Extraction☆65Updated last year
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆189Updated 4 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆79Updated 3 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆41Updated 4 years ago
- Few-shot Named Entity Recognition☆121Updated 3 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆206Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆99Updated 10 months ago
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆199Updated 2 years ago
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆69Updated 3 years ago
- Creating class-based TF-IDF matrices☆91Updated 3 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆64Updated 3 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆63Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- ☆88Updated 4 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 4 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 3 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆105Updated 3 years ago
- QED: A Framework and Dataset for Explanations in Question Answering☆119Updated 4 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Updated 3 years ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆204Updated 5 months ago
- ☆22Updated 3 years ago