Tixierae / OrangeSumLinks
The French summarization dataset introduced in "BARThez: a Skilled Pretrained French Sequence-to-Sequence Model".
☆23Updated 4 years ago
Alternatives and similar repositories for OrangeSum
Users that are interested in OrangeSum are comparing it to the libraries listed below
Sorting:
- A french sequence to sequence pretrained model☆62Updated 2 years ago
- State of the art Semantic Sentence Embeddings☆99Updated 3 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆105Updated 3 years ago
- ☆75Updated 4 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆94Updated 4 months ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆69Updated 3 years ago
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆47Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statement…☆16Updated 4 years ago
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆68Updated 3 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Updated 4 months ago
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 2 years ago
- 🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings☆58Updated 2 years ago
- Question-answers, collected from Google☆129Updated 4 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆87Updated 2 months ago
- ☆13Updated 3 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 4 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆81Updated last year
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated last year
- QED: A Framework and Dataset for Explanations in Question Answering☆117Updated 4 years ago
- Semantically Structured Sentence Embeddings☆66Updated 9 months ago
- 🧪 Cutting-edge experimental spaCy components and features☆100Updated last year
- Fine-tune transformers with pytorch-lightning☆44Updated 3 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago