Tixierae / OrangeSum
The French summarization dataset introduced in "BARThez: a Skilled Pretrained French Sequence-to-Sequence Model".
☆22Updated 3 years ago
Alternatives and similar repositories for OrangeSum:
Users that are interested in OrangeSum are comparing it to the libraries listed below
- A french sequence to sequence pretrained model☆57Updated 2 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆46Updated 2 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆68Updated 3 years ago
- ☆12Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 9 months ago
- Explainable Zero-Shot Topic Extraction☆62Updated 6 months ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆79Updated 7 months ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆102Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated last month
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆105Updated 10 months ago
- LongSumm - Scientific Document Summarization Task☆74Updated 2 years ago
- French Machine Reading for Question Answering☆18Updated 2 years ago
- Zero-shot Transfer Learning from English to Arabic☆29Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 8 months ago
- Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform☆17Updated 4 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆86Updated last month
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 3 years ago
- Universal Semantic Annotator (LREC 2022)☆16Updated 3 weeks ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- Dynamic ensemble decoding with transformer-based models☆29Updated last year
- ☆68Updated 3 years ago
- 🧪 Cutting-edge experimental spaCy components and features☆96Updated 9 months ago
- ☆13Updated 3 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Updated 3 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- ☆16Updated 2 years ago
- ☆74Updated 3 years ago