Tixierae / OrangeSum
The French summarization dataset introduced in "BARThez: a Skilled Pretrained French Sequence-to-Sequence Model".
☆22Updated 3 years ago
Alternatives and similar repositories for OrangeSum:
Users that are interested in OrangeSum are comparing it to the libraries listed below
- A french sequence to sequence pretrained model☆57Updated 2 years ago
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statement…☆16Updated 3 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 3 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆52Updated last year
- Generate BERT vocabularies and pretraining examples from Wikipedias☆18Updated 4 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 7 months ago
- ☆74Updated 3 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆86Updated last week
- Repro is a library for easily running code from published papers via Docker.☆40Updated last year
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 2 years ago
- A embed able annotation tool for end to end cross document co-reference☆41Updated last year
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆65Updated 2 years ago
- ☆12Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆85Updated 3 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆83Updated last week
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 8 months ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated 2 years ago
- State of the art Semantic Sentence Embeddings☆98Updated 2 years ago
- Fine-tune transformers with pytorch-lightning☆44Updated 2 years ago
- Shared code for training sentence embeddings with Flax / JAX☆27Updated 3 years ago
- Explainable Zero-Shot Topic Extraction☆62Updated 5 months ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆46Updated 2 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆81Updated 2 months ago
- Visualise, evaluate, and manage annotated data☆33Updated 2 years ago
- ☆34Updated last year
- On Generating Extended Summaries of Long Documents☆78Updated 3 years ago
- Semantically Structured Sentence Embeddings☆66Updated 3 months ago
- ☆87Updated 2 years ago