Tixierae / OrangeSum
The French summarization dataset introduced in "BARThez: a Skilled Pretrained French Sequence-to-Sequence Model".
☆22 · Updated 3 years ago
Alternatives and similar repositories for OrangeSum:
Users who are interested in OrangeSum are comparing it to the repositories listed below:
- A French sequence-to-sequence pretrained model ☆59 · Updated 2 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020 ☆62 · Updated 11 months ago
- Source code and data for Like a Good Nearest Neighbor ☆28 · Updated 2 months ago
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statement… ☆16 · Updated 3 years ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆54 · Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆58 · Updated last year
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 · Updated 2 years ago
- Explainable Zero-Shot Topic Extraction ☆62 · Updated 7 months ago
- Repro is a library for easily running code from published papers via Docker. ☆40 · Updated last year
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly. ☆47 · Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 · Updated 2 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP. ☆86 · Updated 2 months ago
- Generate BERT vocabularies and pretraining examples from Wikipedias ☆18 · Updated 4 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021) ☆48 · Updated 3 years ago
- LongSumm - Scientific Document Summarization Task ☆74 · Updated 2 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models ☆65 · Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapers ☆37 · Updated 3 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆153 · Updated 10 months ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆40 · Updated 3 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries ☆19 · Updated 3 years ago
- Training T5 to perform numerical reasoning. ☆23 · Updated 3 years ago
- Dynamic ensemble decoding with transformer-based models ☆29 · Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data; it should also work with any Hugging Face text dataset. ☆93 · Updated 2 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers. ☆51 · Updated last year
- An implementation of GrASP (Shnarch et al., 2017) ☆21 · Updated 2 years ago
- Fine-tune transformers with pytorch-lightning ☆44 · Updated 3 years ago
- A library to synthesize text datasets using Large Language Models (LLM) ☆151 · Updated 2 years ago