hetpandya / textgenie
A Python package to augment text data using NLP.
☆39Updated 11 months ago
Alternatives and similar repositories for textgenie
Users who are interested in textgenie are comparing it to the libraries listed below.
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆105Updated 3 years ago
- KeyPhraseTransformer lets you quickly extract key phrases, topics, themes from your text data with T5 transformer | Keyphrase extraction…☆105Updated last year
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Updated 2 years ago
- Fine-tuning GPT-2 Small for Question Answering☆130Updated 3 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 3 years ago
- An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo☆280Updated 2 years ago
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper☆148Updated 8 months ago
- NewsQuizQA is a quiz-style question-answer dataset used for generating quiz questions about the news☆35Updated 4 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆98Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Updated 2 years ago
- Use BERT to Fill in the Blanks☆84Updated 4 years ago
- ☆184Updated 2 years ago
- Comprehensive NLP Evaluation System☆188Updated last year
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆63Updated last year
- A Framework for Textual Entailment based Zero Shot text classification☆153Updated last year
- Abstractive and Extractive Text summarization using Transformers.☆86Updated 2 years ago
- PyTorch-IE: State-of-the-art Information Extraction in PyTorch☆77Updated 3 months ago
- Few-shot Named Entity Recognition☆121Updated 3 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- RaKUn 2.0 - A fast keyword detection algorithm☆69Updated 5 months ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆120Updated 4 years ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.☆37Updated 4 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆63Updated last year
- Build a dialog dataset from online books in many languages☆76Updated 3 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆41Updated 4 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Updated 2 years ago
- Tutorial for first-time BERT users☆103Updated 3 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆156Updated 3 years ago