hetpandya / textgenieLinks
A python package to augment text data using NLP.
β39Updated 3 months ago
Alternatives and similar repositories for textgenie
Users that are interested in textgenie are comparing it to the libraries listed below
Sorting:
- Tutorial to pretrain & fine-tune a π€ Flax T5 model on a TPUv3-8 with GCPβ58Updated 2 years ago
- Reimplementation of a BERT based model (Shi et al, 2019), currently the state-of-the-art for English SRL. This model implements also predβ¦β70Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.β93Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' puβ¦β40Updated 3 years ago
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatioβ¦β44Updated last year
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.β102Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.β91Updated 2 months ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.β39Updated last year
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statementβ¦β16Updated 3 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER modelsβ33Updated 2 years ago
- This is a repository of the study performed under the Adversarial Paraphrasing Task (APT).β22Updated 3 years ago
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)β41Updated 3 years ago
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)β76Updated 2 years ago
- Zero-shot Transfer Learning from English to Arabicβ29Updated 2 years ago
- FactSumm: Factual Consistency Scorer for Abstractive Summarizationβ110Updated last year
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch β¦β82Updated 2 years ago
- PyTorch-IE: State-of-the-art Information Extraction in PyTorchβ77Updated this week
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.β34Updated 2 years ago
- A tiny BERT for low-resource monolingual modelsβ31Updated 8 months ago
- β34Updated 4 years ago
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)β52Updated 2 years ago
- β76Updated 3 years ago
- β59Updated 2 years ago
- Long-context pretrained encoder-decoder modelsβ94Updated 2 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer modelsβ65Updated 2 years ago
- Build a dialog dataset from online books in many languagesβ73Updated 2 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020β62Updated last year
- Coreference resolution with different higher-order inference methods; implemented in PyTorch.β36Updated 2 years ago
- β68Updated last month
- A collection of preprocessed datasets and pretrained models for generating paraphrases.β29Updated 3 years ago