chrislemke / deep-martin
Text simplification for a better world: Deep-Martin Transformer
☆22 · Updated last year
Alternatives and similar repositories for deep-martin
Users who are interested in deep-martin are comparing it to the libraries listed below.
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆33 · Updated 2 months ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆36 · Updated 3 years ago
- Using short models to classify long texts ☆21 · Updated 2 years ago
- MAFAND-MT ☆57 · Updated last year
- Abstractive and Extractive Text summarization using Transformers. ☆84 · Updated 2 years ago
- ☆31 · Updated 2 years ago
- ☆48 · Updated last year
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati… ☆39 · Updated 2 years ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal … ☆32 · Updated 4 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆79 · Updated last year
- Embedding Recycling for Language models ☆39 · Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset. ☆93 · Updated 2 years ago
- Supplementary material for "Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to Adapters" ☆46 · Updated 2 years ago
- Crosslingual Question Answering for African Languages ☆31 · Updated 9 months ago
- Information extraction pipeline containing coreference resolution, named entity linking, and relationship extraction ☆81 · Updated 4 years ago
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints" ☆27 · Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM) ☆152 · Updated 2 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets. ☆57 · Updated 2 years ago
- A series of notebooks demonstrating how to build simple NLP web apps with Gradio and Hugging Face transformers ☆45 · Updated 3 years ago
- Benchmarking various Deep Learning models such as BERT, ALBERT, BiLSTMs on the task of sentence entailment using two datasets - MultiNLI … ☆28 · Updated 4 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences ☆63 · Updated last year
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188) ☆61 · Updated 2 years ago
- Tools for managing datasets for governance and training. ☆85 · Updated last month
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q… ☆88 · Updated last year
- Financial Domain Question Answering with pre-trained BERT Language Model ☆126 · Updated last week
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM" ☆59 · Updated 8 months ago
- Developing tools to automatically analyze datasets ☆74 · Updated 8 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling. ☆59 · Updated 11 months ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus ☆15 · Updated last year
- Explainable Zero-Shot Topic Extraction ☆63 · Updated 11 months ago