jwieting/paraphrastic-representations-at-scale

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jwieting/paraphrastic-representations-at-scale)

jwieting / paraphrastic-representations-at-scale

☆74

Alternatives and similar repositories for paraphrastic-representations-at-scale

Users that are interested in paraphrastic-representations-at-scale are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

microsoft / xtreme-distil-transformers
View on GitHub
XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale
☆157Dec 20, 2023Updated 2 years ago
swarnaHub / ExplaGraphs
View on GitHub
[EMNLP 2021] Dataset and PyTorch Code for ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense Reasoning
☆14Nov 5, 2022Updated 3 years ago
HKUST-KnowComp / SubeventWriter
View on GitHub
Official code repository for the main conference paper in EMNLP 2022: SubeventWriter: Iterative Sub-event Sequence Generation with Cohere…
☆11Oct 16, 2022Updated 3 years ago
timoschick / dino
View on GitHub
This repository contains the code for "Generating Datasets with Pretrained Language Models".
☆188Aug 17, 2021Updated 4 years ago
hellohaptik / HINT3
View on GitHub
This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…
☆32Mar 24, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ikergarcia1996 / T-Projection
View on GitHub
T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.
☆13Nov 21, 2023Updated 2 years ago
tqfang / comet-deepspeed
View on GitHub
Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.
☆14Jan 23, 2022Updated 4 years ago
cindyxinyiwang / multiview-subword-regularization
View on GitHub
PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"
☆26Jun 2, 2021Updated 5 years ago
facebookresearch / SentAugment
View on GitHub
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…
☆359Feb 22, 2022Updated 4 years ago
salesforce / DocNLI
View on GitHub
☆69May 1, 2025Updated last year
dhfbk / KIND
View on GitHub
KIND: an Italian Multi-Domain Dataset for Named Entity Recognition
☆13Jun 28, 2023Updated 3 years ago
wietsedv / gpt2-recycle
View on GitHub
As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)
☆48Aug 2, 2021Updated 4 years ago
nng555 / ssmba
View on GitHub
☆61Apr 19, 2022Updated 4 years ago
thestephencasper / explore_establish_exploit_llms
View on GitHub
☆31Jul 14, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
JulesBelveze / bert-squeeze
View on GitHub
🛠️ Tools for Transformers compression using PyTorch Lightning ⚡
☆85Updated this week
amazon-science / contrastive-controlled-mt
View on GitHub
Code and data for the IWSLT 2022 shared task on Formality Control for SLT
☆22May 24, 2023Updated 3 years ago
yanlinf / UXSenti
View on GitHub
Unsupervised Cross-lingual Sentiment Analysis (CoNLL 2019)
☆10Nov 4, 2019Updated 6 years ago
anthonywchen / MOCHA
View on GitHub
Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".
☆16May 3, 2022Updated 4 years ago
AndreasMadsen / nlp-roar-interpretability
View on GitHub
Measuring if attention is explanation with ROAR
☆22Mar 3, 2023Updated 3 years ago
cisnlp / MEXA
View on GitHub
[ACL 2025] 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment
☆11Apr 6, 2025Updated last year
shijie-wu / crosslingual-nlp
View on GitHub
This repo supports various cross-lingual transfer learning & multilingual NLP models.
☆92Sep 13, 2023Updated 2 years ago
UKPLab / gpl
View on GitHub
Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …
☆342Jul 6, 2023Updated 3 years ago
marzenakrp / demetr
View on GitHub
Repository for DEMETR: Diagnosing Evaluation Metrics for Translation
☆17Nov 29, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HKUST-KnowComp / PseudoReasoner
View on GitHub
Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…
☆11Oct 18, 2022Updated 3 years ago
sf-wa-326 / phrase-bert-topic-model
View on GitHub
☆86Dec 5, 2021Updated 4 years ago
MilaNLProc / language-invariant-properties
View on GitHub
☆22Mar 31, 2022Updated 4 years ago
lilakk / PostMark
View on GitHub
Official repository for "PostMark: A Robust Blackbox Watermark for Large Language Models"
☆29Aug 30, 2024Updated last year
khalidsaifullaah / BERTify
View on GitHub
An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.
☆36May 18, 2023Updated 3 years ago
jonathanherzig / semantic-parsing-annotation
View on GitHub
Author implementation of the paper "Don’t paraphrase, detect! Rapid and Effective Data Collection for Semantic Parsing"
☆20Oct 5, 2020Updated 5 years ago
coastalcph / seq2sparql
View on GitHub
Multilingual Compositional Wikidata Questions (MCWQ)
☆20Jun 12, 2023Updated 3 years ago
efficientqa / retrieval-based-baselines
View on GitHub
Tutorials on training and testing retrieval-based models (DrQA & DPR)
☆51Nov 30, 2020Updated 5 years ago
jwieting / para-nmt-50m
View on GitHub
Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions o…
☆105Dec 5, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
acmi-lab / pretraining-with-nonsense
View on GitHub
Pretraining summarization models using a corpus of nonsense
☆13Sep 28, 2021Updated 4 years ago
studio-ousia / bpr
View on GitHub
Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering
☆175Jun 6, 2021Updated 5 years ago
HKUST-KnowComp / AbsPyramid
View on GitHub
Official code repository for the paper: AbsPyramid: Benchmarking the Abstration Ability of Language Models with a Unified Entailment Grap…
☆13Oct 30, 2024Updated last year
CogComp / APSI
View on GitHub
Code for EMNLP 2020 paper: Analogous Process Structure Induction for Sub-event Sequence Prediction
☆11Oct 19, 2020Updated 5 years ago
HKUST-KnowComp / MICO
View on GitHub
This is the code repo for Findings of EMNLP2022 paper: MICO: a multi-alternative contrastive learning framework for commonsense knowledg…
☆10Nov 29, 2022Updated 3 years ago
CPJKU / wechsel
View on GitHub
Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
☆92Sep 12, 2024Updated last year
nateraw / spaces-docker-templates
View on GitHub
🚀🤗 A collection of templates for Hugging Face Spaces
☆35Oct 9, 2023Updated 2 years ago