yaoxingcheng/TLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yaoxingcheng/TLM)

yaoxingcheng / TLM

ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework

☆255

Alternatives and similar repositories for TLM

Users that are interested in TLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shuohangwang / Cross-Thought
View on GitHub
☆47Jan 21, 2021Updated 5 years ago
sunyilgdx / NSP-BERT
View on GitHub
The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"
☆230Oct 12, 2022Updated 3 years ago
microsoft / xtreme-distil-transformers
View on GitHub
XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale
☆157Dec 20, 2023Updated 2 years ago
CAMTL / CA-MTL
View on GitHub
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data
☆58Aug 5, 2021Updated 4 years ago
yxuansu / TaCL
View on GitHub
[NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning
☆94Jun 8, 2022Updated 4 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
imgaojun / SWCC4Event
View on GitHub
Code for our ACL2022 paper "Improving Event Representation via Simultaneous Weakly Supervised Contrastive Learning and Clustering".
☆28Nov 14, 2022Updated 3 years ago
timoschick / pet
View on GitHub
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
☆1,625Jun 12, 2023Updated 3 years ago
JetRunner / BERT-of-Theseus
View on GitHub
⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).
☆316Jun 12, 2023Updated 3 years ago
smallbenchnlp / ELECTRA-DeBERTa
View on GitHub
☆16Dec 14, 2022Updated 3 years ago
princeton-nlp / SimCSE
View on GitHub
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
☆3,655Oct 16, 2024Updated last year
princeton-nlp / LM-BFF
View on GitHub
[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
☆727Aug 29, 2022Updated 3 years ago
rrmenon10 / ADAPET
View on GitHub
[EMNLP 2021] Improving and Simplifying Pattern Exploiting Training
☆152Jun 10, 2022Updated 4 years ago
FreddeFrallan / Contrastive-Tension
View on GitHub
State of the art Semantic Sentence Embeddings
☆100May 22, 2022Updated 4 years ago
princeton-nlp / TRIME
View on GitHub
[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674
☆194Jun 14, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
namisan / mt-dnn
View on GitHub
Multi-Task Deep Neural Networks for Natural Language Understanding
☆2,259Mar 7, 2024Updated 2 years ago
amazon-science / sentence-representations
View on GitHub
☆79Jul 11, 2022Updated 4 years ago
princeton-nlp / DinkyTrain
View on GitHub
Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃
☆117Oct 27, 2022Updated 3 years ago
princeton-nlp / MADE
View on GitHub
EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering
☆68Nov 26, 2021Updated 4 years ago
microsoft / fastseq
View on GitHub
An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…
☆433Aug 17, 2022Updated 3 years ago
richarddwang / electra_pytorch
View on GitHub
Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)
☆332Jan 10, 2024Updated 2 years ago
luciusssss / why-learn-shortcut
View on GitHub
[ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts?
☆16Aug 8, 2023Updated 2 years ago
txsun1997 / Black-Box-Tuning
View on GitHub
ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Model…
☆275Nov 8, 2022Updated 3 years ago
shmsw25 / Channel-LM-Prompting
View on GitHub
An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"
☆130Apr 23, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
antofuller / configaformers
View on GitHub
A python library for highly configurable transformers - easing model architecture search and experimentation.
☆48Nov 30, 2021Updated 4 years ago
microsoft / MPNet
View on GitHub
MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf
☆299Sep 11, 2021Updated 4 years ago
facebookresearch / SentAugment
View on GitHub
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…
☆359Feb 22, 2022Updated 4 years ago
ocastel / exact-extract
View on GitHub
☆12Sep 2, 2021Updated 4 years ago
timoschick / dino
View on GitHub
This repository contains the code for "Generating Datasets with Pretrained Language Models".
☆188Aug 17, 2021Updated 4 years ago
airaria / TextPruner
View on GitHub
A PyTorch-based model pruning toolkit for pre-trained language models
☆390Aug 31, 2023Updated 2 years ago
SiyuanWangw / StepwiseQA
View on GitHub
The code of Paper "Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering".
☆22Sep 1, 2022Updated 3 years ago
Shark-NLP / CoNT
View on GitHub
[NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation
☆152May 10, 2023Updated 3 years ago
princeton-nlp / OptiPrompt
View on GitHub
[NAACL 2021] Factual Probing Is [MASK]: Learning vs. Learning to Recall https://arxiv.org/abs/2104.05240
☆168Oct 7, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
AkariAsai / unanswerable_qa
View on GitHub
The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".
☆28Jun 19, 2021Updated 5 years ago
AI-secure / InfoBERT
View on GitHub
[ICLR 2021] "InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective" by Boxin Wang, Shuohang Wang, Y…
☆86Oct 25, 2023Updated 2 years ago
facebookresearch / KILT
View on GitHub
Library for Knowledge Intensive Language Tasks
☆978Mar 31, 2022Updated 4 years ago
laiguokun / Funnel-Transformer
View on GitHub
☆220Jun 8, 2020Updated 6 years ago
TsinghuaAI / CUGE
View on GitHub
☆54Apr 15, 2022Updated 4 years ago
uds-lsv / bert-stable-fine-tuning
View on GitHub
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines
☆138Sep 6, 2023Updated 2 years ago
GEM-benchmark / NL-Augmenter
View on GitHub
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
☆786May 19, 2024Updated 2 years ago