microsoft/COCO-LM

[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining

☆118

Alternatives and similar repositories for COCO-LM

Users who are interested in COCO-LM are comparing it to the libraries listed below.

ivanmontero / autobot
Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'
☆17 · Mar 14, 2022 · Updated 4 years ago
yanzhangnlp / BSL
Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)
☆30 · Apr 27, 2022 · Updated 4 years ago
lucidrains / coco-lm-pytorch
Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch
☆46 · Mar 3, 2021 · Updated 5 years ago
HKUST-KnowComp / SubeventWriter
Official code repository for the main conference paper in EMNLP 2022: SubeventWriter: Iterative Sub-event Sequence Generation with Cohere…
☆11 · Oct 16, 2022 · Updated 3 years ago
tqfang / comet-deepspeed
Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.
☆14 · Jan 23, 2022 · Updated 4 years ago
teapot123 / Fine-Grained-Entity-Typing
ALIGNIE: Few-Shot Fine-Grained Entity Typing with Automatic Label Interpretation and Instance Generation
☆20 · Dec 12, 2022 · Updated 3 years ago
microsoft / AMOS
[ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators
☆26 · Jul 26, 2023 · Updated 2 years ago
richarddwang / electra_pytorch
Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)
☆332 · Jan 10, 2024 · Updated 2 years ago
princeton-nlp / LM-BFF
[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
☆727 · Aug 29, 2022 · Updated 3 years ago
bloodwass / mixout
Implementation of Mixout with PyTorch
☆75 · Dec 21, 2022 · Updated 3 years ago
microsoft / DeBERTa
The implementation of DeBERTa
☆2,242 · Sep 29, 2023 · Updated 2 years ago
gucci-j / light-transformer-emnlp2021
EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling
☆34 · Nov 21, 2021 · Updated 4 years ago
YJiangcm / PromCSE
[EMNLP 2022] Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning
☆136 · Nov 17, 2023 · Updated 2 years ago
yzhan238 / SeedTopicMine
The source code used for paper "Effective Seed-Guided Topic Discovery by Integrating Multiple Types of Contexts", in WSDM 2023.
☆14 · May 27, 2023 · Updated 3 years ago
eagle705 / awesome-nlp-note
A curated list of resources dedicated to NLP (paper, blogs, note and etc)
☆13 · Nov 30, 2019 · Updated 6 years ago
yumeng5 / RoSTER
[EMNLP 2021] Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training
☆65 · Nov 12, 2021 · Updated 4 years ago
dreasysnail / CoCon
Consistent dialogue generation
☆16 · Oct 26, 2022 · Updated 3 years ago
zjunlp / DART
[ICLR 2022] Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners
☆131 · Dec 7, 2022 · Updated 3 years ago
Giruvegan / generative-camouflaged-spam-detector
generative-camouflaged-spam-detector
☆11 · Aug 20, 2020 · Updated 5 years ago
thunlp / Knowledge-Inheritance
Source code for paper: Knowledge Inheritance for Pre-trained Language Models
☆37 · Apr 24, 2022 · Updated 4 years ago
microsoft / BANG
BANG is a new pretraining model to Bridge the gap between Autoregressive (AR) and Non-autoregressive (NAR) Generation. AR and NAR generat…
☆28 · Feb 6, 2022 · Updated 4 years ago
princeton-nlp / TRIME
[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674
☆194 · Jun 14, 2023 · Updated 3 years ago
lancopku / MUKI
[Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models
☆19 · Mar 16, 2023 · Updated 3 years ago
neulab / knn-transformers
PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…
☆287 · Oct 20, 2022 · Updated 3 years ago
rbiswasfc / kaggle-feedback-effectiveness-3rd-place-solution
3rd Place solution for Feedback Prize - Predicting Effective Arguments Kaggle competition
☆16 · Sep 6, 2022 · Updated 3 years ago
MC-BERT / MC-BERT
☆99 · Jul 7, 2020 · Updated 6 years ago
malteos / scincl
Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)
☆78 · Dec 29, 2025 · Updated 6 months ago
Georgetown-IR-Lab / covid-neural-ir
☆24 · Oct 23, 2020 · Updated 5 years ago
JohnGiorgi / DeCLUTR
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…
☆377 · Apr 21, 2023 · Updated 3 years ago
oriram / spider
☆55 · Jan 18, 2023 · Updated 3 years ago
princeton-nlp / SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
☆3,655 · Oct 16, 2024 · Updated last year
timoschick / one-token-approximation
This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.
☆12 · May 7, 2020 · Updated 6 years ago
cisnlp / MEXA
[ACL 2025] 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment
☆11 · Apr 6, 2025 · Updated last year
tawatawara / atmaCup-11
Repository for the Public 4th / Private 5th place solution to atmaCup #11.
☆12 · Aug 3, 2021 · Updated 4 years ago
microsoft / MetaXL
Meta Representation Transformation for Low-resource Cross-lingual Learning
☆41 · May 5, 2021 · Updated 5 years ago
allenai / acl2022-zerofewshot-tutorial
☆291 · Dec 2, 2022 · Updated 3 years ago
CZWin32768 / XLM-Align
☆36 · Aug 25, 2022 · Updated 3 years ago
HKUST-KnowComp / PseudoReasoner
Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…
☆11 · Oct 18, 2022 · Updated 3 years ago
zhouj8553 / FlipDA
☆67 · May 11, 2024 · Updated 2 years ago