☆879May 24, 2024Updated 2 years ago
Alternatives and similar repositories for R-Drop
Users that are interested in R-Drop are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- R-Drop方法在中文任务上的简单实验☆90Mar 2, 2022Updated 4 years ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,651Oct 16, 2024Updated last year
- Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer☆541Dec 10, 2021Updated 4 years ago
- SimCSE在中文任务上的简单实验☆605Aug 7, 2023Updated 2 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,159Jan 22, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,108May 9, 2024Updated 2 years ago
- ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab☆2,044Mar 19, 2024Updated 2 years ago
- 简单的向量白化改善句向量质量☆486Jun 17, 2021Updated 4 years ago
- OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing model pruning technique and continuing pretraining from OpenBA-1…☆25May 10, 2024Updated 2 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,701May 8, 2023Updated 3 years ago
- TensorFlow implementation of On the Sentence Embeddings from Pre-trained Language Models (EMNLP 2020)☆535May 19, 2021Updated 5 years ago
- RoFormer V1 & V2 pytorch☆526May 18, 2022Updated 4 years ago
- NEZHA: Neural Contextualized Representation for Chinese Language Understanding☆259Aug 13, 2021Updated 4 years ago
- DeepIE: Deep Learning for Information Extraction☆1,938Dec 9, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- LightSeq: A High Performance Library for Sequence Processing and Generation☆3,301May 16, 2023Updated 3 years ago
- code for ACL 2020 paper: FLAT: Chinese NER Using Flat-Lattice Transformer☆1,003May 10, 2022Updated 4 years ago
- Adversarial Training for Natural Language Understanding☆252Sep 6, 2023Updated 2 years ago
- Named Entity Recognition as Dependency Parsing☆351Aug 16, 2023Updated 2 years ago
- The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"☆231Oct 12, 2022Updated 3 years ago
- ☆20Feb 26, 2021Updated 5 years ago
- keras implement of transformers for humans☆5,417Nov 11, 2024Updated last year
- Code for ACL 2020 paper `A Unified MRC Framework for Named Entity Recognition`☆679Jun 12, 2023Updated 2 years ago
- a bert for retrieval and generation☆860Feb 26, 2021Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- RoBERTa中文预训练模型: RoBERTa for Chinese☆2,792Jul 22, 2024Updated last year
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型☆3,982Nov 21, 2022Updated 3 years ago
- Code for paper "Vocabulary Learning via Optimal Transport for Neural Machine Translation"☆442Feb 2, 2022Updated 4 years ago
- 3000000+语义理解 与匹配数据集。可用于无监督对比学习、半监督学习等构建中文领域效果最好的预训练模型☆314Oct 11, 2022Updated 3 years ago
- 中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆4,262Feb 6, 2026Updated 4 months ago
- Must-read papers on prompt-based tuning for pre-trained language models.☆4,314Jul 17, 2023Updated 2 years ago
- Code for NAACL 2022 main conference paper "Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation"☆12May 8, 2023Updated 3 years ago
- Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)☆10,212Apr 19, 2026Updated last month
- FewCLUE 小样本学习测评基准,中文版☆518Sep 21, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- 以词为基本单位的中文BERT☆476Nov 18, 2021Updated 4 years ago
- Multi-Task Deep Neural Networks for Natural Language Understanding☆2,257Mar 7, 2024Updated 2 years ago
- Code for ACL 2021 paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information"☆567Jul 26, 2023Updated 2 years ago
- using bilstm-crf,bert and other methods to do sequence tagging task☆415Jun 12, 2023Updated 2 years ago
- Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)☆1,435Apr 19, 2026Updated last month
- Code for using and evaluating SpanBERT.☆905Jul 25, 2023Updated 2 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,226Sep 30, 2025Updated 8 months ago