bojone / LST-CLUE
A simple attempt at Ladder Side-Tuning on CLUE
☆19Updated 2 years ago
Related projects
Alternatives and complementary repositories for LST-CLUE
- Transformer model based on the Gated Attention Unit (early-preview version)☆97Updated last year
- Source code for our EMNLP'21 paper "Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning"☆56Updated 3 years ago
- Arena Competition 3: example code and baseline implementation for the large-scale pre-training fine-tuning competition☆38Updated 2 years ago
- ☆64Updated 6 months ago
- [EMNLP 2023] C-STS: Conditional Semantic Textual Similarity☆66Updated 5 months ago
- Code for the paper "RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement"☆29Updated 10 months ago
- ☆59Updated last year
- ☆32Updated 3 years ago
- ☆36Updated last year
- Our code will be public soon.☆26Updated last year
- NTK scaled version of ALiBi position encoding in Transformer.☆67Updated last year
- SuperCLUE-Math6: exploring a new generation of native Chinese multi-turn, multi-step mathematical reasoning datasets☆46Updated 9 months ago
- WuDaoMM: a data project☆66Updated 2 years ago
- Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"☆28Updated last year
- Server GPU monitoring program that sends a notification via WeChat when GPU attributes meet preset conditions☆28Updated 3 years ago
- ☆44Updated this week
- ☆56Updated 2 years ago
- A paper list of pre-trained language models (PLMs).☆79Updated 2 years ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆70Updated 9 months ago
- Plug-and-Play Document Modules for Pre-trained Models☆25Updated last year
- Feeling confused about super alignment? Here is a reading list☆43Updated 10 months ago
- Released code for our ICLR23 paper.☆63Updated last year
- ☆71Updated 10 months ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆38Updated last year
- 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training☆88Updated last month
- ☆53Updated 4 months ago
- Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"☆40Updated 2 years ago
- ☆59Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Updated last year
- my commonly-used tools☆47Updated 3 months ago