JinchaoLove / Docker_Images
Deep learning images developed from nvidia/cuda-cudnn-devel-ubuntu.
☆23Updated 2 years ago
Alternatives and similar repositories for Docker_Images:
Users that are interested in Docker_Images are comparing it to the libraries listed below
- Code for the ACL-2022 paper "StableMoE: Stable Routing Strategy for Mixture of Experts"☆45Updated 2 years ago
- Must-read papers on improving efficiency for pre-trained language models.☆103Updated 2 years ago
- The information of NLP PhD application in the world.☆36Updated 6 months ago
- ☆32Updated 3 years ago
- 计算语言学22-23学年秋季学期 课程大作业baseline实现☆37Updated 2 years ago
- Source code for our EMNLP'21 paper 《Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning》☆58Updated 3 years ago
- Group Meeting Record for Baobao Chang Group in Peking University☆25Updated 3 years ago
- ☆46Updated 3 years ago
- Pytorch implementation of paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021)☆72Updated 3 years ago
- This package implements THOR: Transformer with Stochastic Experts.☆62Updated 3 years ago
- Paradigm shift in natural language processing☆42Updated 2 years ago
- 服务器 GPU 监控程序,当 GPU 属性满足预设条件时通过微信发送提示消息☆30Updated 3 years ago
- ☆39Updated last year
- Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".☆65Updated 3 years ago
- ☆56Updated 2 years ago
- Source code of LatentOps☆78Updated last year
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》☆23Updated 3 years ago
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".☆64Updated last year
- ☆31Updated last year
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation☆41Updated 2 years ago
- my commonly-used tools☆51Updated 2 months ago
- 本文旨在整理文本生成领域国内外工业界和企业家的研究者和研究机构。排名不分先后。更新中,欢迎大家补充☆50Updated 4 years ago
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆82Updated 2 years ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆28Updated 8 months ago
- ☆53Updated 2 years ago
- Source code for COLING 2022 paper "Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models"☆24Updated 2 years ago
- Complexity Based Prompting for Multi-Step Reasoning☆17Updated 2 years ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]☆61Updated 4 months ago
- ☆15Updated 3 years ago