Spico197 / watchmenLinks

😎 A simple and easy-to-use toolkit for GPU scheduling.

☆45

Alternatives and similar repositories for watchmen

Users that are interested in watchmen are comparing it to the libraries listed below

Sorting:

pkunlp-icler / ChildTuning
☆33Updated 4 years ago
RenShuhuai-Andy / gpu_lurker
服务器 GPU 监控程序，当 GPU 属性满足预设条件时通过微信发送提示消息
☆32Updated 4 years ago
RunxinXu / ChildTuning
Source code for our EMNLP'21 paper 《Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning》
☆62Updated 4 years ago
TobiasLee / Awesome-Efficient-PLM
Must-read papers on improving efficiency for pre-trained language models.
☆105Updated 3 years ago
ZhuiyiTechnology / GAU-alpha
基于Gated Attention Unit的Transformer模型（尝鲜版）
☆98Updated 2 years ago
RunxinXu / ContrastivePruning
Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》
☆25Updated 3 years ago
JingfengYang / Multi-modal-Deep-Learning
☆73Updated 3 years ago
nuaa-nlp / paper-reading
☆45Updated last week
Spico197 / REx
🎮 A toolkit for Relation Extraction and more...
☆24Updated 6 months ago
JetRunner / MetaDistil
Code for ACL 2022 paper "BERT Learns to Teach: Knowledge Distillation with Meta Learning".
☆87Updated 3 years ago
MANGA-UOFA / LMReward
The official repo for the paper "Teacher Forcing Recovers Reward Functions for Text Generation"
☆31Updated 2 years ago
CMMMU-Benchmark / CMMMU
☆48Updated last year
lemon0830 / promptCSE
code for promptCSE, emnlp 2022
☆11Updated 2 years ago
DRSY / EMO
[ICLR 2024]EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling(https://arxiv.org/abs/2310.04691)
☆126Updated last year
BAAI-WuDao / WuDaoMM
WuDaoMM this is a data project
☆74Updated 3 years ago
BenfengXu / KNNPrompting
Released code for our ICLR23 paper.
☆66Updated 2 years ago
HKUNLP / efficient-attention
[EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling
☆87Updated 2 years ago
wangpf3 / consistent-CoT-distillation
☆43Updated 2 years ago
morningmoni / UniPELT
Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022
☆63Updated 3 years ago
GeneZC / MiniMoE
Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"
☆29Updated 2 years ago
yikangshen / MoA
Mixture of Attention Heads
☆51Updated 3 years ago
simtony / mlrunner
A light-weight script for maintaining a LOT of machine learning experiments.
☆92Updated 3 years ago
TsinghuaAI / CUGE
☆54Updated 3 years ago
lancopku / DynamicKD
Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"
☆41Updated 3 years ago
kugwzk / DiDE
Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”
☆31Updated 2 years ago
twinkle0331 / LGTM
[ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation”(https://arxiv.org/abs/2305.…
☆38Updated 2 years ago
yhy1117 / X-Mixup
Implementation of ICLR 2022 paper "Enhancing Cross-lingual Transfer by Manifold Mixup".
☆21Updated 3 years ago
BohanLi0110 / NLP-DA-Papers
☆23Updated 4 years ago
ymcui / ChatGPT-in-Academia
Policies of scientific publisher and conferences towards large language model (LLM), such as ChatGPT
☆75Updated 2 years ago
princeton-nlp / DinkyTrain
Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃
☆114Updated 3 years ago