pengr / IKD-MMT
Our code for EMNLP'22 Oral paper "Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation".
☆30 Updated 9 months ago
Related projects
Alternatives and complementary repositories for IKD-MMT
- Our code for ICCV'23 paper "CAME: Contrastive Automated Model Evaluation". ☆26 Updated 9 months ago
- The official implementation of the paper "Data Contamination Calibration for Black-box LLMs" (ACL 2024) ☆12 Updated 6 months ago
- This mathematics course is taught for the first year Ph.D. students of computer science and related areas @zju ☆58 Updated 6 months ago
- A paper list about diffusion models for natural language processing. ☆174 Updated last year
- Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022) ☆516 Updated 2 years ago
- Must-read Papers of Parameter-Efficient Tuning (Delta Tuning) Methods on Pre-trained Models. ☆277 Updated last year
- [ICLR 2022] Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners ☆128 Updated last year
- [ICLR 2024] Code for the paper "Sparse MoE with Language-Guided Routing for Multilingual Machine Translation" ☆8 Updated 6 months ago
- Paper collections of retrieval-based (augmented) language model. ☆230 Updated 5 months ago
- The code for paper Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models. ☆13 Updated 7 months ago
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers… ☆52 Updated last week
- [MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models ☆279 Updated 3 weeks ago
- Paper List for In-context Learning 🌷 ☆171 Updated 9 months ago
- [ICLR 2024] EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling (https://arxiv.org/abs/2310.04691) ☆114 Updated 8 months ago
- 😎 up-to-date & curated list of awesome LMM hallucinations papers, methods & resources. ☆146 Updated 8 months ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment ☆231 Updated 6 months ago
- ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models ☆298 Updated 9 months ago
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling ☆79 Updated last year
- Source code of LatentOps ☆77 Updated last year
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning ☆76 Updated this week
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning ☆153 Updated 9 months ago
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning ☆33 Updated 9 months ago
- [EMNLP 2023 Main] Sparse Low-rank Adaptation of Pre-trained Language Models ☆70 Updated 8 months ago
- Survey on Data-centric Large Language Models ☆65 Updated 4 months ago