pengr / IKD-MMT
Our code for EMNLP'22 Oral paper "Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation".
☆30Updated last year
Alternatives and similar repositories for IKD-MMT
Users who are interested in IKD-MMT are comparing it to the repositories listed below.
- Our code for ICCV'23 paper "CAME: Contrastive Automated Model Evaluation".☆27Updated last year
- The official implementation of the paper "Data Contamination Calibration for Black-box LLMs" (ACL 2024)☆13Updated last year
- This mathematics course is taught for the first year Ph.D. students of computer science and related areas @zju☆61Updated last year
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆203Updated 6 months ago
- Paper List for In-context Learning 🌷☆183Updated last year
- A Survey on Data Selection for Language Models☆233Updated last month
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆164Updated last year
- Another scheduled task for Zhejiang University (ZJU) health check-ins☆14Updated 3 years ago
- Must-read Papers of Parameter-Efficient Tuning (Delta Tuning) Methods on Pre-trained Models.☆284Updated last year
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).☆327Updated 2 years ago
- ☆48Updated last week
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆61Updated last year
- A paper list about diffusion models for natural language processing.☆182Updated last year
- The code for paper Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models.☆13Updated last year
- Tests the GSM8K score of https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b ☆15Updated last year
- ☆118Updated last year
- ☆13Updated last year
- The repo for In-context Autoencoder☆127Updated last year
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆218Updated 2 weeks ago
- A platform to develop CTM-motivated AI architecture.☆13Updated last week
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆250Updated last month
- This is the official Python version of CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Act…☆16Updated 7 months ago
- ☆198Updated 7 months ago
- ☆138Updated 10 months ago
- Paper collections of retrieval-based (augmented) language model.☆232Updated last year
- ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models☆308Updated last year
- ☆35Updated 6 months ago
- ☆10Updated last year
- ☆24Updated 2 years ago
- 😎 curated list of awesome LMM hallucinations papers, methods & resources.☆149Updated last year