UKPLab / arxiv2025-inherent-limits-plms
Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities"
☆13Updated 2 months ago
Alternatives and similar repositories for arxiv2025-inherent-limits-plms:
Users that are interested in arxiv2025-inherent-limits-plms are comparing it to the libraries listed below
- The official implementation of Preference Data Reward-Augmentation.☆17Updated 5 months ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]☆42Updated 2 months ago
- [Arxiv] Implicit Reasoning in Transformers is Reasoning through Shortcuts☆12Updated 2 weeks ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆38Updated last month
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆119Updated 7 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆13Updated this week
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging☆19Updated last month
- ☆13Updated 3 months ago
- The first dense retrieval model that can be prompted like an LM☆68Updated 6 months ago
- Knowledge Unlearning for Large Language Models☆22Updated 3 weeks ago
- Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…☆28Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆88Updated 8 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆35Updated 6 months ago
- ☆28Updated last year
- ☆42Updated this week
- Exploring Model Kinship for Merging Large Language Models☆23Updated last month
- Combining Base and Instruction-Tuned Language Models for Better Synthetic Data Generation☆26Updated last month
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆61Updated last year
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆31Updated last year
- The official implementation of Cross-Task Experience Sharing (COPS)☆21Updated 5 months ago
- LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆17Updated this week
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…☆36Updated last week
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆50Updated 3 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆57Updated 10 months ago
- CS194-196 Course Project☆13Updated last month
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆39Updated 4 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning?☆22Updated last week
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Updated 5 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated last month
- ☆32Updated 3 weeks ago