UKPLab / arxiv2025-inherent-limits-plms

Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities"

☆13

Alternatives and similar repositories for arxiv2025-inherent-limits-plms

Users interested in arxiv2025-inherent-limits-plms are comparing it to the repositories listed below.


yuleiqin / RAIF
A Recipe for Building LLM Reasoners to Solve Complex Instructions
☆19 Updated 3 weeks ago
shenao-zhang / reward-augmented-preference
The official implementation of Preference Data Reward-Augmentation.
☆17 Updated 2 months ago
cxcscmu / Montessori-Instruct
Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]
☆46 Updated 5 months ago
Geaming2002 / Ruler
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
☆38 Updated 9 months ago
David-Li0406 / Preference-Leakage
☆45 Updated last month
MLLM-Data-Contamination / MM-Detect
This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"
☆16 Updated last week
F2-Song / Weak-to-Strong-Decoding
The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"
☆22 Updated 3 weeks ago
shuzhangzhong / HybriMoE-Preview
☆16 Updated 3 months ago
tianyi-lab / C3PO
Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆16 Updated 3 months ago
ALT-JS / OthelloSAE
CS194-196 Course Project
☆15 Updated 4 months ago
orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆81 Updated 2 months ago
sunblaze-ucb / AgentSynth
AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents
☆24 Updated last month
voidism / Lookback-Lens
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
☆128 Updated 11 months ago
zengxingchen / ChartQA-MLLM
[IEEE VIS 2024] LLaVA-Chart: Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruc…
☆68 Updated 5 months ago
Zoeyyao27 / SirLLM
This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM
☆59 Updated last year
zjunlp / unlearn
[ACL 2025] Knowledge Unlearning for Large Language Models
☆39 Updated 2 months ago
cyzus / thoughtsculpt
☆13 Updated 7 months ago
NuoJohnChen / JudgeLRM
☆30 Updated 3 months ago
YutongWang1216 / DocMTAgent
Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
☆45 Updated 5 months ago
uclaml / COPS
The official implementation of Cross-Task Experience Sharing (COPS)
☆22 Updated 8 months ago
THU-KEG / LongWriter-V
LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models
☆19 Updated 3 months ago
neulab / MultiUI
Code for the paper: Harnessing Webpage UIs for Text-Rich Visual Understanding
☆52 Updated 7 months ago
yihedeng9 / OpenVLThinker
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
☆93 Updated last week
rohinmanvi / Capability-Aware_and_Mid-Generation_Self-Evaluations
☆21 Updated 7 months ago
Quehry / HelloBench
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
☆44 Updated 7 months ago
TianheL / LM-Implicit-Reasoning
[ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts
☆15 Updated 4 months ago
XinyuanLu00 / TART
This is the repository for NAACL'25 paper "TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning"
☆54 Updated 2 months ago
sail-sg / Rigging-ChatbotArena
Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)
☆21 Updated 4 months ago
dinobby / MAgICoRE
☆24 Updated 9 months ago
Hritikbansal / sparse_feedback
☆29 Updated last year