ElvishElvis / LCA-on-the-line
LCA-on-the-line (ICML 2024 Oral)
☆13 · Feb 13, 2025 · Updated last year
Alternatives and similar repositories for LCA-on-the-line
Users who are interested in LCA-on-the-line are comparing it to the repositories listed below
- LongAttn: Selecting Long-context Training Data via Token-level Attention ☆15 · Jul 16, 2025 · Updated 7 months ago
- This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu. ☆17 · Sep 13, 2024 · Updated last year
- Research work aimed at addressing the problem of modeling infinite-length context ☆46 · Dec 18, 2025 · Updated 2 months ago
- Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence ☆56 · Nov 11, 2025 · Updated 3 months ago
- [ICLR 25] A novel framework for building intrinsically interpretable LLMs with human-understandable concepts to ensure safety, reliabilit… ☆30 · Feb 5, 2026 · Updated last week
- The loss landscape of Large Language Models resembles a basin! ☆36 · Jul 8, 2025 · Updated 7 months ago
- Source code of paper "KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing" ☆31 · Oct 24, 2024 · Updated last year
- ☆68 · Dec 8, 2025 · Updated 2 months ago
- This repository houses the code for the paper - "The Neglected of VLMs" ☆30 · Dec 31, 2025 · Updated last month
- [ICML 2024] Official code repo for ICML2024 paper "Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with … ☆32 · Jun 21, 2024 · Updated last year
- Official implementation for "Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning" ☆12 · Jun 20, 2025 · Updated 7 months ago
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continued pre-training to improve … ☆37 · May 31, 2025 · Updated 8 months ago
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective