[COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?
☆83Jan 22, 2025Updated last year
Alternatives and similar repositories for CD
Users that are interested in CD are comparing it to the libraries listed below
Sorting:
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆47May 11, 2025Updated 10 months ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆18Feb 29, 2024Updated 2 years ago
- Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning☆27Mar 23, 2025Updated 11 months ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆37Oct 1, 2025Updated 5 months ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- PyTorch code for WWW 19 paper: On Attribution of Recurrent Neural Network Predictions via Additive Decomposition☆11Mar 18, 2021Updated 5 years ago
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning☆52Oct 17, 2025Updated 5 months ago
- ☆19Mar 25, 2025Updated 11 months ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- ☆12Jan 19, 2026Updated 2 months ago
- ☆19Jan 3, 2025Updated last year
- The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.☆25Jan 30, 2024Updated 2 years ago
- JoinAI是一个开源仓库,专注于算法工程能力的培养,包括工程和数学原理的整理☆11Apr 20, 2025Updated 11 months ago
- ☆20Oct 12, 2024Updated last year
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…☆23Mar 16, 2025Updated last year
- ☆13Jan 22, 2025Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆92Jul 21, 2024Updated last year
- ☆12Sep 23, 2023Updated 2 years ago
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆26Feb 25, 2025Updated last year
- ☆33Oct 13, 2025Updated 5 months ago
- Steering LLM Thinking with Budget Guidance☆27Feb 19, 2026Updated last month
- [COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMs☆64Mar 9, 2026Updated last week
- ☆21Jul 1, 2024Updated last year
- ☆36Oct 10, 2024Updated last year
- ☆28Feb 10, 2025Updated last year
- [EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners☆26Dec 11, 2024Updated last year
- FuseAI Project☆592Jan 25, 2025Updated last year
- [arXiv 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"☆16Apr 3, 2025Updated 11 months ago
- ☆33May 9, 2025Updated 10 months ago
- ☆13Jan 19, 2026Updated 2 months ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Oct 4, 2024Updated last year
- ☆25Feb 20, 2026Updated last month
- SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data☆21Jan 24, 2026Updated last month
- ☆44May 17, 2020Updated 5 years ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆84Nov 27, 2024Updated last year
- ☆104Aug 8, 2024Updated last year
- 3D Traffic Light & Sign Dataset☆25Mar 24, 2025Updated 11 months ago
- Code for paper "Personalized Counterfactual Fairness in Recommendation" (a.k.a. "Towards Personalized Fairness based on Causal Notion")☆18Nov 5, 2021Updated 4 years ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆126May 7, 2024Updated last year