Luckfort / CD
[COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?
☆72 Updated 2 months ago
Alternatives and similar repositories for CD:
Users who are interested in CD are comparing it to the libraries listed below:
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective ☆63 Updated last month
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models ☆111 Updated 11 months ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025] ☆43 Updated 2 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning ☆66 Updated last month
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging ☆20 Updated last month
- ☆89 Updated 3 weeks ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning". ☆48 Updated last month
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025) ☆26 Updated last month
- Function Vectors in Large Language Models (ICLR 2024) ☆156 Updated last month
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) ☆57 Updated last year
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction ☆66 Updated 3 weeks ago
- The official implementation of Self-Exploring Language Models (SELM) ☆63 Updated 10 months ago
- This repository contains the code and data for the paper "SelfIE: Self-Interpretation of Large Language Model Embeddings" by Haozhe Chen,… ☆48 Updated 4 months ago
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral) ☆76 Updated 5 months ago
- Code for "Reasoning to Learn from Latent Thoughts" ☆89 Updated 2 weeks ago
- AnchorAttention: Improved attention for LLMs long-context training ☆206 Updated 3 months ago
- Codebase for Instruction Following without Instruction Tuning ☆34 Updated 6 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla… ☆44 Updated last month
- Test-time-training on nearest neighbors for large language models ☆39 Updated 11 months ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers ☆138 Updated last month
- ☆16 Updated this week
- ☆37 Updated last year
- ☆96 Updated 9 months ago
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks" ☆53 Updated 11 months ago
- ☆151 Updated 3 weeks ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting ☆32 Updated last year
- The code of RouterDC ☆57 Updated this week
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆112 Updated 3 weeks ago
- [NeurIPS'24 Spotlight] Observational Scaling Laws ☆54 Updated 6 months ago
- Long Context Extension and Generalization in LLMs ☆53 Updated 6 months ago