EIT-NLP / 2D-Coordinate-System-for-ICL
[EMNLP 2024 Main] Official implementation of the paper "Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mechanism". (by Anhao Zhao)
☆17Updated 5 months ago
Alternatives and similar repositories for 2D-Coordinate-System-for-ICL:
Users that are interested in 2D-Coordinate-System-for-ICL are comparing it to the libraries listed below
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆104Updated last week
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆67Updated last week
- [EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua…☆14Updated 4 months ago
- ☆13Updated 8 months ago
- The official code repository for PRMBench.☆68Updated last month
- This is a unified platform for implementing and evaluating test-time reasoning mechanisms in Large Language Models (LLMs).☆15Updated 2 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆72Updated 7 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆46Updated 4 months ago
- ☆65Updated last year
- ☆30Updated 5 months ago
- ☆43Updated 5 months ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆56Updated 3 months ago
- This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by g…☆32Updated 3 months ago
- GenRM-CoT: Data release for verification rationales☆53Updated 5 months ago
- Collections of RLxLM experiments using minimal codes☆12Updated last month
- FeatureAlignment = Alignment + Mechanistic Interpretability☆28Updated 3 weeks ago
- [ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"☆25Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 4 months ago
- ☆25Updated 10 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆57Updated 5 months ago
- ☆30Updated last week
- Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models☆14Updated last year
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆119Updated 6 months ago
- Exploring the Limitations of Large Language Models on Multi-Hop Queries☆24Updated 3 weeks ago
- ☆54Updated 5 months ago
- Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"☆28Updated last month
- [EMNLP 2024 Main] Official implementation of the paper "To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimoda…☆14Updated 3 months ago
- ☆37Updated last year
- Implementation code for ACL2024:Advancing Parameter Efficiency in Fine-tuning via Representation Editing☆13Updated 11 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆29Updated 4 months ago