linlu-qiu / lm-inductive-reasoning
☆33Updated last year
Alternatives and similar repositories for lm-inductive-reasoning
Users who are interested in lm-inductive-reasoning are comparing it to the libraries listed below
- ☆57Updated 5 months ago
- ☆34Updated 2 years ago
- Self-Supervised Alignment with Mutual Information☆21Updated last year
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization☆31Updated 9 months ago
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)☆16Updated 2 years ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆123Updated last year
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.☆150Updated last month
- Official code for paper Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation☆20Updated last year
- ☆103Updated last year
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆111Updated 2 years ago
- Supporting code for ReCEval paper☆30Updated last year
- Directional Preference Alignment☆57Updated last year
- Official code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le…☆75Updated last year
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging☆110Updated 2 years ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆37Updated 2 years ago
- ☆14Updated 3 months ago
- ☆103Updated last year
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…☆97Updated 4 years ago
- ☆27Updated 2 years ago
- ☆85Updated last year
- ☆112Updated 3 years ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆16Updated 9 months ago
- ☆164Updated 11 months ago
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…☆30Updated 2 years ago
- Code for Residual Energy-Based Models for Text Generation in PyTorch.☆25Updated 4 years ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆57Updated 2 years ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆78Updated last year
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆40Updated 5 months ago
- Domain-specific preference (DSP) data and customized RM fine-tuning.☆25Updated last year
- ☆29Updated last year