GraphPKU / Case_or_Rule
exploring whether LLMs perform case-based or rule-based reasoning
☆21Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for Case_or_Rule
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆72Updated 9 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆67Updated last month
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆46Updated 4 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆32Updated 9 months ago
- Structured Chemistry Reasoning with Large Language Models☆31Updated 6 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆96Updated last week
- [ACL 2024] The project of Symbol-LLM☆41Updated 4 months ago
- DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆55Updated 2 weeks ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models☆40Updated 10 months ago
- ☆18Updated 2 months ago
- Multi-Agent System for Science of Science☆27Updated this week
- Code for https://arxiv.org/abs/2401.17139 (NeurIPS 2024)☆17Updated this week
- ☆17Updated 3 months ago
- This the implementation of LeCo☆27Updated 3 months ago
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)☆65Updated 8 months ago
- [KDD 2024]this is project for training explicit graph reasoning large language models.☆37Updated 5 months ago
- Evaluating Mathematical Reasoning Beyond Accuracy☆37Updated 7 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆52Updated 2 months ago
- ☆24Updated last month
- Benchmarking Benchmark Leakage in Large Language Models☆44Updated 5 months ago
- ☆28Updated last week
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆61Updated 3 weeks ago
- ☆14Updated last month
- ☆103Updated 4 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆95Updated 2 months ago
- ☆21Updated 4 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆38Updated last month
- ☆31Updated 3 weeks ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆32Updated this week
- The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agen…☆21Updated 7 months ago