madaan / llm-reasoning-tutorialLinks
Resources for few-shot reasoning tutorial
☆15Updated last year
Alternatives and similar repositories for llm-reasoning-tutorial
Users that are interested in llm-reasoning-tutorial are comparing it to the libraries listed below
Sorting:
- Parameter-Efficient Fine-Tuning for Foundation Models☆88Updated 5 months ago
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆24Updated 2 weeks ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆84Updated 9 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆95Updated 8 months ago
- [NAACL 25 main] Awesome LLM Causal Reasoning is a collection of LLM-based casual reasoning works, including papers, codes and datasets.☆73Updated 6 months ago
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆27Updated last year
- A Sober Look at Language Model Reasoning☆81Updated 2 months ago
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning☆19Updated 2 months ago
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning☆48Updated last year
- A Survey of Personalization: From RAG to Agent☆63Updated 3 weeks ago
- A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models☆26Updated 9 months ago
- ☆53Updated 6 months ago
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications☆62Updated 9 months ago
- All about large language models☆51Updated last year
- ☆120Updated 5 months ago
- A collection of AWESOME language modeling techniques on tabular data applications.☆32Updated 10 months ago
- ☆50Updated 5 months ago
- Time-R1: Framework and resources for endowing LLMs with comprehensive temporal reasoning (understanding, prediction, creative generation)…☆49Updated 2 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆127Updated 5 months ago
- ☆37Updated last year
- ☆142Updated last year
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆47Updated 10 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆52Updated 3 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆77Updated 5 months ago
- Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context☆35Updated last year
- Code implementation of synthetic continued pretraining☆126Updated 7 months ago
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs☆39Updated last year
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆71Updated 2 months ago
- ☆29Updated 9 months ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆40Updated last year