madaan / llm-reasoning-tutorialLinks
Resources for few-shot reasoning tutorial
☆15Updated 2 years ago
Alternatives and similar repositories for llm-reasoning-tutorial
Users that are interested in llm-reasoning-tutorial are comparing it to the libraries listed below
Sorting:
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆103Updated last year
- Parameter-Efficient Fine-Tuning for Foundation Models☆110Updated 10 months ago
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning☆20Updated 4 months ago
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28Updated last year
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆101Updated last year
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆28Updated 5 months ago
- Time-R1: Framework and resources for endowing LLMs with comprehensive temporal reasoning (understanding, prediction, creative generation)…☆62Updated 7 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆57Updated 8 months ago
- ☆43Updated 5 months ago
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning☆53Updated last year
- [EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models☆84Updated last year
- Repository of paper "How Likely Do LLMs with CoT Mimic Human Reasoning?"☆23Updated 11 months ago
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs☆40Updated 2 years ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆50Updated last year
- Pytorch implementation of Tree Preference Optimization (TPO) (Accepted by ICLR'25)☆26Updated 9 months ago
- This repository is for our survey paper: "A Comprehensive Survey on Multimodal RAG: All Combinations of Modalities as Input and Output"☆44Updated 2 months ago
- [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆48Updated last month
- MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants☆42Updated 4 months ago
- "A Survey on Agent-as-a-Judge"☆87Updated 3 weeks ago
- A curated list of the latest advancements, papers, tools, and datasets for **Multimodal Retrieval-Augmented Generation (RAG)**. Multimoda…☆47Updated 2 months ago
- ☆70Updated 7 months ago
- The official repository for "Rongsheng Wang's Arxiv Template"☆55Updated 9 months ago
- A comprehensive paper list of Table-based Question Answering.☆36Updated 2 years ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆134Updated 10 months ago
- One-shot Entropy Minimization☆188Updated 7 months ago
- ☆25Updated 9 months ago
- Official code repo for NeurIPS 2025 Spotlight paper, "Debate or Vote: Which Yields Better Decisions in Multi-Agent LLMs?"☆45Updated 3 months ago
- ☆17Updated 2 months ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Updated 5 months ago
- ☆152Updated last year