madaan / llm-reasoning-tutorialLinks
Resources for few-shot reasoning tutorial
☆15Updated 2 years ago
Alternatives and similar repositories for llm-reasoning-tutorial
Users that are interested in llm-reasoning-tutorial are comparing it to the libraries listed below
Sorting:
- Parameter-Efficient Fine-Tuning for Foundation Models☆110Updated 10 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆103Updated last year
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆28Updated 5 months ago
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28Updated last year
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning☆53Updated last year
- ☆17Updated 2 months ago
- "A Survey on Agent-as-a-Judge"☆87Updated 3 weeks ago
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning☆20Updated 4 months ago
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs☆40Updated 2 years ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆52Updated last year
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆101Updated last year
- [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆48Updated last month
- ☆40Updated last year
- A collection of AWESOME language modeling techniques on tabular data applications.☆32Updated last year
- ☆146Updated last year
- Pytorch implementation of Tree Preference Optimization (TPO) (Accepted by ICLR'25)☆26Updated 9 months ago
- MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants☆42Updated 4 months ago
- ☆64Updated 9 months ago
- ☆43Updated 5 months ago
- ☆29Updated last year
- Official code repo for NeurIPS 2025 Spotlight paper, "Debate or Vote: Which Yields Better Decisions in Multi-Agent LLMs?"☆45Updated 3 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆57Updated 8 months ago
- A Survey of Personalization: From RAG to Agent☆99Updated 6 months ago
- A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models☆27Updated last year
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Updated 4 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆50Updated last year
- ☆31Updated 6 months ago
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications☆67Updated last year
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St…☆24Updated last year
- Multimodal Graph Learning: how to encode multiple multimodal neighbors with their relations into LLMs☆67Updated last year