LINs-lab / ELICITLinks
[ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability
☆13Updated 10 months ago
Alternatives and similar repositories for ELICIT
Users that are interested in ELICIT are comparing it to the libraries listed below
Sorting:
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last week
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆47Updated 8 months ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆36Updated 11 months ago
- ☆23Updated last year
- ☆17Updated 6 months ago
- ☆52Updated 11 months ago
- ☆25Updated 9 months ago
- ☆43Updated 5 months ago
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIX- TURE OF THOUGHT REPRESENTATIONS FOR COST- EFFICIENT REA…☆29Updated last year
- A Sober Look at Language Model Reasoning☆92Updated 2 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆87Updated 10 months ago
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆75Updated 7 months ago
- ☆45Updated last month
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆18Updated last year
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆34Updated last year
- ArcherCodeR is an open-source initiative enhancing code reasoning in large language models through scalable, rule-governed reinforcement …☆44Updated 6 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆149Updated 4 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆134Updated 10 months ago
- ☆145Updated 4 months ago
- Code for "Variational Reasoning for Language Models"☆56Updated 4 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆57Updated 8 months ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆62Updated 7 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆120Updated 9 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆101Updated last year
- Process Reward Models That Think☆78Updated 2 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆28Updated 11 months ago
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"☆21Updated 3 months ago
- Exploration of automated dataset selection approaches at large scales.☆52Updated 11 months ago
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision☆18Updated 10 months ago
- Repository of paper "How Likely Do LLMs with CoT Mimic Human Reasoning?"☆23Updated 11 months ago