Tebmer / Rereading-LLM-ReasoningLinks
EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for improving reasoning.
☆27Updated 9 months ago
Alternatives and similar repositories for Rereading-LLM-Reasoning
Users that are interested in Rereading-LLM-Reasoning are comparing it to the libraries listed below
Sorting:
- ☆23Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆59Updated last year
- ☆54Updated 10 months ago
- ☆50Updated last year
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆98Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 8 months ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated 2 years ago
- ☆23Updated last year
- ☆67Updated 6 months ago
- Verifiers for LLM Reinforcement Learning☆74Updated 5 months ago
- ☆40Updated 9 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- ☆29Updated last month
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆64Updated 9 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆55Updated last year
- Exploring limitations of LLM-as-a-judge☆19Updated last year
- [NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data? https://aclanthology.org/2024.naa…☆55Updated 2 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆64Updated 2 years ago
- The first dense retrieval model that can be prompted like an LM☆89Updated 4 months ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Updated 9 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- ☆20Updated 5 months ago
- ☆28Updated 6 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆37Updated last year
- ☆57Updated last year
- ☆127Updated last year
- ☆47Updated last year
- ☆16Updated 5 months ago