maitrix-org / llm-reasoners
A library for advanced large language model reasoning
☆1,124Updated 2 weeks ago
Related projects:
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆689Updated last week
- ☆1,194Updated this week
- Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...☆1,368Updated this week
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.☆561Updated 8 months ago
- [ACL 2023] Reasoning with Language Model Prompting: A Survey☆860Updated 2 months ago
- List of language agents based on the paper "Cognitive Architectures for Language Agents"☆701Updated 3 weeks ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆858Updated 4 months ago
- Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓☆1,493Updated this week
- YaRN: Efficient Context Window Extension of Large Language Models☆1,306Updated 5 months ago
- This repository contains a collection of papers and resources on Reasoning in Large Language Models.☆523Updated 10 months ago
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.☆1,436Updated this week
- Representation Engineering: A Top-Down Approach to AI Transparency☆691Updated last month
- Reading list for instruction tuning, a trend that started with Natural-Instructions (ACL 2022), FLAN (ICLR 2022), and T0 (ICLR 2022).☆748Updated last year
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"☆619Updated last month
- Codebase for Merging Language Models (ICML 2024)☆745Updated 4 months ago
- [ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.☆1,764Updated this week
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆758Updated 2 months ago
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models☆1,352Updated 6 months ago
- Forward-Looking Active REtrieval-augmented generation (FLARE)☆573Updated 9 months ago
- Benchmarking large language models' complex reasoning ability with chain-of-thought prompting☆2,513Updated last month
- Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"☆1,041Updated 6 months ago
- The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.☆679Updated 4 months ago
- A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)☆1,063Updated 8 months ago
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…☆1,747Updated 3 months ago
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model☆436Updated 3 weeks ago
- A repo listing papers related to LLM-based agents☆974Updated last month
- 800,000 step-level correctness labels on LLM solutions to MATH problems☆1,413Updated last year
- Code for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".☆1,077Updated 8 months ago
- Reference implementation for DPO (Direct Preference Optimization)☆2,024Updated last month
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆2,115Updated 3 weeks ago