IBM / larimar
Code for ICML 2024 paper
☆32 Updated last month
Alternatives and similar repositories for larimar
Users who are interested in larimar are comparing it to the repositories listed below.
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs ☆59 Updated last year
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples ☆109 Updated 3 months ago
- ☆108 Updated last year
- ☆77 Updated this week
- Interpretable Contrastive Monte Carlo Tree Search Reasoning ☆48 Updated 11 months ago
- Sotopia-RL: Reward Design for Social Intelligence ☆43 Updated 2 months ago
- Official GitHub repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024] ☆142 Updated last year
- [COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models ☆132 Updated 2 months ago
- Reinforcing General Reasoning without Verifiers ☆91 Updated 4 months ago
- ☆74 Updated last year
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024) ☆78 Updated last year
- ☆103 Updated last year
- RL Scaling and Test-Time Scaling (ICML'25) ☆111 Updated 9 months ago
- ☆50 Updated 8 months ago
- Long Context Extension and Generalization in LLMs ☆62 Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024] ☆146 Updated 11 months ago
- PASTA: Post-hoc Attention Steering for LLMs ☆126 Updated 11 months ago
- Directional Preference Alignment ☆57 Updated last year
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners ☆85 Updated 5 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆44 Updated 6 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆42 Updated last year
- Self-Alignment with Principle-Following Reward Models ☆169 Updated last month
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting ☆33 Updated last year
- Natural Language Reinforcement Learning ☆99 Updated 3 months ago
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M… ☆29 Updated last year
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasonin… ☆51 Updated last year
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment" ☆75 Updated 5 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models ☆65 Updated 8 months ago
- PyTorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24) ☆61 Updated last year
- The repository contains code for Adaptive Data Optimization ☆27 Updated 10 months ago