IBM / larimarLinks
Code for ICML 2024 paper
☆35Updated 4 months ago
Alternatives and similar repositories for larimar
Users that are interested in larimar are comparing it to the libraries listed below
Sorting:
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆63Updated last year
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆120Updated last week
- ☆90Updated 3 months ago
- ☆112Updated last year
- ☆42Updated 3 months ago
- RL Scaling and Test-Time Scaling (ICML'25)☆113Updated last year
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆34Updated last year
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆81Updated last year
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIX- TURE OF THOUGHT REPRESENTATIONS FOR COST- EFFICIENT REA…☆29Updated last year
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆51Updated last year
- ☆108Updated last year
- Self-Questioning Language Models☆56Updated last month
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆40Updated 2 years ago
- Long Context Extension and Generalization in LLMs☆62Updated last year
- ☆74Updated last year
- Sotopia-RL: Reward Design for Social Intelligence☆46Updated last week
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆147Updated last year
- The repository contains code for Adaptive Data Optimization☆32Updated last year
- [COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models☆139Updated last month
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆91Updated last year
- The official implementation of Self-Exploring Language Models (SELM)☆63Updated last year
- ☆52Updated 11 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆47Updated 9 months ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆85Updated 8 months ago
- GenRM-CoT: Data release for verification rationales☆67Updated last year
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆46Updated 5 months ago
- Natural Language Reinforcement Learning☆101Updated 6 months ago
- Replicating O1 inference-time scaling laws☆92Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆148Updated last year
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)☆74Updated last year