IBM / raven-large-language-modelsLinks
Code for I-RAVEN-X generation and experiments
☆19Updated 4 months ago
Alternatives and similar repositories for raven-large-language-models
Users that are interested in raven-large-language-models are comparing it to the libraries listed below
Sorting:
- ☆23Updated last year
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆19Updated last year
- KV cache compression via sparse coding☆17Updated 3 months ago
- Machine Learning from Human Preferences☆26Updated last week
- ☆30Updated last year
- LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations☆25Updated 8 months ago
- ☆46Updated 7 months ago
- ☆29Updated 3 months ago
- When Reasoning Meets Its Laws☆35Updated last month
- ☆36Updated 11 months ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆41Updated 2 years ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆65Updated 2 weeks ago
- ☆27Updated last year
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆39Updated this week
- Official repo of paper LM2☆46Updated last year
- ☆33Updated last year
- Resa: Transparent Reasoning Models via SAEs☆47Updated 4 months ago
- BigOBench assesses the capacity of Large Language Models (LLMs) to comprehend time-space computational complexity of input or generated c…☆40Updated 9 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆46Updated 6 months ago
- ☆18Updated 7 months ago
- ☆19Updated 6 months ago
- Official Implementation of "Maximum Likelihood Reinforcement Learning (MaxRL)"☆67Updated last week
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Updated last year
- ☆144Updated 9 months ago
- Official Code Release for "Training a Generally Curious Agent"☆44Updated 8 months ago
- ☆35Updated 8 months ago
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆47Updated 6 months ago
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆65Updated last year
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆90Updated 10 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆88Updated 8 months ago