IBM / raven-large-language-modelsLinks
Code for I-RAVEN-X generation and experiments
☆17Updated last month
Alternatives and similar repositories for raven-large-language-models
Users that are interested in raven-large-language-models are comparing it to the libraries listed below
Sorting:
- ☆22Updated 9 months ago
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆15Updated 10 months ago
- ☆28Updated 9 months ago
- ☆29Updated last year
- AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is write code that solves each…☆66Updated this week
- AgentOS is a lightweight, single-file implementation that provides a robust foundation for building autonomous AI agents. It implements t…☆17Updated 3 months ago
- LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations☆25Updated 5 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆23Updated 8 months ago
- A sophisticated multi-agent system designed for real-time market analysis of HTX (formerly Huobi) exchange data. This swarm combines spec…☆11Updated 7 months ago
- We integrate discrete diffusion models with neurosymbolic predictors for scalable and calibrated learning and reasoning☆51Updated last month
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆53Updated 3 weeks ago
- Official repo of paper LM2☆46Updated 8 months ago
- The original Shared Recurrent Memory Transformer implementation☆32Updated 3 months ago
- The official implementation of Self-Exploring Language Models (SELM)☆64Updated last year
- ☆26Updated 7 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆65Updated 8 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows!☆79Updated 5 months ago
- ☆35Updated 5 months ago
- ☆33Updated 8 months ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆13Updated 7 months ago
- Bayes-Adaptive RL for LLM Reasoning☆40Updated 5 months ago
- minimal Energy-based transformer☆32Updated last week
- RLP: Reinforcement as a Pretraining Objective☆198Updated last month
- ☆33Updated 10 months ago
- ☆42Updated last year
- KV cache compression via sparse coding☆14Updated last week
- ☆20Updated 3 months ago
- Reinforcing General Reasoning without Verifiers☆91Updated 4 months ago
- ☆50Updated 5 months ago
- ☆15Updated last month