IBM / raven-large-language-modelsLinks
Code for I-RAVEN-X generation and experiments
☆15Updated 2 months ago
Alternatives and similar repositories for raven-large-language-models
Users that are interested in raven-large-language-models are comparing it to the libraries listed below
Sorting:
- ☆21Updated 5 months ago
- Forecastbench Datasets, updated nightly☆12Updated this week
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆14Updated 7 months ago
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆11Updated 4 months ago
- ☆10Updated 2 weeks ago
- LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations☆19Updated last month
- KV cache compression via sparse coding☆11Updated 2 months ago
- ☆12Updated 3 months ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14Updated last month
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆10Updated 5 months ago
- Repo for Anonymous purpose, pls don't distribute☆10Updated 9 months ago
- ☆20Updated 3 months ago
- Friday Agents. App: https://chat.toolstack.run/☆11Updated 7 months ago
- ☆19Updated 5 months ago
- ☆55Updated 3 weeks ago
- This is the official code repository for the paper "Language Agents Meet Causality -- Bridging LLMs and Causal World Models"☆18Updated 2 months ago
- A powerful, enterprise-grade multi-agent system for advanced radiological analysis, diagnosis, and treatment planning. This system levera…☆11Updated 2 weeks ago
- ⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.☆48Updated 2 weeks ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆19Updated 6 months ago
- 2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models (WWW 2025)☆10Updated 3 months ago
- The code implementation of Symbolic-MoE☆35Updated 4 months ago
- [ICML2025 Oral] LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models☆57Updated last month
- Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents☆13Updated 7 months ago
- How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆36Updated 3 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆60Updated 5 months ago
- A multimodal agent that can interact with its own PC in a multimodal manner.☆28Updated 3 weeks ago
- ☆33Updated 2 months ago
- AlgoTune is a benchmark made up of 155 math, physics, and computer science problems. The goal is write code that solves each problem, and…☆28Updated this week
- ☆23Updated last month
- ☆10Updated this week