SakanaAI/repo

RePo: Language Models with Context Re-Positioning

☆83

Alternatives and similar repositories for repo

Users who are interested in repo are comparing it to the libraries listed below.

SakanaAI / DroPE
Extending the Context of Pretrained LLMs by Dropping Their Positional Embedding
☆219 • Jan 12, 2026 • Updated 6 months ago
LeonLixyz / LCLM
latent context language models
☆72 • Jun 9, 2026 • Updated last month
cmu-llab / dpd
Implementation of the DPD architecture and related experiments for the ACL 2024 paper "Semisupervised Neural Proto-Language Reconstructio…
☆11 • Jul 22, 2024 • Updated 2 years ago
facebookresearch / llm_souping
Model souping for LLMs
☆73 • Nov 18, 2025 • Updated 8 months ago
Tencent-Hunyuan / HiLS-Attention
Official code for HiLS-Attention
☆128 • Updated this week
yurakuratov / hidden_capacity
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity (ACL 2025, oral)
☆35 • Jun 14, 2025 • Updated last year
test-time-training / e2e
Official JAX implementation of End-to-End Test-Time Training for Long Context
☆627 • Feb 15, 2026 • Updated 5 months ago
Rishit-dagli / Squeeze3D
Squeeze3D: Your 3D Generation Model is Secretly an Extreme Neural Compressor
☆23 • Jun 12, 2025 • Updated last year
bigai-nlco / CREAM
[NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding
☆22 • Oct 10, 2024 • Updated last year
SakanaAI / natural_niches
The code repository of the paper: Competition and Attraction Improve Model Fusion
☆170 • Aug 25, 2025 • Updated 11 months ago
chtmp223 / suri
Suri: Multi-constraint instruction following for long-form text generation [EMNLP’24]
☆27 • Oct 3, 2025 • Updated 9 months ago
dongwonjo / FastKV
[ACL Findings 2026] Official Implementation of "FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acc…
☆32 • Apr 14, 2026 • Updated 3 months ago
foreverlasting1202 / QuestA
☆22 • Jan 2, 2026 • Updated 6 months ago
Encyclomen / HGMem
Implementation of the paper "Improving Multi-step RAG with Hypergraph-based Memory for Long-context Complex Relational Modeling"
☆131 • Jan 22, 2026 • Updated 6 months ago
Chunjiang-Intelligence / DeepRWKV-Reasoning
A "Deep Think" implementation designed for RWKV.
☆26 • Dec 7, 2025 • Updated 7 months ago
YihongT / LLMSynthor
☆21 • Jul 3, 2025 • Updated last year
EsmaeilNarimissa / aws-sft-grpo-budget-llm-finetune
☆19 • May 17, 2025 • Updated last year
ghrua / NgramRes
☆23 • Nov 6, 2022 • Updated 3 years ago
ETH-DISCO / audio-atlas
☆15 • Feb 6, 2026 • Updated 5 months ago
GATECH-EIC / LaCache
[ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models
☆16 • Nov 4, 2025 • Updated 8 months ago
HanGuo97 / log-linear-attention
☆284 • Jun 6, 2025 • Updated last year
TIGER-AI-Lab / Hierarchical-Reasoner
Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26]
☆64 • Apr 11, 2026 • Updated 3 months ago
allenai / bolmo-core
Code for Bolmo: Byteifying the Next Generation of Language Models
☆136 • Jul 6, 2026 • Updated 3 weeks ago
taugroup / ThinkTank
Agentic Virtual Lab
☆20 • Nov 30, 2025 • Updated 7 months ago
NousResearch / nomos
☆195 • Dec 18, 2025 • Updated 7 months ago
S-Forouzandeh / MACLA-LLM-Agents-AAMAS-2026-Conference
Learning Hierarchical Procedural Memory for LLM Agents through Bayesian Selection and Contrastive Refinement
☆18 • Jan 16, 2026 • Updated 6 months ago
BunsenFeng / FactKB
Code for "FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge". EMNLP 2023.
☆20 • Dec 25, 2023 • Updated 2 years ago
KindXiaoming / newton-kepler
Understand what physics/algorithms do transformers learn internally when trained on planetary motion
☆47 • Feb 9, 2026 • Updated 5 months ago
LLMkvsys / rethink-kv-compression
☆24 • Mar 7, 2025 • Updated last year
uq-project / UQ
UQ: Assessing Language Models on Unsolved Questions
☆30 • Aug 26, 2025 • Updated 11 months ago
HVision-NKU / MaskDiffusion
☆12 • Dec 7, 2024 • Updated last year
aster2024 / SWIFT
Source code for SWIFT, an efficient reward model.
☆21 • Jan 13, 2026 • Updated 6 months ago
Yui010206 / MEXA
[EMNLP 2025 Findings] MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation
☆15 • Aug 22, 2025 • Updated 11 months ago
AI4Finance-Foundation / FinGPT-Earnings-Call-LLM-Agent
☆18 • Apr 1, 2024 • Updated 2 years ago
THU-KEG / PairJudgeRM
☆15 • Apr 14, 2025 • Updated last year
kenchan0226 / FineGrainedFact
Official implementation of the ACL Findings 2023 paper: Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarizatio…
☆15 • Jan 25, 2024 • Updated 2 years ago
Fuwn / suckless-agent-skills
🥤 Minimal agent skills grounded in the suckless philosophy
☆18 • Apr 6, 2026 • Updated 3 months ago
sail-sg / Stable-RL
Rethinking the Trust Region in LLM Reinforcement Learning
☆63 • Mar 2, 2026 • Updated 4 months ago
roychowdhuryresearch / gsw-memory
Code corresponding to Generative Semantic Workspaces - Long term Structured Memory for Large Language Models - AAAI 26 (Oral), ICML 26
☆22 • Jun 2, 2026 • Updated last month