[ICLR 2026] GRAPE: Group Representational Position Encoding (https://arxiv.org/abs/2512.07805)
☆79Jan 27, 2026Updated last month
Alternatives and similar repositories for GRAPE
Users that are interested in GRAPE are comparing it to the libraries listed below
Sorting:
- This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value …☆25Dec 1, 2025Updated 3 months ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆19Mar 9, 2025Updated 11 months ago
- the open-source code of QAgent☆53Oct 14, 2025Updated 4 months ago
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆36Updated this week
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking☆35Jan 20, 2026Updated last month
- Minimal JAX implementation unifying Diffusion and Flow Matching algorithms as alternative strategies for transporting data distributions.☆63Dec 19, 2025Updated 2 months ago
- ☆43Jan 30, 2026Updated last month
- ☆21Jun 4, 2024Updated last year
- A microframework for creating command-line applications in Zig☆45Nov 1, 2025Updated 4 months ago
- ☆50Dec 11, 2025Updated 2 months ago
- Xmixers: A collection of SOTA efficient token/channel mixers☆28Sep 4, 2025Updated 5 months ago
- Official Implementation of NAF: Zero-Shot Feature Upsampling via Neighborhood Attention Filtering☆68Dec 1, 2025Updated 3 months ago
- intelligence layer between human goals and AI execution☆57Updated this week
- 2019 딥러닝-비전처리 홀로서기 특강에 사용된 Lecture Note 및 Code Repository입니다.☆12Sep 7, 2019Updated 6 years ago
- Minimal Claude Code alternative powered by MLX☆45Jan 11, 2026Updated last month
- 공학수학 강의노트☆19Feb 27, 2024Updated 2 years ago
- Official implementation for SSDD Single-Step Diffusion Decoder for Efficient Image Tokenization.☆55Nov 12, 2025Updated 3 months ago
- A central registry and HTTP interface for coordinating Model Context Protocol (MCP) servers.☆34Dec 29, 2024Updated last year
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆64Dec 10, 2025Updated 2 months ago
- ☆40Sep 24, 2025Updated 5 months ago
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆117Dec 17, 2025Updated 2 months ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆66Jan 13, 2026Updated last month
- ☆50Jun 16, 2025Updated 8 months ago
- ☆76Jan 8, 2026Updated last month
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- Qwen Multi Angle UI powered by fal.ai API☆84Jan 22, 2026Updated last month
- Terminal Velocity Matching☆67Feb 14, 2026Updated 2 weeks ago
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆29Sep 25, 2021Updated 4 years ago
- Fast modular code to create and train cutting edge LLMs☆68May 16, 2024Updated last year
- ☆51Oct 10, 2025Updated 4 months ago
- Awesome Triton Resources☆39Apr 27, 2025Updated 10 months ago
- mHC-lite: You Don’t Need 20 Sinkhorn-Knopp Iterations☆70Jan 12, 2026Updated last month
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆60Sep 15, 2025Updated 5 months ago
- documentation used in my projects☆16Updated this week
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- Jax Codebase for Evolutionary Strategies at the Hyperscale☆228Updated this week
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated last year
- Google ADK Training Hub: Build Production-Ready Google AI Agents in Days, Not Months 🚀 The only comprehensive Google ADK training with …☆76Feb 22, 2026Updated last week
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Jul 3, 2025Updated 7 months ago