TriAttention — Efficient long reasoning with trigonometric KV cache compression. Enables OpenClaw local deployment on memory-constrained GPUs.
☆289Apr 8, 2026Updated this week
Alternatives and similar repositories for triattention
Users that are interested in triattention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI.☆97Oct 23, 2025Updated 5 months ago
- super-resolution; post-training quantization; model compression☆14Nov 10, 2023Updated 2 years ago
- ☆30Jun 30, 2025Updated 9 months ago
- Minute-long video generation at 24FPS.☆61Mar 28, 2026Updated 2 weeks ago
- ☆49May 20, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- RealTime Canvas powered by fal.ai klein realtime api☆34Feb 18, 2026Updated last month
- Official Implementation of FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration☆30Updated this week
- Official Implementation for NorMuon paper☆65Mar 11, 2026Updated last month
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…☆28Feb 17, 2025Updated last year
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 6 months ago
- A collection of specialized agent skills for AI infrastructure development, enabling Claude Code to write, optimize, and debug high-perfo…☆106Feb 2, 2026Updated 2 months ago
- pruning vision models in torch☆17Dec 5, 2025Updated 4 months ago
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆64Mar 5, 2026Updated last month
- Denoising Diffusion Step-aware Models (ICLR2024)☆62Feb 6, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [COLM 2025] Official PyTorch implementation of "Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models"☆73Jul 8, 2025Updated 9 months ago
- ☆13Nov 20, 2023Updated 2 years ago
- [ICLR 2026] ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference☆186Updated this week
- Long-term Research Assistants with Self-Scheduling☆53Mar 22, 2026Updated 3 weeks ago
- a simple API to use CUPTI☆10Aug 19, 2025Updated 7 months ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆14Apr 29, 2025Updated 11 months ago
- ☆10Jan 19, 2018Updated 8 years ago
- Standalone repo for our Atropos integration with Thinking Machines Tinker API (https://thinkingmachines.ai/tinker/)☆31Mar 22, 2026Updated 3 weeks ago
- LEMMA: Logical Engine for Multi-domain Mathematical Analysis☆28Feb 14, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆12Dec 1, 2023Updated 2 years ago
- Collection of UIs for ComfyUI☆61Mar 30, 2026Updated last week
- ☆11Apr 5, 2023Updated 3 years ago
- REAP expert pruning for MoE LLMs on Apple Silicon via MLX☆53Mar 16, 2026Updated 3 weeks ago
- The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆110Oct 11, 2025Updated 6 months ago
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆27Jun 16, 2025Updated 9 months ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 5 months ago
- [ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter☆162Feb 27, 2026Updated last month
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Image classifier to help TensorFlow Lite C++ API usage with Bazel.☆12Oct 12, 2021Updated 4 years ago
- A computer vision project using YOLO-based object detection models to classify and analyze student behaviors in classroom settings. This …☆17Feb 12, 2025Updated last year
- Official code repository for the paper "Rethinking Model Prototyping through the MedMNIST+ Dataset Collection" @ Scientific Reports☆13Mar 5, 2025Updated last year
- SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro…☆11Jul 9, 2025Updated 9 months ago
- [CVPR 2025] QuartDepth☆17Mar 24, 2025Updated last year
- Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted!☆18Jul 20, 2023Updated 2 years ago
- ☆15Updated this week