SpectralQuant: Calibrated Eigenbasis Rotation and Water-Filled Bit Allocation for KV-Cache Compression
☆195 · May 15, 2026 · Updated last month
Alternatives and similar repositories for spectralquant
Users who are interested in spectralquant are comparing it to the libraries listed below.
- Let Claude Code and Codex control your browser ☆30 · Aug 30, 2025 · Updated 9 months ago
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML ☆59 · Dec 12, 2025 · Updated 6 months ago
- Claude Multi-Agent Project Management Framework - AI-driven orchestration with LangGraph and OpenAI integration ☆30 · Jul 23, 2025 · Updated 11 months ago
- ☆10 · Nov 12, 2024 · Updated last year
- A bunch of my first droids ☆255 · Nov 19, 2025 · Updated 7 months ago
- Language modeling with linear-cost context ☆118 · Sep 25, 2025 · Updated 9 months ago
- Voice synthesis library for Text-to-Speech applications (HTS Engine rewrite in Rust language) ☆12 · Updated this week
- 🚀 [ICLR '25] RocketEval: Efficient Automated LLM Evaluation via Grading Checklist ☆16 · Aug 21, 2025 · Updated 10 months ago
- ☆15 · May 12, 2025 · Updated last year
- VS Code/Cursor extension for automating Claude Code tasks with intelligent queuing, batch processing, and auto-resume. ☆235 · Aug 21, 2025 · Updated 10 months ago
- Go implementation of the Gun distributed graph database ☆11 · Feb 26, 2019 · Updated 7 years ago
- Play Chrome's Dinosaur Game with Reinforcement Learning ☆13 · Jun 25, 2023 · Updated 3 years ago
- An attribution library for LLMs ☆46 · Sep 17, 2024 · Updated last year
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert… ☆16 · Feb 4, 2025 · Updated last year
- [NeurIPS2024] "Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design", Ruisi Cai, Yeonju Ro, Geon-Woo … ☆16 · Dec 16, 2024 · Updated last year
- Understanding deep networks and large models. ☆29 · Jan 23, 2026 · Updated 5 months ago
- LMDB Adapter for gunDB ☆14 · Dec 8, 2022 · Updated 3 years ago
- Open-source web scraping API. Turn any website into clean markdown or structured JSON. Anti-detect browser, proxy auto-selection, self-ho… ☆174 · Jun 16, 2026 · Updated last week
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours) ☆10 · Jun 22, 2026 · Updated last week
- ☆21 · Jun 12, 2024 · Updated 2 years ago
- Official Pytorch Code of the Paper "WorldWander: Bridging Egocentric and Exocentric Worlds in Video Generation" ☆36 · Jun 19, 2026 · Updated last week
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models. ☆16 · Apr 8, 2025 · Updated last year
- LUMIN: Your data analysis companion that turns natural language questions into powerful insights through AI-driven visualizations and cle… ☆19 · Nov 11, 2024 · Updated last year
- Dynamic FastMCP extends the Model Context Protocol Python server with context-aware tools that adapt their behavior and descriptions base… ☆45 · Aug 26, 2025 · Updated 10 months ago
- Android Photo/Video Recording/Capture/Effects via OpenGL ☆10 · Feb 21, 2021 · Updated 5 years ago
- PAct: Part-Decomposed Single-View Articulated Object Generation ☆56 · May 12, 2026 · Updated last month
- A whisper repo for TPU ☆11 · Jun 4, 2024 · Updated 2 years ago
- KAF : Kolmogorov-Arnold Fourier Networks ☆22 · Feb 19, 2025 · Updated last year
- Coding-agent VM orchestrator: runs coding agents in isolated VMs — Firecracker micro-VMs on Linux (with ZFS-based audit-trail snapshots) … ☆34 · Jun 12, 2026 · Updated 2 weeks ago
- Chameleon: A Multiplier-Free Temporal Convolutional Network Accelerator for End-to-End Few-Shot and Continual Learning from Sequential Da… ☆27 · Mar 5, 2026 · Updated 3 months ago
- The code of SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models ☆23 · Mar 25, 2026 · Updated 3 months ago
- ☆11 · Mar 2, 2024 · Updated 2 years ago
- Local knowledge-base question answering for sophon bm1684x, built on Langchain and language models such as ChatGLM ☆13 · Jun 5, 2024 · Updated 2 years ago
- ☆310 · Apr 15, 2026 · Updated 2 months ago
- ☆17 · May 27, 2019 · Updated 7 years ago
- [ECCV 2024] CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs ☆19 · Jul 2, 2024 · Updated last year
- EmbedDB is an ultra-lightweight vector database designed for rapid prototyping of semantic search and RAG applications. The entire implem… ☆21 · Mar 24, 2025 · Updated last year
- Web-based annotation tool for media data. The easiest way to create your own media dataset. ☆16 · May 12, 2023 · Updated 3 years ago
- [ICML 2021] "Double-Win Quant: Aggressively Winning Robustness of Quantized Deep Neural Networks via Random Precision Training and Inferen… ☆16 · Feb 13, 2022 · Updated 4 years ago