3% Is All You Need: Breaking TurboQuant's Compression Limit via Spectral Structure
☆129Apr 7, 2026Updated 3 weeks ago
Alternatives and similar repositories for spectralquant
Users who are interested in spectralquant are comparing it to the libraries listed below.
- Let Claude Code and Codex control your browser☆30Aug 30, 2025Updated 7 months ago
- This repository provides a framework to serve LLM (Large Language Model) based applications such as chatbots.☆18Apr 20, 2023Updated 3 years ago
- Mobile app and server for Claude Code☆23Apr 16, 2025Updated last year
- Claude Multi-Agent Project Management Framework - AI-driven orchestration with LangGraph and OpenAI integration☆29Jul 23, 2025Updated 9 months ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 6 months ago
- ☆10Nov 12, 2024Updated last year
- The official guide to the process of designing custom keycaps for mechanical keyboards, as maintained by the members & mods of the Keycap…☆22Jun 9, 2021Updated 4 years ago
- ☆28Feb 11, 2026Updated 2 months ago
- VS Code/Cursor extension for automating Claude Code tasks with intelligent queuing, batch processing, and auto-resume.☆228Aug 21, 2025Updated 8 months ago
- Go implementation of the Gun distributed graph database☆11Feb 26, 2019Updated 7 years ago
- ☆22Jan 13, 2025Updated last year
- macOS status bar app that tracks and monitors your Claude Code AI usage costs in real time and compares them to human developer costs☆44Feb 15, 2026Updated 2 months ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆16Feb 4, 2025Updated last year
- Understanding deep networks and large models.☆27Jan 23, 2026Updated 3 months ago
- LMDB Adapter for gunDB☆14Dec 8, 2022Updated 3 years ago
- ☆149Updated this week
- LUMIN: Your data analysis companion that turns natural language questions into powerful insights through AI-driven visualizations and cle…☆17Nov 11, 2024Updated last year
- 🎈 A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT2 (~95M params). Fast, creative text generation tra…☆17Apr 17, 2026Updated last week
- Android Photo/Video Recording/Capture/Effects via OpenGL☆10Feb 21, 2021Updated 5 years ago
- A whisper repo for TPU☆11Jun 4, 2024Updated last year
- ☆54May 14, 2025Updated 11 months ago
- Chameleon: A Multiplier-Free Temporal Convolutional Network Accelerator for End-to-End Few-Shot and Continual Learning from Sequential Da…☆27Mar 5, 2026Updated last month
- Firecracker VM orchestration for Claude Code sessions☆26Mar 30, 2026Updated 3 weeks ago
- The code of SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models☆23Mar 25, 2026Updated last month
- GunDB HTTP/HTTPS Server and API☆19Feb 15, 2018Updated 8 years ago
- Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting☆14Dec 19, 2025Updated 4 months ago
- Local knowledge-base question answering for the Sophon BM1684X, based on Langchain and language models such as ChatGLM☆14Jun 5, 2024Updated last year
- ☆11Mar 2, 2024Updated 2 years ago
- Uses an FPGA board and SystemVerilog to design a controller, DMA, pipelined SIMD processor, and GEMM accelerator☆12Aug 26, 2023Updated 2 years ago
- EmbedDB is an ultra-lightweight vector database designed for rapid prototyping of semantic search and RAG applications. The entire implem…☆21Mar 24, 2025Updated last year
- Web-based annotation tool for media data. The easiest way to create your own media dataset.☆16May 12, 2023Updated 2 years ago
- [ICML 2021] "Double-Win Quant: Aggressively Winning Robustness of Quantized Deep Neural Networks via Random Precision Training and Inferen…☆16Feb 13, 2022Updated 4 years ago
- ☆16Nov 12, 2024Updated last year
- Model Quantization Benchmark☆19Apr 17, 2026Updated last week
- Structured pruning and bias visualization for Large Language Models. Tools for LLM optimization and fairness analysis.☆36Apr 19, 2026Updated last week
- A minimalist but optimized Python package for deduplication tasks leveraging RapidFuzz internally, enabling super-fast approximate duplic…☆18Apr 2, 2025Updated last year
- [CVPR 2025] Efficient Personalization of Quantized Diffusion Model without Backpropagation☆16Mar 31, 2025Updated last year
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆30Aug 4, 2024Updated last year
- Can Language Models Solve Olympiad Programming?☆124Jan 14, 2025Updated last year